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EFFECTS OF CHLORPROMAZINE ON LEARNING AND 
RELATED PROCESSES IN EMOTIONALLY 
DISTURBED CHILDREN * 


MALCOLM M. HELPER, R. C. WILCOTT,? ann SOL L. GARFIELD 


Nebraska Psychiatric Institute 


39 children in a psychiatric hospital were tested on 2 types of learning tasks 
and several performance tasks before, during, and aiter a 4-week period on 
chlorpromazine or placebo, the types of medication being assigned randomly 
in a double-blind design. Medication was controlled according to clinical 
criteria. Paired-associate learning was less efficient under chlorpromazine, 
especially on later learning trials and among initially slower learning Ss. 
Serial learning and tapping rate showed less consistent trends. Porteus Maze 
Mental Age scores declined under chlorpromazine, while Q scores on the same 
test tended to improve. No drug effect was noted on remote or immediate 
memory. The results were tentatively interpreted in terms of impairment of 


attention span by chlorpromazine. 


The purpose of this study was to determine 
whether the tranquilizer, chlorpromazine, 
changes the ability of emotionally disturbed 
children to learn and retain new associations, 
and if possible, to determine what the basis 
of any such changes might be. No published 
research on this specific problem could be 
found, although the widespread use of the 
drug with children and the importance of 
learning during childhood make it a problem 
of considerable importance. 

Previous studies of the effects of chlor- 
promazine on cognitive processes have used 
adult subjects almost exclusively, and have 
Produced somewhat inconsistent results. 
Chlorpromazine has been found to impair 
retention of verbal associations in adult schizo- 
Phrenics (Vestre, 1961) and to have no effect 
On a serial learning task (Whitehead & 
Thune, 1958). Gilgash (1961) reported im- 
provement in IQ scores in adult catatonics 
Ollowing chlorpromazine, while Gibbs, Wil- 

ens, and Lauterbach (1957) found greater 
z score improvement in adult patients 
Teated with placebo than in those treated 
ie chlorpromazine. Two reports (Porteus, 
957; Porteus & Barclay, 1957) indicated 
Significant declines in Porteus Maze perform- 
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ance in adult patients treated with chlorpro- 
mazine, while a third (Judson & MacCasland, 
1960) failed to find a significant effect. 

The experimental data on chlorpromazine, 
while inconclusive, raise the possibility of 
negative effects on learning and performance 
in children. However, clinical study of the 
effects of chlorpromazine in children has 
led to incidental reports of improved learning 
of school subjects (Freed & Peifer, 1956). 
In addition, chlorpromazine has been cited 
as a useful adjunct in psychotherapy with 
children, apparently with the belief that the 
drug facilitates the desired change in behavior 
(Freed, 1957). 

This paper reports the results of a con- 
trolled, double-blind study of the effects of 
chlorpromazine on two learning tasks in 
children hospitalized for emotional disturb- 
ance. Effects of the drug on certain perform- 
ance tests were also studied in the hope that 
they would help clarify any learning changes 
that were found. 


METHOD 


Design of the Experiment 


The experiment was divided into three phases: 
A Predrug phase, an On-drug (chlorpromazine or 
placebo) phase, and a Postdrug phase. Each child 
was assessed on each experimental measure at 
the end of each phase. Four-week intervals 
separated the successive assessments, the subjects 
being tested just prior to drug or placebo, again 
at the end of their fourth week on medication, and 
finally, at the end of the fourth week following 
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termination of medication. At least a week prior 
to the frst assessment, the children were given a 
kind of dry run of the assessment procedures to 
familiarize them with the experimenters and the 
settings in which the assessments were made. 

After being selected for participation in the 
study (see below), children were assigned randomly 
to the drug or placebo condition by one of us 
(RCW) who had no clinical contact with the 
children and who did not participate in making 
any of the behavioral measurements. Assignments 
were made in such a way that the two groups were 
closely matched for chronological age. Medication 
was administered and controlled by a psychiatrist 
who did not otherwise participate in the project or 
in the administration of the children’s activities.3 

The procedure for administration of the drug 
was the same as that normally used in clinical 
practice. Dosage level was adjusted individually for 
each subject with the intitial dosage level being 
determined by the weight and age of the subject. 
Dosage level was then increased or decreased as 
indicated by clinical changes such as drowsiness, 
decreased motor activity, or other signs of extra- 
pyramidal involvement. The resultant dosage levels 
ranged from 75 milligrams to 450 milligrams per 
day. The dosage level was divided into three equal 
amounts and administered three times a day in 
liquid form. Placebo dosage level was varied in a 
similar way. All subjects were examined each week 


for evidence of drug sensitivity, endocrine disturb- 
ance, or jaundice, 


Subjects 


All children admitted to the Children’s In- 
patient Service of the Nebraska Psychiatric In- 
stitute between June 1958 and June 1961, were 
considered for the project. The subjects were 
excluded if they fell into one of the following 
categories: (a) Under 6 or over 13 years of age; 
(b) under 6 years of mental age; (c) too dis- 
turbed emotionally to cooperate with testing pro- 
cedures; (d) judged by the seni 
have symptoms which counterind: 


In all, 39 children provided data for the study, 
19 drug subjects and 20 placebo subjects. Both 
groups averaged 115 months i 


&roup ranged from 86 months 


tained 16 boys and 4 girls. The 
for the drug group were as 
reaction of childhood, 6; psyc! 


8 The authors wis! 
Robert B. Mufily, 
tion of the experim 


h to express their gratitude to 


for supervising the administra- 
ental medications. 


5; schizophrenic reaction, childhood type, ‘i 
personality trait disturbance, 2; chronic brain E, 
drome with psychosis, 1. Diagnoses for the pee 
group were: Adjustment reaction of childho oy 
10; psychoneurotic disorders, 7; schizophrenic a 
action, childhood type, 2; personality trait distu 
ance, 1. Although the drug group contains moO 
children diagnosed as psychotic, the two grout 
initially were not significantly different on = 
of 18 behavioral scales rated by ward pone 
These scales, which are being reported on esewiod 
(Garfield, Helper, Wilcott, & Mufily, 1962), provi 
a reliable estimate of severity of disturbance. 
view of the known unreliability of diagnosis, at 
two groups can probably be considered to be abo 
equally disturbed, 


= 


Learning Tasks 


These tasks had to be selected and designed É, 
secure cooperation from emotionally disturb 
children, as well as to yield relevant informatio” 
For this reason, new learning tasks were devis i 
which were thought to have more inherent g, 
peal to children than those typically studied ” 
the psychological laboratory, Similarly, more leew 
was allowed the experimenter in obtaining 3” 
maintaining rapport than would be necessary ” 
dealing with normal children, h 

Paired-associate learning. This task required th 
subject to associate a picture of a child with { 
name. The pictures, obtained from magazine illu“ 
trations, were reproduced photographically in 
standard black and white format and presented ° 
cards with the associated name printed below t 
picture. The printed name was covered by 
folding flap which could be turned down ! 
present the picture and the name together. T 


names were chosen from a list of Popular first nam? 
in order to minimi 


eight-item forms of 
for each phase of 
first phase 


z 


the task were developed, 0” 


names wa 
Pictures of g 
; and during the i" 


phase, Pictures of boys and differe” 
boys’ names were used. 


task, the experiment 
remember the names ‘, 


he did not know. Ti 
the subject’s respo™, 
were presented 19, g 
andom order on succes 
tals. T warded at the end of et 
trial with an “M & M” candy for each cort? g 


response during that trial. Items were elimina! | 
from the series after two successive correct ant h 
pations. This was done in order to equalize level, f 
learning, since retention was to be tested du y 
the subsequent phase of the experiment, Lear” 


7 The pictures 
different prescheduled r: 
trials. The child was re 
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trials were continued until all items were eliminated 
or the subject refused further cooperation. 

Serial learning. Ten 4X6 inch cards con- 
taining line drawing of familiar objects were pre- 
sented manually in a constant serial order with 
instructions to “guess which picture comes next.” 
The correct response was called to the child’s 
attention by the experimenter whenever the child 
made an error. The series was repeated until the 
child had reached a criterion of two perfect trials 
or until the child refused further cooperation. A 
reward of one small piece of candy (M & M) was 
Slven at the end of each trial for each new cor- 
rect response during that trial. Three different sets 
of the drawings were used, a different set at each 
Phase of the experiment. 


Retention Measures 


Retention of learning tasks. During the second 


(On-drug) assessment, the child relearned the 
Series and paired-associate tasks that he had 
originally learned 4 weeks earlier during the 


Predrug assessment. This relearning was done prior 
to the administration of the new forms of the two 
types of tasks. Similarly, on Postdrug assessment, 
the child relearned the serial and paired-associate 
materials that he had originally learned during the 
On-drug assessment. At this last assessment, the 
child was not asked to relearn materials originally 
learned during the Predrug assessment. 

Digit span. Immediate memory for digit series 
forward and backward was tested in the usual 
way, utilizing digit series from the WISC, the 
WAIS, and the Mental Examiner’ Handbook 
(Wells & Ruesch, 1945) for successive assessments. 


Performance Measures 


Tapping rate. The subjects were asked to tap 
a stylus as rapidly as possible on a metal plate 
for three 15-second periods. The taps were counted 
electrically, 

Dotting O’s. The subjects were asked to place 
g dot in the center of as many O’s on a mimeo- 
Taphed sheet as possible during a 1-minute period. 
lean requires somewhat finer coordination and 

po mination than tapping rate. 

ms Maze Test. Two forms of this test were 
S i The Vineland Revision was administered 
the fee Predrug and Postdrug assessments, while 

Xtension Series was used for the On-drug 
assessment, 
Sua first two tasks were included in order to 
cee drug effects upon such nonlearning factors 
ese, een and coordination. The Porteus Maze 
Beer Was included because performance on it has 
adult shown to suffer following chlorpromazine in 

schizophrenics (Porteus, 1957; Porteus & 

arclay, 1957). 


Administration of Tasks 


= Tesearch assistant administered the learning 
S, the tapping rate test, and the dotting test. 


Relearning of old material always preceded learn- 
ing of new material, occurring a day earlier. Within 
any one testing session, learning or relearning of 
the serial task preceded the corresponding operation 
on the paired-associate task. Two research assistants 
were employed over the course of the 3-year 
period of data collection, each testing equal numbers 
of drug and control subjects. The Porteus Maze 
Test and Digit Span were administered by the first 
author in the presence of the third author, who 
was involved in making ratings for other aspects of 
the study. 


RESULTS 
General Form of Analysis 


Changes between the successive assess- 
ments were tabulated separately for the two 
groups for each measure. Effects of chlor- 
promazine administration were then studied 
by comparing the Predrug to On-drug 
changes in the experimental group with those 
in the control group. Since there was wide 
variation among the subjects in initial level 
on most of the variables, and since subsequent 
changes appeared to be correlated with initial 
level, change scores for each variable were 
entered into an analysis of variance in which 
the subjects were classified according to initial 
level for that variable, as well as according 
to kind of medication. For any given vari- 
able, some subjects were lost in the post hoc 
matching of subgroups on initial level. For 
this reason, analyses were carried out on 
each variable ignoring the initial level classi- 
fication and including all subjects providing 
relevant data. Only one result changed in 
significance level under this more inclusive 
form of analysis (see results for Porteus 
Qualitative score). 

Changes in the two groups following the 
termination of medication were also com- 
pared, following the pattern described above. 
Predrug values were again used to make the 
initial level classifications. Three subjects in 
the control group were discharged prior to the 
third assessment, reducing the number avail- 
able for comparisons of Postdrug changes. 


Learning and Retention Measures 
Paired-associate learning. Only the first 
eight trials of each paired-associate task were 
included in these analyses, since the number 
of cooperative subjects dropped after this 
point in both the Predrug and On-drug 
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TABLE 1 
VARIANCE ANALYSES OF CHANGES IN NUMBER CORRECT ON PAIRED-ASSOCIATE LEARNING 


Predrug to On-drug change 


On-drug to Postdrug change 


Source df MS F df MS F 

Drug (D) 1 62.68 4.48* 1 30.99 
Level (L) 1 56 1 33.01 
DXL 1 25.84 1 8.98 
Between subjects error 32 Ee i E- 

i k (T 3 J $ 
Tx n K 3 15.63 asm 3 8.60 2.86* 
TXL 3 1.00 3 3.76 
TXDXL 3 16.69 6.68** 3 1.80 
Within subjects error 96 2.50 84 3.00 

Total 143 127 

E 


assessments. On the Predrug assessment, 20 
of the 39 subjects had reached the criterion 
as of the eighth trial, and the average number 
of correct responses on the eighth trial was 
6.7, out of a possible 8. These data indicate 
that a substantial amount of learning had 
occurred by the eighth trial. 

The eight: trials were divided into four 
blocks of two trials each. This was done so 
that drug effects upon the course of learning 
could be studied. Changes between the Pre- 
drug and On-drug tasks were assessed by 
subtracting the number correct on each Pre- 
drug block from that in the corresponding 
On-drug block. On-drug to Postdrug changes 
were tabulated analogously, after subtract- 
ing the former from the latter. 

In the analyses of variance performed on 
these two sets of change scores, the following 
effects were studied: Drug (chlorpromazine 
or placebo); initial learning level (the sub- 
jects were divided at the grand median on 
total number of errors on the Predrug task); 
and trials (four two-trial blocks). These 
analyses are summarized in Table 1. Thirty- 
six subjects, divided into four subgroups of 
9 each, were utilized in the analysis of 
Predrug to On-drug changes. Only 28 subjects 
could be used in the analysis of On-drug to 
Postdrug changes. Three control subjects 
were discharged from the hospital before the 
final assessment, and only 14 of the remaining 
16 controls could be matched with experi- 
mental subjects on Predrug learning scores, 


It can be seen that treatment (Drug), the 
Drug by Trial interaction, and the triple 
interaction all represent significant effects 1 
the Predrug to On-drug analysis. The changes 
in number correct on the four trial blocks 
are plotted separately for the four subgroup> 
in Figure 1. These changes seem to permit 
a fairly simple interpretation: Chlorpromé” 
zine results in a decrement in correct 1 
Sponses, especially on later learning trials) 
and most especially in subjects who wer? 
slow learners prior to its administration. Tb? 
analogous analysis of Postdrug change scores: 
shown in the right half of Table 1, resulte 
in a significant Drug by Trial interactio™ 
Examination of the relevant mean Postdrué 
changes, plotted in Figure 2, points to 2? 
interpretation which partially parallels that 
above—upon termination of treatment, th? 
chlorpromazine group showed a large 1% 
covery in performance on the later learnin? 
trials, especially as compared with the co” 
trol group. Taken together, the analys 
point to a decline under chlorpromazine ° 
performance on later trials in the paired-a% 
sociate learning task, a loss from which sig 
nificant recovery was made within 
weeks after termination of the drug. ” 
addition, there is some suggestion that slow’! 
learning subjects are affected more P 
chlorpromazine, t 

Serial learning. On the Predrug assessme? A 
15 of the 39 subjects had reached t” 
criterion by the eighth trial of this task, a” 
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m oe number of correct responses on this 
es 7.9 out of a possible 9. These data 
S ~ the serial task was roughly 
a le in difficulty level to the paired- 
p the 2 task. Changes in number correct 
tir tw wt tasks were also tabulated for 
i f mal blocks. Three-way analyses of 
Onde, were performed on the Predrug to 
Bastar E changes and on the On-drug to 
ies changes, just as with the paired- 
mA “ne task. Initial level classifications 
tekt ie on the basis of total number cor- 
serial ta 4 Boat eight trials on the Predrug 
à Pom » No significant effects involving 
Ee azine emerged from the Predrug to 
Movis poi However, two interactions 
in the i a of medication were significant 
EA en: ysis of Postdrug changes. Because 
medi er complex changes upon removal 
meani cation would seem to have little 
ng in the absence of any significant 


i 


of 


Chan A Re 
Rats with initial drug administration, 
detail interactions will not be reported in 


5 


Retention of learned materials. One form 
of each learning task was originally learned 
prior to medication and relearned 4 weeks 
later under drug or placebo. The second form 
of each task was learned originally under 
drug or placebo and relearned 4 weeks 
later after termination of medication. Two 
retention scores were derived and were used 
with each form of each type of task: (a) A 
recall score consisting simply of the number 
of correct responses on the first relearning 
trial; and (b) a percentage savings score, 
computed by subtracting the number of 
errors on relearning from the number of 
errors on original learning and dividing the 
remainder by the latter term. Separate 
analyses of variance were performed on the 
two retention scores for each form of each 
task, subcategorizing subjects on the basis of 
the Predrug learning of the task involved. 
No significant effect on either retention score 
for either type of task was associated with 
either the introduction removal of 
chlorpromazine. 


or 
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Fic. 1. Predrug to On-drug changes i 
four two-trial blocks of paired-associate 


for initially fast and slow learning subjec 
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learning. (Changes are shown separately 
ts within each treatment group.) 
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Fic. 2. On-drug to Postdrug changes in 


mean number of correct responses on 


four two-trial blocks of paired-associate learning. 


Digit Span. No significant difference be- 
tween experimental and control groups was 
found on Digit Span, nor was any tendency 
toward such a difference discernible. 


Performance Measures 


Tapping rate. Each set of change scores 
for (Predrug to On-drug and On-drug_ to 
Postdrug) was entered into a three-way 
analysis of variance, utilizing Drug, Trials, 
and Initial Level as main classifications. 
Initial Level was determined from the total 
tapping score on the Predrug assessment. 
Thirty-six subjects could be included in the 
first analysis, but only 28 in the second, 
because of the Postdrug loss of subjects. 

No significant effects involving medication 
appeared in the analysis of Predrug to On- 
drug changes. The one result to reach statis- 
tical significance in the analysis of Postdrug 
changes, an interaction between Trial and 
Drug, would seem to be of little theoretical 
or practical import in the absence of any 


measurable effect of the initial chlorpromazine 
treatment. 

Dotting O’s. The other performance task; 
Dotting O’s, revealed no significant. effects 
when analyzed in the same fashion as above 

Porteus Maze mental age. Predrug to On- 
drug changes were entered into an analysis 
of variance, as were the On-drug to Post- 
drug changes. In both analyses, the Predrug 
Scores were used to make the Initial Leve 
classifications. Four equal groups, one fo" 
each combination of Level and Drug, wer? 
used in each analysis. A total of 36 subjects 
was used in the analysis of Predrug to OP 
drug changes, while 32 subjects were avail 
able for the analysis of Postdrug change 
The results of these analyses, presented i” 
Table 2, show a Significant drug effect O” 
the Predrug to On-drug changes (the se 
nificant Levels effects are of little interes 
for the present study, since they do not 1” 
volve chlorpromazine in any way). ri 

Examination of the mean Predrug to ore 
drug changes in the two treatment grouP 


Di 
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TABLE 2 
J a rere 
VARIANCE ANALYSES OF CHANGES IN PorTEUS Maze MENTAL AGE 


Predrug to On-drug changes On-drug to Postdrug changes 


Source df MS F df MS F 
Drug (D) : 
Level (L) 1 10.56 4.84* 1 07 
4 XL : ey 7.65** 1 25.38 5.52" 
i a 3 1 2.26 
Ween subjects within groups 32 2.18 28 4.59 
Total . 
35 31 


* 
s? S05, 
- <.01. 


showed ț 
Whole de 
Mental a 


hat the experimental group as a 
clined an average of 1.2 years in 
lost tilly O pone. while the control group 
© nonsie. Fines a net difference of 1.1. 
groups in gnificant difference between the 
failure a changes reflects the 
Tecover i the chlorpromazine subjects to 
S con ia age score as sharply as 
administrati ently as they lost it with drug 
Broup did non However, the chlorpromazine 
tion of th recover enough after the termina- 
Its Predrug drug so that its net change from 
tom ape was not significantly differ- 
ol gr e corresponding change im the 
h WE (These latter data are not 
orteus M ular form.) 
b nti Zaze Q score. This score is derived 
®trorg mm and weighting various kinds of 
at do = crossing lines, cutting corners) 
age score enter directly into the mental 
to Teflect 2 It is considered by Porteus 
@elesen, the amount of impulsiveness and 
Study CSS exhibited by the subject. In this 


Contr 
Show 


by comparison with changes in the control 
group (p= .07), but the latter change was 
significantly different (p= 05) from that 
observed in the placebo group. These re- 
sults were obtained when categorization of 
subjects by initial Q score was dropped so 
that all subjects could be included. It should 
be noted that a reduction in this error score 
signifies an improvement in qualitative 
aspects of maze performance. 


DISCUSSION 


The effects attributable to the administra- 
tion of chlorpromazine in the present study 
were selective and specific rather than general 
and pervasive. Only two of the measures 
studied, the paired-associate learning task, 
and the Porteus Maze Mental Age score, 
changed more with the administration of 
chlorpromazine than with placebo. Selectivity 
in drug effects was noted within the paired- 
associate task itself, in that the later trials 
and slower learning subjects were most af- 


Number E total Q score was divided by the fected by chlorpromazine. The serial learning 
the su of mazes actually administered to task, digit span, tapping rate, dotting rate, 
"eflecte ject, so that the values analyzed and two measures of retention of the serial 
adjust errors per maze attempted. This and paired-associate material showed no 
Varig pe was made because the subjects effects related to the administration of 
P the a ely in ability level and, therefore, chlorpromazine. Significant interactions in- 
befo umber of mazes which they attempted volving chlorpromazine did occur for serial 
eE tes he the criterion for cessation learning and tapping rate following the re- 
g Sc st While very little change in the moval of medication. However, these rather 
ec aoe in the control group, it complex Postdrug interactions alone, without 
Pro zi ollowing the introduction of chlor- corroborative effects with initial drug ad- 
Cregg NE in the experimental group, in- ministration, were deemed too unreliable to 

F lizing about the influence 


q & agai 
"ug, Ti ei after the termination of the 
ecrease was not quite significant 


þe useful in genera 
of chlorpromazine. 
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Although few in number, the significant 
effects that were observed with the introduc- 
tion of chlorpromazine were consistent in 
direction. On both the paired-associate 
measure and the Porteus Mental Age score, 
the subjects who received the active drug 
declined in performance relative to those who 
received placebo. Consistency of response 
was further noted in the paired-associate task 
in that the decrement associated with the 
administration of chlorpromazine was re- 
reversed following removal of medication. 

Since previous experimental work on the 
cognitive effects of chlorpromazine has ap- 
parently not used emotionally disturbed chil- 
dren as subjects, the present results will 
have to be compared with the most nearly 
similar studies with adults. The findings for 
the paired-associate task in the present study 
are in only partial agreement with those 
obtained from adult schizophrenics by Vestre 
(1961) in a study of the acquisition of non- 
preferred word associations. He found a 
trend toward less efficient learning of the 
task as a whole in the chlorpromazine sub- 
jects, but this was not statistically significant. 
He did not examine the later trials separately. 
Also in contrast to the present study, he 
found a significant decrement in retention for 
those materials learned under chlorpromazine, 
The two studies thus seem to agree in show- 
ing negative effects of chlorpromazine, but 
to differ in the exact nature of these effects. 
The nonsignificant results for the serial task 
in the present study agree with those of 
Whitehead and Thune (1958). Using adult 
psychotic subjects, these investigators found 
no difference between chlorpromazine and 
placebo groups in learning or retention of an 
eight-item verbal series, 

The relative decline of about 1 year in 
Porteus Maze Mental Age following chlor- 
promazine in the present study was somewhat 
less than the average deficit of about 2 
years reported by Porteus (1957) and 
Porteus and Barclay (1957), but does in- 
dicate negative effects for chlorpromazine on 
this test in children as well as in adults. The 
present study also revealed a trend toward 
improved qualitative performance (Q score) 
on this test under chlorpromazine. This 
variable has apparently not been studied be- 


fore in experiments on chlorpromazine, 
though it could have valuable implications 
for the nature of the deficit produced by 
chlorpromazine on the Porteus Mental Age 
score. 

The nonsignificant results for digit span, 
tapping rate, and dotting rate agree with 
nonsignificant findings on similar variables 
for adults (Levita, 1961; Shatin, Rockmore, 
& Funk, 1956). 

To sum up, the present results from 
emotionally disturbed children would seem tO 
correspond in broad outline with those ob- 
tained on similar cognitive tasks with adults. 
In both cases the measurable effects of 
chlorpromazine are relatively few, but rather 
consistently negative in direction. 

What do these results indicate about the 
mode of action of chlorpromazine? Tw 
negative conclusions would seem to follow: 
The data do not support the notion that 
chlorpromazine will typically improve learn- 
ing or other cognitive performances 3” 
disturbed children, as might be expected if 
the drug only reduced anxiety or distract! 
bility. At the same time, the specificity of 
the negative effects observed makes it un- 
likely that the drug impairs a factor basic 
to all learning tasks, such as the formatio” 
or maintenance of associational bonds. I” 
fact, the concentration of negative drug 
effects within the latter trial blocks on the 
paired-associate task Suggests that the fac- 
tor affected by chlorpromazine can vary i” 
importance within the course of a single 
learning session, as well as from one learning 
task to the next. 

Do the data contain any more positiv? 
Suggestions as to the nature of the facto! 
impaired by chlorpromazine? In attempting 
to narrow the range of speculation, we cal 
ask whether the Porteus Mental Age score 
which also suffers under chlorpromazine, ha® 
anything in common with the paired-as° 
sociate task, especially with its later trials: 
The Predrug Porteus Mental Age score was 
found to correlate significantly carg 
N = 39, p= 05) with the number corre? 
on the third trial block of the Predrug paired” 
associate task. Interestingly enough, the cor 
responding score for the serial task, whic” 
did not show impairment with the adminis 
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tration of chlorpromazine, did not correlate 
Significantly with Porteus Mental Age (r = 
-25, N = 39). The Porteus Q score, which 
Seemed to improve under chlorpromazine, 
has no Predrug correlation in excess of .11 
with any of the three variables mentioned 
above. These data would suggest, though not 
unequivocally, that chlorpromazine impairs 
Some factor which is shared by the Porteus 
Mental Age score and the later paired-as- 
Sociate trials, but which is not heavily rep- 
resented in the corresponding serial learning 
trials or in the Porteus Q score. One pos- 
sibility is that chlorpromazine impairs the 
maintenance of active attention to novel and 
Significant details. The paired-associate task, 
with its continued reshuffling and elimina- 
Sia of items, and the Porteus Maze, with its 
i of increasingly complicated designs, 
k pa seem to share some requirement for 
i ch sustained alertness. This interpretation 
S made more attractive by the work of 
Mirsky and Rosvold (1960), which related 
na Ses of attention on the Continuous Per- 
tmance Test following chlorpromazine to 
aon of functioning in the midbrain 
ticular formation. It is recognized, of course, 
a other factors than attention may be 
She by chlorpromazine and may be more 
oth, fous in experiments with other tasks and 
er subjects. 
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QUITTING PSYCHOTHERAPY 


DOUGLAS M. McNAIR, MAURICE LORR, asp DANIEL M. CALLAHAN 
Veterans Benefits Office, Washingion, D. C. 


Groups of 106 early therapy Terminators and 176 Remainers differed sig- 
nificantly on 10 patient characteristics, Best predictors of duration were a 
Terminator-Remainer test battery plus rated motivation for psychotherapy. 
Therapists with marked interest in the patients’ problems held significantly 
higher proportions of both predicted Terminators and Remainers in treat- 
ment. 7 other therapist characteristics did not substantially influence termina- 


tion rates. Hiler’s contention that predicted Terminators and Remainers 


respond differently to the same thera 
it was found that some therapists react 


The results indicate that Terminators and Remainers form di 


patient populations. The former reject 


lack the behavioral repertoire required for participation. 


Many studies in the past decade have aimed 
at predicting premature termination of psy- 
chotherapy. Fulkerson and Barry (1961, pp. 
194-197) have presented a review and bibli- 
ography of the principal studies of the prob- 
lem. Lorr, Katz, and Rubinstein (1958) re- 
ported the development and cross-validation 
of a test battery (TR battery) to predict 
early termination of psychotherapy. Results 
of a double cross-validation provided sub- 
stantial evidence that the test battery identi- 
fied a high proportion of patients who termi- 
nated psychotherapy after a few interviews 
without their therapists’ advice or consent. 
The TR battery predicted termination with 
significantly greater accuracy than base-rate 
predictions and, contrary to the statement in 
Fulkerson and Barry’s review, with signifi- 
cantly higher accuracy than predictions by 
an interviewer. The characteristics assessed 
by the TR battery indicated, 


The Terminator is either not anxious or does not 
admit to being anxious and self-dissatisfied. He is 
likely to have a history of antisocial acts, he ad- 
mits to being undependable and impulsive, and may 


be authoritarian or rigid in his social attitudes (Lorr 
et al, 1958, p. 326). 


One aim of the present study was to attempt 
to increase the predictive accuracy of the TR 
battery and to obtain greater knowledge of 
Terminator response patterns by including ad- 
ditional patient measures in a study of pre- 
mature termination of treatment. A corollary 


10 


pist attributes was not supported, but 


differently to the 2 


types of patients. 
istinguishable out- 


psychotherapy, perhaps because they 


aim was to perform a second cross-validation 
of the TR battery. 

Studies of termination have emphasized pa- 
tient variables related to dropping out of 
therapy. Few studies systematically assessed 
the influence of nonpatient variables on pre- 
mature termination of therapy, although some 
investigation suggested the unexplained cri- 
terion variance might be due to variables such 
as therapist behavior and the setting in which 
treatment takes place. Lorr et al. (1958) re- 
ported that therapist sex, profession, and ex- 
perience did not relate significantly to termi- 
nation. They also noted that setting factor! 
such as scheduled treatment frequency, length 
of interview, and type of therapy (group ver- 
sus individual) did not correlate significantly 
with the Terminator-Remainer criterion, Re- 
cently, however, Hiler (1958) presented evi- 
dence that the type of patient terminating 
related to characteristics of the therapist- 
Briefly, he reported that therapist sex, warmth, 
and psychotherapeutic competence were asso- 
ciated with whether therapists retained pre- 
dicted Quitters or predicted Stayers in treat- 
ment. (“Quitter” and “Terminator,” “Stayer” 
and “Remainer” are used synonymously.) He 
typed his patients as Stayers or Quitters 0? 
the basis of total Rorschach responses—a” 
index with some cross-validational evidence 
Tt should be noted that Hiler reported an in- 
teraction effect between therapist characte” 
istic and patient type on outcome—not a? 
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association between therapist characteristic 
and actual termination rate. Thus his findings 
do not necessarily disagree with Lorr’s. 
_ Hiler’s (1958) study provided the interest- 
ing lead that therapist-patient compatibility 
influenced premature termination, and his re- 
sults suggested that the interaction between 
the therapist and patient “type” might be 
Crucial. His findings, however, were based on 
small samples from a single clinic and there 
ie no cross-validation of the interaction. 
te no data were presented giving the cor- 
: ations between sex, warmth, and compe- 
tence, their relative independence could not be 
ges Some of Hiler’s one-tailed test find- 
wide competence and patient type) 
not be significant if two-tailed tests 
a been used. One-tailed tests seem question- 
ert m such an unexplored area. Finally, 
Hi ain relevant data were not presented by 
et He reported an interaction of therapist 
i mth and patient type among Remainers; 
Comparable test for Terminators was 
tioned, He reported the interaction of 
his a competence and patient type among 
erminators but cited no comparable test 
= Remainers. While he reported data rele- 
eo only to stated hypotheses, it is unclear 
T Y comparable (or converse) hypotheses 
ere not tested for both Terminators and 
rar ainen, Thus a third aim of the current 
port Was to investigate the effect of the 
R a on termination and to attempt con- 
Bena of Hiler’s interactions between 
Dist characteristics and patient type. 
mi inatons were operationally defined as 
ess re who refused or stopped therapy m 
sent a 16 weeks without the advice or con- 
Sie Ss their therapists. The study hypothe- 
» based on surveys of previous findings 1m 
Tea, were: 
dinate TR battery predicts better than 
Mites better than the base rate those 
turely. who will terminate therapy prema- 


T Terminators have more poorly devel- 
al behavior than Remainers. 

Sk Terminators have less formal education 

i ~~. component of most social class 
reeman, 1961)—than Remainers. 

« © Terminators admit less anxious and 


eurotic”? x ; 
otic” behaviors than Remainers. 


5. Compared with Remainers, Teminators 
have a history of more impulsive and anti- 
social behavior, more difficulties with the law, 
and lack close interpersonal relationships. 

6. Terminators are less interested than Re- 
mainers in participating in verbal psycho- 
therapies. 

7. Compared with Remainers, Terminators 
admit less dissatisfaction with their own 
characteristics and behavior. 

8. Terminators more frequently than Re- 
mainers endorse rigid, authoritarian, irra- 
tional social attitudes, and opinions. 

9. When patients are classified in advance 
as probable Terminators or Remainers on the 
basis of a predictively valid test, interaction 
effects on outcome will appear between pa- 
tient type and characteristics of the therapist 
such as sex and competence. 


METHOD 


The sample consisted of 282 psychotherapy out- 
patients in seven Veterans Administration Mental 
Hygiene Clinics. All patients satisfied these criteria 
for inclusion in the study: males under age 51; 
without evidence of CNS damage; less than 12 hours 
of therapy during the preceding 90 days; acceptable 
to the clinic for “intensive” psychotherapy—defined 
as weekly interviews lasting about 1 hour, aimed at 
changing personal adjustment patterns, and exclud- 
ing supportive or maintenance therapy. Clinics added 
research procedures to their customary prepsycho- 
therapy processing routines. The assessment included 
a research test battery, evaluation by a social worker, 
and appraisal by the therapist. When a patient termi- 
nated prior to the 16-week evaluation date, a data 
sheet was completed giving the date and reason for 
termination. There were 176 Remainers and 106 
Terminators. Patients who terminated for reality rea- 
sons—e.g., moving, requiring hospitalization—were 
excluded from analysis. Within 6 weeks or less, 74 
of the Terminators quit therapy—confirming the ob- 
servation of Lorr et al. (1958) that most premature 
terminations occur during the first few weeks of 
treatment. 

Only 2 of the 7 clinics involved were among the 
13 in the Lorr et al. (1958) study. All clinics in 


1We are grateful to the following psychologists 
and to other staff members at their stations for par- 
ticipating in the study: R. W. Boyd (Boston), L. 
Hemmendinger (Bridgeport), P. W. Morse (Hart- 
ford), L. Rutledge (Denver), J. E. Tucker (Al- 
bany), H. H. Weiss (Chicago), and B. B. Yormak 
(Buffalo). 

The data were collected in collaboration with a 
larger Veterans Administration Cooperative Project 
(Lorr, McNair, Michaux, & Raskin, 1962). 
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both studies are located in large cities widely dis- 
tributed geographically and all clinics had 10-40 
staff members. Most therapists in both studies had 
Freudian or Rogerian orientations. The present data 
were collected throughout 1957 and 1958; the Lorr 
et al. data in 1955-56. World War II veterans 
comprised more than 90% of both samples. The 
remainder were Korean veterans. A separate study 
(McNair, Callahan, & Lorr, 1962) of the effect 
of the therapist on response to psychotherapy in- 
volved 40 of the current Remainers. 

The Terminator-Remainer criterion in the present 
study differed from those of Lorr et al. (1958) and 
Hiler (1958). Lorr’s Terminators and Remainers 
were defined, respectively, as patients remaining in 
therapy 6 weeks or less and patients remaining 6 
months or longer. Hiler’s Terminators had 5 or 
fewer interviews; his Remainers had 20 or more 
interviews. Both studies excluded the middle group 
from analysis. Inclusion of the middle group in the 
current project probably made for a more severe 
test of the hypotheses, especially Hypothesis 1, 
Obvious Terminators were probably excluded from 
the present sample since the criteria for inclusion 
were more rigorous than the usual clinic criteria 
for acceptance into psychotherapy. The 16-week 
termination rate (38%) is much lower than the 
usual 6-week base rate (about 50%) in Veterans 
Administration clinics, 


Predictor Variables 


Patient predictor variables are briefly described 
below. The arabic number in parentheses indicates 
the hypothesis tested by each Predictor(s). In ad- 
dition to those below, 10 other Patient measures, 
which proved unrelated to the Terminator-Remainer 
criterion, were included as tests of Hypotheses 2-8. 


-point 
scales designed to measure self-satisfaction; (8) 
F Scale: 25 items from the Adorno, Frenkel- 
Brunswik, Levinson, and Sanford (1950) measure of 
authoritarianism. 


TABLE 1 


DIFFERENCES BETWEEN REMAINERS AND 
TERMINATORS ON 10 CHARACTERISTICS 


M 


Re- Ter- 
main- mi- 

Variable ers nators t 
TR-BD 5.9 5.3 3.10%% 
TR-A 3.3 2.8 P al 
TR-F 4.0 3.1 3.84" 
MA scale 29.6 27.1 2,39* 
F Scale 24.5 23.1 3.26** 
Sociability 7.3 9.0 3.70** 
Ideal-Actual Self 34.6 30.9 2.61%" 
Education 5.3 4.8 sam 
Vocabulary 22.4 20.2 3.44% 
Motivation 2.7 2.3 4,93" 


Beatie 5 on education indicates high school graduation. 
Dp <1. 


Therapist predictors, Eight therapist predictors 
were included to test Hypothesis 9, These were! 
sex; profession; years of experience as a therapist) 
amount of personal Psychotherapy; competence 
liking for the Patient; interest in the type of prob- 
lem presented by the Patient; and therapist A or B 
“type.” Competence ratings were obtained by having 
three clinical Psychologists with 5 or more years 
experience audit tape recordings of psychotherapy 
sessions conducted by the therapists (interrater 
agreement was .68 2), Liking and Interest were 
rated on graphic scales by therapists at the con- 
clusion of the initial therapy hour, Therapists were 
typed as A’s or B’s on the basis of their scores oD 
23 items from the Strong Vocational Interest Blank. 
Whitehorn and Betz (1960) found these items dis- 


criminated between groups of therapists who ob- 
tained different succe: 3 


RESULTS 
Cross-Validation of the TR Battery 


Ten patient predictors sieni dif- 
ferentiated Terminators Ta ae 
(Table 1). All three tests in the TR battery 
significantly discriminated the two groups O 
patients. As in previous studies, the TR bat- 
tery subtests discriminated more clearly tha” 
their parent tests, The point biserial correla 
tions of TR-A and TR-F with the criterio” 
were slightly higher than for the MA scale 
and the F scale, while Terminators and Re 


2 
Thterclass correlation coefficient, 
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TABLE 2 
OUTCOME or TREATMENT FOR PATIENTS PRE- 
DICTED TO REMAIN AND TERMINATE 


Outcome 
Configural Re- Termi- 

i prediction main nate Total 
Remain 119 38 157 
Terminate 57 68 125 

Total 176 106 282 


; 
Note.— ?=27.05; df =1, p <.001. 


paners did not differ significantly on the 
RoE D inventory. The product-moment cor- 
ie! between TR-A and MA scale for the 
SO) ire sample was .83. The correlation is 
ti mewhat inflated, as the TR-A items are in 
W MA scale. Even so, the correlation is as 
igh as the MA scale reliability and indicates 
hg can be considered an abbreviated MA 
ae configural prediction approach sug- 
Patte by Lorr et al. (1958) was applied to the 
cutti ™ of scores on the TR battery. Using 
tiene scores from the original study, pa- 
as eith scores on each subtest were classified 
cight er Terminator or Remainer scores. The 
Subt resulting patterns of scores on the three 
hy €sts in the battery were then classified as 
al minator or Remainer predictor patterns. 
or S 2 presents the outcome of treatment 
Chi enta with the two classes of patterns. 
ig eae for the accuracy of prediction was 
ion d significant, Overall accuracy of predic- 
Nato; as 66% with 64% of the actual Termi- 
rs and 68% of Remainers identified. 

Srr’s original beta weights ® were used to 
oF multiple correlation (R) of the TR 
sample with the criterion for the current 
vane; The R obtained was 44 (Ż < 001), 
67 soe compares favorably with Rs of 
Crogg., mite s original sample and .39 in his 
tween k ìdation sample. The difference be- 
configura] .44 and the beta of .36 between the 
(Osburn Scores and the criterion was tested 
tantly ] & Lubin, 1957), and R is signifi- 
multiple S: The cutting score based on the 
regression equation yielding maxi- 


fing 


2B 
et; : 
Upon, Weights and cutting scores are available 


request, 


mum accuracy identified 72% of the Re- 
mainers, 64% of the Terminators, with 69% 
overall accuracy. Compared with the con- 
figural approach, use of the multiple regres- 
sion equation increased accuracy of identify- 
ing Remainers with no loss of accuracy among 
the Terminators. 

Use of the configural scores increased ac- 
curacy of predicting Remainers 6% above the 
sample base rate, while R increased accuracy 
10%. Only the latter represented a signifi- 
cant increase (x? = 4.10, p < .05). Judgment 
of the actual utility of the TR battery should 
consider the facts that these patients had to 
meet rather rigorous criteria for inclusion, 
that the criterion groups were defined dif- 
ferently, and that the analysis was not con- 
fined to extreme groups. It should also be 
noted that the TR battery increased ac- 
curacy of identifying Terminators by almost 
the same percentage as Lorr et al. (1958) 
found. Here the increase was from 38% to 
64%; their sample included 55% Termina- 
tors, and 80% were accurately identified. 


Additional Patient Predictors 


The findings in Table 1 support the hy- 
potheses that Remainers have a history of 
less impulsive and less antisocial behavior, 
admit more anxious behavior, are more 
critical of themselves, and are less likely to 
endorse rigid, irrational beliefs. They are also 
more retiring in interpersonal relationships, 
better educated, have better vocabularies, and 
therapists consider them more highly mo- 
tivated for psychotherapy. 

Do any combinations of the measures of 
these characteristics offer potentialities for 
increasing the accuracy of predicting termina- 
tion of therapy? The Lubin-Summerfield 
(1951) square root method was applied to 
the 10 significant variables in Table 1. The 
procedure selects the minimum number of 
variables which yields the highest possible R 
with the criterion. Variables are added in 
order of decreasing proportion of criterion 
variance accounted for. Five variables were 
added to the minimum set in the following 
order: Motivation, TR-BD, TR-A, TR-F, 
and Sociability. The multiple R for the five 
variables with the criterion was .48. The five 


variables accounted for little more criterion 
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variance than the TR battery alone, using 
the original beta weights. 


Therapist and Duration 


Patients were divided into two groups or 
types—predicted Quitters and Stayers—on 
the basis of the duration predicted by ap- 
plying the multiple regression equation to 
their TR battery scores. Effects of therapist 
variables were tested by determining if actual 
duration for the two groups related to the 
therapist variable operating. Table 3 presents 
the proportion and number of predicted 
Quitters and Stayers who remained in treat- 
ment with therapists who differed on eight 
characteristics. The Vs in Table 3 are smaller 
than in the preceding analyses because Term- 
inators who kept no therapy appointments 
were omitted; Ws in Table 3 are variable, de- 
pending on availability of data. 

Statistical analyses of the Table 3 data 
were similar for all variables. An overall x? 
test for the four or six groups (depending on 
the therapist variable) was highly significant 
due to the association of predicted and actual 
duration. Each contingency table was divided 
into two smaller tables and comparisons made 
to test for association of each therapist char- 
acteristic with length of therapy among the 
predicted Quitters and Stayers separately 
(Cochran & Cox, 1957, pp. 103-105). If 
results for the two groups differed, an inter- 
action effect of therapist characteristic and 
patient type on duration would be suggested. 

Therapist sex did not relate significantly 
to duration among predicted Quitters (x? = 

2.00, df=1, p> -10) or among predicted 
Stayers (x? = 2.12,df=1,p> -10). For the 
combined patient groups, there appeared to 
be a relation between therapist sex and dura- 
tion. However, the X= 640, p< 05, is not 
exact as it is based on a comparison made 
after seeing the data. The results merely sug- 
gest that women therapists in this study held 
higher proportions of patients in therapy. ` 

Profession of the therapist did not relate 
significantly to duration either among Quitters 
or Stayers; nor did his experience. There was, 
however, some suggestion of an experience 
effect. For the combined patient groups, the 
association of experience and duration yielded 
x? = 3.99, p < .05. Unless confirmed, the as- 


TABLE 3 
s 
NUMBER AND PROPORTION OF PREDICTED oomi 
AND STAYERS WHO REMAINED UNDER DIFFEREN 
THERAPIST CONDITIONS 


Quitters Stayers 


Therapist 2 
characteristic N p N P 


Sex 


Male 49 45 84 19 
Female 8 .62 


Profession 


Psychiatrist 11 4 28 Be 
Psychologist 28 «44 53 a 
Social Worker 18 53 31 82 


Experience 


Four years or less 26 41 46 81 
Over four years 30 52 67 8T 


Personal psychotherapy 


Some 2 43 59 80 
None 32 50 54 8i 
Competence 


High 1 
Average 3 


` 
Low 35 31 «83 
Liking 
High 17 47 51 88 


wae 
ini 
N 
= 
oo 
Ma 
an 


Low 40 45 o0 76 
Interest 

High 26 50 63 838 
Low 31 39 48 14 
“Type” 

A 8 
B w oo i S 


sociation also shoul 
because it is based 


and the correlation between experience 4? 
length of treatment is low ($ = 112). 
Therapists with personal psychotherapy 
had a median of 13 months of treatme? 
(range = 4-40 months), They did not diffe 
significantly from thera 
chotherapy in the extent 
either Quitters or St 


5 
d be regarded as tenor 
on a post hoc compatis? 


pists with no pi 
to which they ae 
i ayers in therapy. Like 
wise, Judged therapeutic competence, therap! t 
type,” and liking for the patient did an 
relate significantly to duration among Quitte” 


or Stayers. (Chi square = 4.07, p < .05; fo 
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the association of liking and duration for the 
combined groups, but, again, the reservation 
concerning post hoc comparisons applies.) 
Remaining in therapy related to the degree 
of interest the therapist expressed in the 
type of problem presented by the patient. 
The relationship was consistently significant 
in among predicted Quitters (x? = 4.73, 
j=1, p<.05) and among predicted 
Stayers (= 4.14, df=1, p< 05). The 
Correlations of interest and outcome among 
Quitters (g=.20) and among  Stayers 
ite 17) are so similar as to offer little 
Uggestion of an interaction effect. 
oe additional analyses were performed 
ie aor for suggestions that any therapist 
Nie interact with patient type. All 
Tapists who saw at least two predicted 
Quitters or Stayers were categorized as ob- 
ining predicted or unpredicted results. A 
a “rapist was considered to have achieved 
A saan result if he retained at least 507% 
his Stayers or lost at least 50% of his 
ada _Two or more predicted Stayers 
ther, assigned to 36 therapists. Only 3 
apists failed to retain at least 50% of 
“a. n. Two or more predicted Quitters were 
ae to 29 therapists; 9 of these therapists 
fan ined more than 50% of them (actual 
of 4 = 67%-100%). The proportion (31) 
With crapists obtaining unpredicted results 
05) Quitters was significantly greater (p< 
ee, han the proportion (.08) obtaining un- 
pirea a outcomes with Stayers. Stayers ap- 
erat to respond as predicted with most 
titan in the sample, but there was 4 
retai ct group of therapists who somehow 
“ai potential Quitters in therapy. 
i ws above finding does not indicate a sig- 
c sah interaction between therapist char- 
keen and patient type. It would be 
rently ty to show that Stayers respond dif- 
i entife than Quitters to the nine therapists 
eripias as retainers of Quitters. These nine 
a were assigned 26 Quitters, and only 
Signe, a BY terminated. They were also as- 
Prematy 5 Stayers, 6 of whom terminated 
(. rely, The proportions of Stayers 
t apa Quitters (.77) remaining with these 
> Ti did not differ significantly (.10 > 
95), although the trend is suggestive. 
nal analysis indicated a highly signifi- 


cant association between the type of patient 
a therapist most effectively retained and the 
type of patient assigned to him for treatment. 
The 16 therapists in the study who retained 
100% of their predicted Stayers were initially 
assigned 47 Stayers and 18 Quitters. The 9 
therapists who retained Quitters were as- 
signed 26 Quitters and 15 Stayers. Chi square 
= 13.21, dj = 1, p < .001, for the associa- 
tion between type of patient assigned and 
type retained. Since, typically, the individual 
therapist had considerable freedom in re- 
jecting or accepting a specific patient, he 
was a major factor in the assignment de- 
cision. Thus the evidence on patient-therapist 
compatibility offered no substantial support 
for the hypotheses that Stayers and Quitters 
responded differently to the same therapist 
characteristic. It did, however, uncover two 
distinct groups of therapists who appeared 
able to recognize the two patient types, who 
showed differential preferences in selecting 
them as patients, and who were unusually 
successful in retaining the preferred type in 
therapy. 
DISCUSSION 


In three separate studies, the TR battery 
has demonstrated useful predictive validity. 
‘All three studies have involved veteran popu- 
lations, and the results may not be gen- 
eralizable to other populations. Nevertheless, 
there is reason to believe that the character- 
istics measured by the TR battery are not 
unique to veterans and that these character- 
istics are incompatible with the behavior re- 
quired in conventional, verbal, insight-oriented 
psychotherapy interviews. If so, it would 
appear worthwhile to try the test in other 
settings as a means of screening out pa- 
tients for whom ordinary psychotherapy is 
likely to be simply .a waste of time and 
effort. 

While this study revealed that a sizable 
number of patient characteristics are related 
to premature termination, few of these added 
significantly to the predictive power of the 
TR battery. The one which added most, the 
therapist’s rating of patient motivation for 
psychotherapy, is not available until the con- 
clusion of 1 therapy hour. Most clinics 
would prefer to have a prediction in advance 
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in order to treat the patient appropriately. 
A sizable proportion of Terminators do not 
keep their first therapy appointment, and 
most patients wait 2—4 weeks for their first 
therapy appointment. It would be preferable 
to obtain a comparable rating from an intake 
interviewer, such as the social worker, and to 
rely on the predictor patterns previously 
published (Lorr et al., 1958). Addition of 
the Sociability scale to the predictive battery 
should await cross-validation, especially since 
it increases predictive accuracy very little. 
The evidence did not confirm Hiler’s 
(1958) report that Quitters and Stayers re- 
acted differently to similar therapist qualities. 
Therapists who had “more than ordinary” 
interest in the problems of the patients were 
more likely to hold each type of patient in 
treatment. There was suggestive evidence that 
women therapists, more experienced thera- 
pists, and therapists with strong liking for 
their patients retained higher proportions of 
both types in treatment. The finding con- 
cerning therapist sex seems sample specific. 
A significant relationship was suggested in 
the present study, but Lorr et al. (1958) 
found no such relation, and, in Hiler’s sample, 
women kept proportionately more Quitters 
and lost proportionately more Stayers. Sample 
specificity may also account for the experi- 
ence relationship, as Lorr et al. found no re- 
lation between experience and outcome. While 
the basis for predicting which patients will 
quit or stay differed from Hiler’s, the present 
method certainly qualifies as a rigorously 
cross-validated classification scheme based on 
large samples of cases widely distributed 
geographically, The liking and interest meas- 
ures in the present study are not directly 
comparable to Hiler’s peer ratings of thera- 
pist warmth. Raush and Bordin (1957), how- 
ever, have discussed the multidimensionality 
and unreliability of global assessments of 
warmth. 
The current findings point to an interaction 
possibility with a distinctly different inter- 
pretive significance than Hiler’s (1958), 
There appeared to be subgroups of therapists 
who somehow selected unusually high pro- 
portions of Quitters or Stayers as their 
therapy patients. They were quite successful 
in retaining the preferred type of patient in 


treatment, whereas the nonpreferred type pa 
tient responded to these therapists about as 
would be predicted from his TR score. Thus 
the interaction suggested is that different 
groups of therapists respond differently tO 
the two types of patients rather than the con- 
verse. The interaction appears to take place 
when patients are selected for therapy. 

At this time, the sum of the evidence sug- 
gests that Remainers and Terminators are 
two distinct patient populations who generally 
react differently to what is offered them in 
the conventional, verbal, psychotherapy set- 
ting. Patient characteristics can be assessed 
which identify large proportions of these tw 
populations. While therapists’ reactions tO 
their patients have some influence on the 
Proportions of both populations, they can 
hold in treatment, there is little evidence 
that this is a marked effect, f 

We are at a stage where the potential 
Terminator can be identified readily and ac- 
curately before psychotherapy need be tried 
as the treatment of choice. Since Terminators 
constitute a large proportion of patients seek- 
ing treatment from clinics and reject verbal 
psychotherapy as a method of treatment long 
before it could conceivably modify behavior, 
it appears there is an urgent need to stop 
wasting time and effort on a foredoomed 
treatment procedure. As Bandura (1961) sug- 
gests, the time is ripe for exploring and 
experimenting with other, perhaps drastically 
different and largely nonverbal, treatment 
techniques and methods for modifying the 
symptomatic and problem behavior of these 
patients, 
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A TWO-DIMENSIONAL SEMANTIC DIFFERENTIAL * 


ROGER B. ALLISON, Jr. 
University of Utah College of Medicine 


A “pathology score” summarized the distances of 28 stimulus words from a 


psychological origin for a modified semantic differential. 
control Ss and psychiatric patients were used to assess va 
the pathology scores. The scores had internal reliabilities 


Several groups of 
rious properties of 
of .77 for control 


Ss and .88 for psychiatric patients. Test-retest correlations were in the low 
-60s for control Ss and .45 for patients, Group differences showed a systematic 
increase in mean pathology scores as the severity of the mental illness increased. 


The factorial composition 
scores on perceptual closuri 


Osgood’s theory of the representational 
mediation processes is intimately connected 
with association, thinking, and psychotherapy. 
His semantic differential method also provides 
an approach to quantifying some of the 
manifest and latent reactions imbedded in 
verbal concepts. Both his theory and method 
(Osgood, 1952, 1953, 1962; Osgood, Suci, & 
Tannenbaum, 1957) give additional under- 
standing to the clinician who daily witnesses 
connotative-like responses to core concepts 
in patients long before the significant features 
of the concepts become conscious. Although 
the semantic differential is not capable of 
answering “why” the individual reacts as he 
does, it does enable the clinician to make 
rapid assessments of certain parameters of 
associations. One disadvantage of Osgood’s 
technique lies in the absence of information 
about which reactions are frequent and which 
are atypical. Particularly where individual 
differences are to be studied, it seems desir- 
able to have a semantic differential in which 
deviant responses on basic semantic dimen- 
sions could be discerned, 

This paper presents a modified semantic 
differential method to be used in investiga- 
tions of individual differences, The approach 
is designed to yield information to the fol- 
lowing question: If given only two semantic 
dimensions will the performances of individ- 
uals be systematically related to other be- 
haviors which, in part, are predicated upon 
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was related to clinical diagnosis and inversely to 
e tasks. Further refinement was recommended. 


the same underlying semantic characteristics? 
The study was conducted to determine th 
feasibility of more elaborate projects of less 
restricted semantic systems, which are now 1? 
progress. 


PROCEDURE 


The modified semantic differential consisted A 
task in which the subject located a series of stimi 
words in a two-dimensional semantic space. T 
good-bad and strong-weak continua were chosen 5 
approximate representatives of the first two semant 
dimensions identified by Osgood: evaluation o 
potency. The subject made judgments of the i 
lowing type: Is STUBBORN something you cons! 
to be very bad, bad, neutral, good, or very g0° 
After rating all of the words on the good-ba 
scale, the subject proceeded to make judgments 
the strong-weak continuum. The list of 28 stimul¥ 
words appears in Table 1. The ratings required Ie 
than 10 minutes for most subjects. The scori? 
procedures, which are described later, consumed * 
additional 15 or 20 minutes of clerical time. d 

The basic similarity between Osgood’s metho 
and the present approach is easily recognized. HO“ e 
ever, rather than assuming that cach response sche 
Was an equal-appearing interval scale, each respo™ 
category for each stimulus word was expressed , 
a distance from a psychological midpoint whas 
was determined from an adult sample. Like g 
(1932) unidimensional scaling procedure was use 
convert the percentage of subjects who selected © e 
response option to the average distance from de 
mean of a normal distribution which correspon“ te 
to that particular percentage. The normal devi? 
values were then translated to a scale having 10: 
arbitrary mean of 30 and standard deviation of ot? 
The scale values, which are listed in Table 1, W t 
obtained from 350 university students who sy“ 
tended a summer school course in introductory P*.. 
chology. The scale values constituted deviation °° 
pail AAU of 

?In future studies, “extremely” will be used ie: 
Modifier instead of “very” because of its gt 
scalar effect upon adjectives (see Cliff, 1959). 
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TABLE 1 
SCALE VALUES OF RESPONSE CATEGORIES 


Stimulus Very Neu- Very Very Neu- Very 
word bad Bad tral Good good weak Weak tral Strong strong 
STUBBORN 12 27 40 50 60 10 23 30 3 %9 
POLITE 0 3 9 19 35 0 8 16 25 39 
TALKATIVE 3 15 26 39 54 6 18 28 38 51 
FLASHY 12 24 35 46 357 2 #2 34 42 = 51 
EAGER 0 6 15 27 4 0 6 1528 8 
SOET 6 4 2 33 44 10 20 30 39 49 
ELASTIC 3 12 24 36 46 7 1 2% 37 48 
SMART 0 3 12 22 36 0 3 14 2 38 
MINOW 3 14 31 47 57 7 20 34 49 57 
zay 6 16 2 3 9 11 2 3 a 5 
conu 0 13 4 35 48 6 17 2% 4 5 
REAL 0 0 15 32 41 0 3 16 33 40 
BURNING 13 25 36 46 54 9 19 28 36 45 
VALUABLE 0 3 15 26 39 3 9 17 28 40 
EMOTIONAL 8 20 30 40 52 11 22 29 37 47 
ae 15 2 3 49 00 3 2 3 ā %2 Si 
THIRSTY 13 26 36 45 54 11 21 31 39 48 
CHEAP 2 27 ne 5 00 19 283 W 6 3 
FAIR 3 12 21 30 40 3 12 21 30 42 
Sone 3o 6 138 3 % o i 24 35 49 
oe o ó ú x 9I 6 B B 2 8B 
WHITE 0 3 2 33 44 3 12 24 35 45 
NIMBLE 0 7 19 32 45 6 15 24 34 47 
FANCY 6 B 25 3 5 6 18 2 40 53 
SINCERE 0 0 6 16 33 0 ó 612 19 34 
HAPPY o 0 3 16 33 3 8 13 22 36 
ee 9 18 30 40 32 2 23 2 42 5 
SILENT 7 7 23 4% 5 8 18 227 37 48 


Note =30, s=10. 


o; 
sampl Point above and below which 50% of this 
Sca] val Ponded, The procedure of allowing the 
mulus Mes for response categories to vary across 
from «Words was one of the major departures 
Soci ao d's method. This modification permitted 
Whic e disturbances to be inferred from responses 
e es markedly deviant. 
ced sults: of the unidimensional scaling pTO- 
0. fone ee the appropriateness of the as- 
5 Booq of equal-appearing intervals used by 
MDlieq y € Values reported in Table 1 strongly 
at the psychological meaning varied con- 
or Y response category and by stimulus 
E example, “very bad” ranged from 0 for 
a 0 for cmerar, and “very good” ranged 
J in T SINCERE to 60 for stuBBoRN. The vari- 
wrk me mean semantic values reported by 
Ussell, and Suci (1958) may also be 
“S support that scale values for responses 
separately for each word. However, 
S are summated over words to obtain 
d scale for the response categories, the 
F Equate equal-appearing intervals is justified, 
or indiy Wmonstrated by Messick (1957). 
‘dual use, the two-dimensional semantic 


response: 
Cralize 
as ag Ption 


differential was administered by having the sub- 
ject mark one of the five response categories for 
the two continua for each stimulus word. His 
responses were then transcribed onto a scale sheet 
similar to Table 1 which provided two scores for 
each word. These two scores served as coordinates 
to locate the stimulus words on a two-dimensional 
graph whose axes were the good-bad and strong-weak 
continua. Figures 1 and 2 represent the results ob- 
tained from two psychiatric patients and two con- 
trol subjects. The centers of these grids correspond 


WEAK -STRONG WEAK-STRONG 


BAD- 6000 
BAD-GOOD 


Fic. 1 Scatterplots for two psychiatric patients 
(paranoid schizophrenic) . 
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WEAK-STRONG 


BAD-GOOD 
BAD-GOOD 


Fic. 2. Scatterplots for two control subjects. 


to the joint means of 30 and 30 on the two axes. 
The inner and outer circles correspond, respectively, 
to radial distances of one and two standard devia- 
tions (or 10 and 20 scaled units) from the center, 
In order to reduce the subject's Tesponses to a 
single summary score, an approximation to the av- 
erage radial distance of the responses was established: 
each point was assigned a weight of O if it fell 
within one standard deviation from the joint means, 
a weight of 1 if between one and two standard 
deviations, and a weight of 2 if beyond two stand- 
ard deviations. The sum of the points so weighted 
was defined as a “pathology score” which, taken at 
face value, was a measure of the degree to which 
the meanings deviated from the average. Thus, the 
higher the pathology score, the greater were the 
deviations; and the more likely the possibility that 
an association disturbance existed, Various proper- 
ties of the two-dimensional semantic differential, 


Particularly the pathology score, are considered in 
the remainder of this paper. 


RESULTS 
Reliability 


Pathology scores based upon the odd items 
and the even items were determined for 115 
university students and 114 psychiatric pa- 
tients. The students were a different group of 
subjects than the students used for scaling 
Purposes. The university group had means of 
8.8 and 9.5 and standard deviations of 4.0 
and 3.5 on the two halves, The correlation 
between the two sets of Scores was .63, Cor- 
recting this value for the length of the task 
resulted in an internal measure of reliability 
of .77. The values for the patient group were: 
means, 9.8 and 9.9; standard deviations, 4.7 
and 4.3; correlation, .79; corrected correla- 
tion, .88. 

The difference between the 
the university grou 
was probably assoc 
in variability betw 
spection of the biv. 
all distributions t 


correlations for 
p and the patient group 
iated with the differences 
een the two groups. In- 
ariate scatterplots showed 
o be skewed toward high 
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scores although there was less skewness fot 
the patient group. 4 
In general, the odd-even split resultes 
nearly equivalent halves with internal re 
ability adequate for experimental purposes: 
The higher correlation for the patient grouP 
was attributed to the greater heterogeneity 
exhibited by the patients, Further reseat A 
may support an argument that the higher on 
relation resulted from a restriction in semanti 
freedom for the patients. 
Test-retest correlations of the pathology 
scores were studied with a twofold purp 
in mind. One purpose was to estimate ta 
stability of the scores across time and " 
other was to assess possible changes in pa 
thology scores which were associated wi 
short-term psychiatric hospitalization on 
treatment. Two groups of university studen 
(N = 50, 58) were tested; one group W 


hi 
thi 
th 
d 
retested 3 weeks later, the other 8 weeks late 
In addition, a group of psychiatric pat 
(N = 47) was tested soon after admission 4 
an inpatient unit and again a day or two e 
fore discharge, on the supposition that so 
clinical improvement had been made. Bar. 
length of hospitalization for the patien 
ranged from 3 days to 12 weeks with an & 
erage of 2.5 weeks. sete 
The test-retest correlations for the univ 
sity groups were .61 and -64, which were CO” 
siderably less than the reliabilities of conver 
tional measures of personality or intellige 
Inasmuch as the number of items was smi 
the low reliability was not unexpected. ji 
lengthening the task would increase the 1° e 
ability by the Spearman-Brown formula, fY 
times as many words would have to be u$ 
to obtain a reliability of .90, e 
Although the reliability coefficients for 


student groups were low, the coefficient Hi 
the patient group was even lower (7 = “ye 
and suggested that patients were less st 
in their responses than nonpatients. «octS 
A comparison of the percentages of subJé Jo 
who earned lower scores on retesting (60% 
and 40% for the two student groups as erie 
pared with 62% for the patients) offered l ef 
evidence that’ patients improved more 0°, 
time than subjects who were not involve 
a treatment program, Simply using as a ™ 
ure of improvement the fact that a person 


ea 
a” 
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discharged from an inpatient unit was un- 
doubtedly inappropriate. Some patients were 
discharged as not improved or transferred else- 
Where for further hospitalization and treat- 
a more refined approach noting the 
Le of the patient at the time of dis- 
ie p should be used in future studies of 
a el ationship between changes in pathology 
ores and clinical status. 
Ee summary, the reliability of the pathology 
Ski a was considered to be adequate for ex- 
A oo purposes but would need to be in- 
re for operational use. It was difficult to 
ike oe the findings that the internal meas- 
os i ani was higher for patients than 
ighe : ag but the test-retest indices were 
Underst or nonpatients. Further research and 
cla D anding of semantic functioning could 
uy the differences. 


Group Differences 


te the last few years the two-dimen- 
to nin semantic differential was administered 
in Poe adult groups. These groups are listed 
able 2, along with the respective means 
Brou variances of the pathology scores. The 
iene referred to as Normal Control Sub- 
from and Inpatient Psychotics I were subjects 
Ski gl study (Cole, Allison, Gortatow- 
jects t ranch, 1960). The assignment of sub- 
Sults os © ither group was based upon the re- 
0 se Joint clinical interviews conducted by 
ents mor psychiatrists. The University Stu- 
Which ousisted of the two student groups 
tetisticgs te used to study test-retest charac- 
tst aq. the scores which they earned on the 
administration of the semantic differ- 


Si 


Me, TABLE 2 
NS AND VARIANCES OF PATHOLOGY SCORES 


Vari- 
NPati 
Normal ¢ Character disorders 15 14.20 15.23 
DetPation: rol subjects 60 15.97 28.63 
Niversity errors 44 17.18 38.80 
Pitpatiens tudents 108 1823 43.42 
D Ptice character disorders 49 19.31 34.76 
TytPationt a otics 17 19.65 50.23 
P Patient ,,PS¥chotics 41 21.44 52.94 
Datien, P Ychotics T 44 22.68 79.98 


~ t psychotics IT 30 23.33 68.71 


ential were used in this analysis. The remain- 
ing groups were psychiatric patients whose 
diagnoses fell within the broad categories 
listed in the table. Diagnoses were made on 
the basis of routine clinical procedures of 
the inpatient and outpatient services. In all 
cases, the diagnoses were determined inde- 
pendently of any psychological test data re- 
ported in this paper. 

The data presented in Table 2 showed a 
consistent trend: as the severity of the men- 
tal illness increased there was a correspond- 
ing change in semantic functioning as repre- 
sented by an increase in the mean pathology 


score.? 

The data of Table 2 further suggested a 
relationship between the mean pathology 
scores and their variances across the diag- 
nostic groups. This is a frequent finding in 
clinical studies and perhaps meant that as the 
severity of the mental illness increased an 
ever-increasing range of test scores was sub- 
sumed under the general diagnostic groupings. 

The correlated means and variances 
prompted a transformation of the form y = 


3 The editor has questioned the relationship be- 
tween this study and Berg’s (1955, 1957) deviation 
hypothesis. One similarity between his research, this 
study, and countless other studies is the significant 
differences reported between psychiatric patients and 
nonpsychiatric subjects. When such differences occur 
they could be construed as supporting the devia- 
tion hypothesis. However, psychiatric patients are 
different by definition. The clinical fields also abound 
with techniques by which labels may be placed 
upon people. Further measures developed with the 
sole purpose to confirm the obvious seem inap- 
propriate. It should be recognized that the be- 
havioral congruencies present in “critical” and “non- 
critical” areas of functioning originate from the 
same properties of the individual. It is the at- 
tempt to identify and understand these characteristics 
which constitutes the major difference between 
Berg’s approach and that apparently followed by 
other investigators. In essence, the deviation hy- 
pothesis seems to be only a special case of a basic 
tenet in psychology—namely, the behavior of an 
individual in a given situation is a combination of 
of each attribute he has and the extent 
to which the situation permits these qualities to 
emerge. The study of individual differences is there- 
fore concerned with the nature of the combination, 
the dimensions of human behavior, and the types 
of situation in which the behaviors may occur. 
Paradoxical as it may be, it is not the purpose of 
differential psychology to conclude that people are 


different. 


the amount 
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TABLE 3 
PRESENCE OF SIGNIFICANT DIFFERENCES IN MEAN Loc SCORES BETWEEN GROUPS 


Group 

Group 1 2 3 4 5 6 7 8 
1. Inpatient character disorders 
2. Normal control subjects No J 
3. Outpatient neurotics No No 
4. University students Yes No No . 
5. Outpatient character disorders Yes Yes No No B 
6. Inpatient neurotics Yes No No No No : 
7. Outpatient psychotics Yes Yes Yes Yes No No 
8. Inpatient psychotics I Yes Yes Yes Yes No No Wo No 
9. Inpatient psychotics II Yes Yes Yes Yes No No No 


log1o x before analysis of variance techniques 
were applied to ascertain whether significant 
differences existed between the group means, 
An F value of 6.10 was significant beyond the 
-001 level of confidence and indicated that 
the mean scores between some of the groups 
were statistically different, 

Kramer’s extension of Duncan’s range test 
(McGuigan, 1960, p. 185) was employed to 
determine which means were significantly dif- 
ferent at the .05 level of confidence. The re- 
sults, which are Presented in Table 3, indi- 
cated that the psychotic subjects had higher 
scores than normal controls, outpatient neu- 
rotics, and university students. However, the 
attempt to differentiate neurotic subjects from 
normal people was not successful, 

In addition to the quantitative findings, 
certain relationships were noted between the 
patterning of responses on the two-dimen- 
sional graphs and personality characteristics. 
Although the relationships must be treated as 
hypotheses which evolved from the study, 
they warrant mentioning because of the im- 
plications they have for semantic functioning 
and personality. H,: Character disorders re- 
verse the good-bad continuum (€.g., CHEAP is 
“very good,” SMART is “very bad,” etc.). Hy: 
Depressed people give a Preponderance of re- 
sponses on the bad side of the good-bad con- 
tinuum. Hs: Male homosexuals have a nega- 
tive correlation between the two semantic 
axes—that is, concepts rated as “good” are 
also rated as “weak,” and concepts rated as 
“bad” are also rated as “strong.” Hy: Appre- 
hensive individuals mark the “neutral” cate- 


gory for most words on the strong-weak W, 
tinuum. Additional research is required 
test these hypotheses. 


Correlations with Other Variables 


The semantic differential was included pi 
two other research projects in order to oP 
tain zero-order correlations with as many 
other variables as possible. The first study 
utilized a group of 88 adults who were care 
fully evaluated by psychiatrists and assign” 
to one of the following categories: Norma's 
Neurotic, Character Disorder, and Psychol 
Some of these subjects were used in groups T° 
ported in Table 2. The pathology scores ha 
significant correlations, at or beyond the -0: 
level of confidence, with the following vat! 
ables: Diagnosis (.42), Wechsler Adult Intel 
ligence Scale (WAIS) Comprehension (—.32)1 
WAIS Arithmetic (—.26), WAIS similarities 
(~.28), WAIS Verbal IQ (—.28), Gestal 
Completion (—.29), Mutilated Words (—.30); 
and number of literal Tesponses on Benjamin 
Proverbs (,32), Nonsignificant  correlatio”® 
were obtained with SEX, age, education, WA 7 
Information, WAIS Digit Span, WAIS V° 
cabulary, and number of concrete respons? 
to a linear schematization task. 

The relationship between the pathology 
Scores and diagnoses represented a corre z 
tional statement of the group differences ig 
Ported in Table 2. The importance of t a 
relationship was the fact that the two-dimen” 
sional semantic differential provided an ©& 


perimentally independent measure of psy’ cho 
pathology, 


A Two-DIMENSIONAL SEMANTIC DIFFERENTIAL 23 


oad other correlations showed that the pa- 
oe were related to several vari- 
For he hich were also related to diagnosis. 
B the disturbed person generally 
tasks p more poorly than other subjects on 
a quiring abstract thinking, judgment, 
of visu pee attention, and the organization 
Sout percepts, Thus, the pathology scores 
able tom been duplicating information avail- 
ully E other sources. To understand more 
tholo e novel contribution which the pa- 
‘i thee score had with respect to diagnosis, 
dependa analy. sis of the 15 experimentally in- 
ormed Te discussed above was per- 
Unities į he matrix of intercorrelations, with 
dialys principal diagonal, was factor 
ve hi 1 y standard computer methods. The 

% ighest eigenroots, which accounted for 
Veal the total variance, and their corre- 
simple st vectors were retained and rotated to 
Major ‘os ucture, None of the 15 variables had 
Or; th oadings on more than one rotated fac- 
ors wi € correlations between the rotated fac- 

ere <|.31]. 

hal of the rotated factors was an expected 
r Fa ha Piehiension ability; the remaining 
Permit ctors were not defined sufficiently to 
Was the mcrpretation. Of major interest here 
ic diff nature of the loadings for the seman- 

es Pay and diagnosis. These two vari- 
Vere BS or one of the four factors which 
and .64 interpreted and had loadings of .82 
Smaller her ee Two other variables had 
&stalt ut negative loadings on this factor: 
Words completion (—.50) and Mutilated 
factor (>34). It was quite possible that this 
Ut wi epresented a disturbance in thinking 
tig, WOuld require major loadings of addi- 
ee of association, thinking, and 
suc, tion before it could be interpreted as 
iferentia factor suggested that the semantic 
the sa lal and diagnosis were measures of 
forts Me underlying attribute. Further ef- 
© clarify this factor are obviously de- 


Ver 


Percep 


of gz Second study was based upon a group 
ance Sera who sought psychiatric assist- 
WVailay m an outpatient clinic. The variables 
the 16 z for correlational purposes included 
manifest needs” from the Edwards 


Per 
Son 
al Preference Schedule and the Thorn- 


dike Vocabulary Test. “Aggression” on the 
Edwards schedule had a correlation of .37 
with the pathology scores from the semantic 
differential. None of the other correlations 
was significant at the .05 level of confidence. 
Thus, performance on the semantic differential 
had little, if any, relationship with stated 
needs of the individual. The absence of a 
significant correlation with the Thorndike test 
was further confirmation that the pathology 
score was not a measure of verbal ability. 
In sum, the correlations which the pathol- 
ogy scores had with other measures suggested 
that the modified semantic differential was of 
diagnostic importance and tapped other areas 
of cognitive functioning such as perceptual 
closure. Whether the “processes” which un- 
derlie a disturbance in semantic functioning 
are similar to those which disturb thinking 
and perception requires more extensive re- 


search. 
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THEMATIC AND COGNITIVE RESPONSES OF GOOD 


PREMORBID SCHIZOPHRENICS TO CUES OF 
NURTURANCE AND REJECTION? 


KENNETH E. LEBOW 2 anp SEYMOUR EPSTEIN 


University of Massachusetts 


emotional involvement. Rather than a 


phrenics exhibit deficit for cues of emotional inv 


The present study has its roots in two 
sources. One is a paper on the Measurement of 
approach-avoidance conflict (Epstein & Fenz, 
1962), in which it was demonstrated that 
specially constructed stimulus dimensions in 
projective techniques provide effective tools 
for the measurement of conflict, and it was 
recommended that they be used to explore 
conflict areas in the behavior disorders. The 
second is the highly interesting work on 
schizophrenia Teported by Rodnick and Gar- 
mezy (1957) in the Nebraska Symposium, in 
which the value of working with structured 
cues of theoretical relevance to schizophrenia 
was indicated. Their conclusion that schizo- 
phrenics experience a “censure-cue behavioral 
deficit” is Promising for the conflict dimen- 
sional approach of the first source, which uses 
Cognitive deficit along a stimulus dimension 
as one index of conflict, 

Following theoretical Conceptions of the 
etiology of schizophrenia (Arieti, 1959), as 
well as experimental findings (Rodnick & 


a project on the measure- 
ment of drive and conflict, of which S. Epstein is 


principal investigator, and which is being sup- 
ported by Grant M-1293 from the National In- 
stitute of Mental Health, United States Public 
Health Service. K, Lebow, under the supervision of 
S. Epstein, conducted the study in partial fulfillment 
of the requirements for the master’s degree at the 
University of Massachusetts, Both authors par- 
ticipated in scoring the stories. 
2 Now at the University of Nebraska. 


1 This study is part of 
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figures represented in nurturant, ambiguous, 
oy hero. The schizophrenics were found to 
eer pictures following mother pictures (.01 
ores of p Nurturance (.05 level), produced 
urturance as a function of cues varying from rejection to 


1 matched controls were tested with 


responses to ambiguous scenes, It was suggested that good 


a defense against 
specific “censure-cue deficit,” schizo- 
olvement, in general. 


Garmezy, 195 7), it was assumed that a major 
source of disturbance for the schizophrenic 
lies in the mother-child relationship. It be 
further assumed that the most fundamental 
characteristic of interpersonal relationships) 
in general, lies in the realm of acceptance an! 
rejection. Accordingly, a dimension of cues 
varying from nurturance through rejection 17 
reference to the behavior of a mother towar 
a young boy was constructed, Partly as a con- 
trol for the mother figure, and partly for 
investigation in their own right, paralle 
dimensions were constructed for father an 
peer figures, 


METHOD 
Subjects 


3We wish to express o 
cooperation of the admin 
Northampton 
Psychologist ai 
committee for 


y he 
ur appreciation for ua 


ief 


istrators and patients 


pi re 
ur appreciation to the hospital wb@ 


the controls Were tested, they requested to rem 


anonymous, 
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at the results more dependable. Twelve of the 
te aaa were diagnosed as paranoid, three as 
E ean and six as chronic, undifferentiated. All 
tion EA lergoing drug therapy. Length of hospitaliza- 
409 Mere from 3-92 months, with a mean of 
meai T Age range was 19-45 years, with a 
eae a and a standard deviation of 7.30. 
Auten a ucation varied from 6-16, with a mean 
level ae K a standard deviation of 2.54. Vocabulary 
O Sie e Shipley (1940) scale, which was used 
with a verbal intelligence, varied from 9-38, 
9.02 a 25.2 and a standard deviation of 
the “Water eoan background was determined by 
ateoa ed States Bureau of the Census (1950) 
skilled fa The sample consisted primarily of un- 
a canada Hat and clerical workers, but contained 
scatterin m le number of skilled laborers, and a 
. aS individuals in other occupations. 

Patients ee group consisted of 21 volunteer 
disorders who were under treatment for physical 
piae an Air Force hospital. Duration of 
of 145 A ion varied from 3-21 days, with a mean 
range, h ays. The groups were closely matched on 
tion cree and standard deviation for age, educa- 
ök ne vocabulary level. They were also matched 
mber of subjects in occupational categories. 


Materials 


i ie materials consisted of four practice pic- 
lowed by nine experimental pictures, all 
formeq on in size, The experimental pictures 
gure ae dimensions, one depicting a mother 
i interacti a father figure, and one a peer group 
consisted aa with a young boy. Each dimension 
a rejecti of a nurturant scene, a neutral scene, and 
Other ee scene, presented in that order. For the 
Were rel Sore the scenes of nurturance and rejection 
Kee ed to food in order to represent the 
ar Ae nature of the relationship with the 
o represent the father figure, scenes were selected 
Buide SEI the role of the father in providing a 
e role p ael for the boy. Peer scenes related to 
e experi the group in the socialization process. 
Where it imental pictures are presented in Figure 1, 
ambiguity ®? be seen that there is little room for 
e a. the nurturant and rejecting scenes. 
Nurtura uli can be further defined by the mean $ 
order of Scores obtained by the control group. 
fans are nurturance, ambiguity, and rejection, the 
“3 fath as follows: mother figure: 8.4, 6.9, am 
85, gg it figure: 8.6, 6.7, and 4.3; peer figures: 
ance ae It is apparent that levels of nurtur- 
between different within dimensions and similar 
P. mensions, as intended. 
"Ocedure 
The sons 
iate “hizophrenics were individually tested in a 
q °Y woul ae in their ward. Subjects were asked if 
ae with fe to participate in a study which 
ae ects refi elling stories to pictures; only two 
tin ‘On: ‘used. A tape recorder which was visible 
Spicuous was described as an aid that would 


tu 


allow the subject to speak freely without being 
concerned about the experimenter getting down the 
material. 

Control subjects were tested in the hospital wards, 
where privacy was obtained by drawing a curtain 
around the subject’s bed, and by selecting only 
subjects for whom the adjoining beds were vacant. 
To control for examiner practice effects, the two 
hospitals were visited on alternate days. To control 
for and evaluate sequence effects, seven schizophrenics 
and seven controls were randomly assigned to each 
of the three following sequences of presentation of 
the dimensions: MFP (mother-father-peer), FPM 
(father-peer-mother), and PMF = (peer-mother- 
father). 

In order to insure the task being understood, and 
to reduce practice effects, four practice pictures 
were presented before the nine experimental pic- 
tures. An inquiry was conducted for the first four 
pictures by asking for a beginning, an ending, and 
a full description of feelings and action. Following 
this, the experimenter made no further comment 
after introducing the pictures. The initial instructions 
were as follows: 


This is a test of creative imagination. I will 
show you some pictures, one at a time, and I 
would like you to make up a story about each 
one. Try to tell a complete story—one that has 
a beginning, a middle, and an end. If you can, 
tell what happened before the event, what is 
happening now, how the characters feel, what 
they are thinking, and what the outcome will be. 
Just speak your thoughts as they come to mind, 
Tell the stories from the viewpoint of the young 
boy who is the main figure in the pictures. When 
you are finished with a story, place the picture 
face down. Do you understand what I want you 
to do? All right, let’s begin. 


Each picture was introduced with a comment 


such as, 
Can you tell me a story about this boy and 
his mother [or “this boy and his father,” or 
“this boy and these other boys”]? 


The stories were typed verbatim on 5X8 inch 
cards with an identifying code number so that 
they could be scored blindly. Before scoring, the 
cards were shuffled, and the experimenter read a 
sufficient sample to obtain a frame of reference for 
an initial sorting. The major score, p Nurturance, 
varied from strong rejection of the boy hero at 
the low end of the scoring dimension to strong 
nurturance toward the hero at the high end. 
Nurturance was defined as “to express sympathy in 
action, to be kind and considerate for the feelings 
of others, to encourage, pity, and console. To aid, 
protect, defend, or rescue an object” (Murray, 1943, 
. 10). Rejection was defined as “any act, thought, 
or feeling that indicates a lack of concern for the 
child’s welfare.” Stories were first scored relative 
to other stories to the same picture on a five-point 
scale, with the restriction that each pile had to 
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Fic. 1. The nine experimental Pictures. (The 
of relationship and figure related to.) 


pictures form a 3 x 3 design for nurturance 


contain at least five stories, except for the two iabili i 
end piles. Two judges independently did the scoring. A W 
Where disagreement was within two points, averages are misi : a 
were taken; where greater than two points, differ- increase RAET 
ences were resolved by discussion. Interscorer of the judges E 


ents for the individual picture? 
to .82, with a mean of .67. Thess 
mates, as combining pictures shoul 
ty, as should averaging the rating? 
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à we ay point, stories were scored relative to 
pictur icular picture, so that comparisons between 
‘told could not be made. In order to establish 
a te values across pictures, two typical stories 
patici: ected to represent each of the scores on a 
each aa prenre: The 90 representative stories (two 
sorted -rel ave values for nine pictures) were then 
a ia nue to each other by two judges along 
scorer s TE scale of equal-appearing intervals. Inter- 
igh om iability for this phase of scoring was very 
represen indicated by a coefficient of 97. The two 
original tative stories were than averaged and the 
respondi 5-point scales were translated into cor- 
the Bint © values on the 11-point scale, In summary, 
interval Scores were on an 11-point equal-appearing 
extreme scale ranging from extreme rejection through 
ESDMeSEtod ee The midpoint of the scale, six, 
rejection or either an absence of nurturance and 
Stories or an equal balance between the two. 
‘Goodne were also rated on a 5-point scale of 
rom ess of Response.” As the range was similar 
avalon ce to picture, no attempt was made to 
Were ate 11-point scale across: pictures. Raters 
Boodnes ed to make a global judgment of the 
ollow; S of the stories by taking into account the 
A four elements: 
diaa Integration—the extent to which the 
rer iay of the stimulus are accurately noted 
Biven ee into the story with no credit to be 
details or simply enumerating perceptions. Noting 
Story ie compatible with the development of the 
teint, be considered a poor response. 

Which ization and Development—the degree to 
e; vents logically and necessarily follow from 
i ther. A good story should have a begin- 
events” an end as well as a description of ongoing 
the sitivity and Richness—the degree to which 
Mere dee Teasonably and effectively departs from 
the Scription of the stimulus. The originality of 
Ot, the vividness of description, the degree of 
and drama, and the extent to which the 
‘Aptures the reader’s interest are favorable 
org ations. 

ry Ta Presentation—the degree to which the 
rors _ Well presented and is free of grammatical 
Propet inappropriate comments and laughter, 1m- 
Stutterin Se of words, speech disturbances, such as 
“eliabititie: and inappropriate pauses. Interscorer 
51 ies for Goodness of Response varied from 
of 63,” l for the individual pictures, with a mean 


n ses 

the antition to the two scores already described, 
NVestigg nine four measures involving time were 
Uses ed: reaction time, response time, rate of 
measuremen fluency. In a previous study on the 
hem, ic ent of conflict with a specially constructed 
Press), , aPperception test (Fenz & Epstein, in 
me, ta *Stinctive gradients were found for reaction 
2 function of pauses, and Goodness of Response as 
Tea g ie of a stimulus dimension relevant to the 
brovi e nflict, It was concluded that these measures 
eq fapmising indices of anxiety induced cog- 

it. Reaction time, which was taken from 


the tape recording, consisted of the interval in 
seconds between the introduction of a picture and 
the beginning of a story. Rate of pauses was defined 
as the number of silences longer than 2 seconds 
divided by response time. Response time was defined 
as the interval in seconds between the beginning and 
end of a story. Response time was investigated in 
its own right only because it was available as one of 
the steps in determining rate of pauses. Fluency was 
measured by the reciprocal of rate of pauses, and is 
thus an alternate measure of the same variable. 


RESULTS 


The data were analyzed by separate anal- 
yses of variance for the dependent variables 
of p Nurturance, Goodness of Response, and 
the four formal measures. The analysis of 
variance consisted of 2 X 3 X 3 X 3 factorial 
designs for diagnosis (schizophrenic versus 
control), relationship (nurturance, ambiguity, 
or rejection depicted in the stimulus), figure 
(mother versus father versus peers), and 
sequence (MFP versus FPM versus PMF). 


Formal Measures 


Of the four time measures, only reaction 
time revealed differences between the schizo- 
phrenic and control groups. In Table 1 it 


TABLE 1 
ANALYSIS OF VARIANCE OF REACTION TIME 


Source df MS F 


Between subjects 


Diagnostic group (D) 1 1.06 <1.00 

Sequence (Q) 2 135.15 <1.00 

DX 2 259.37 <1.00 

Subjects (Ss) (within groups) 36 384.967 13.314 
Within subjects 

Figure (F) 2 77.39 2.31 

FXD 2 52.25 1.56 

FXQ 4 108.94 3.26* 

FXDXQ 4 164,11 4,01 

FXSs 72 33.40% 

Relationship (R) 2 41.17 

RXD 2 4.62 

RxQ 4 16.86 

RXD XQ 4 60.99 

RXSs 72 59.398 

FXR 4 92.20 

FXRXD 4 59.29 

FXRXQ 8 34.40 

FXRXD XQ 8 22.13 

F XR XSs 144 28.929 


ms above it to the next error term. 


a Error term for ter! 
: erm for other terms. 


R XSs was used as error t 
Re * p <.05. 
wep < 01. 
week p = .001. 
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@= MOTHER, FATHER, PEER 


SEQUENCE: A= FATHER, PEER, MOTHER 


O= PEER, MOTHER, FATHER 


— — — — SCHIZOPHRENIC 
CONTROL ? 


TIME IN SECONDS 


MOTHER FATHER PEER 


Fic. 2. Reaction time as a function of diagnosis, 
figure presented, and sequence of presentation, 


can be seen that the Diagnosis x Figure 
XxX Sequence interaction is significant at the 
-01 level. Inspection of Figure 2 indicates 
that the significance is a result of the highly 
elevated reaction time of the schizophrenics 
to the peer figures in the MFP sequence. The 
results cannot be attributed to fatigue, as 
neither the mother nor father figures produce 
similar results when last, The MFP sequence 
is the only one in which the mother figures 
precede the peer figures. It thus appears that 
either the mother cues or the combined 
mother and father cues have an effect upon 
the schizophrenics which increases their re- 
action time to the peer cues, 

Other statistically significant findings on 
the formal measures are of only incidental 
interest. For reaction time (see (Table Dy 
relationship is significant at the -05 level be- 
cause of relatively long reaction times to the 
ambiguous pictures. It is, of course, not sur- 
prising that ambiguous stimuli do not suggest 
a story so quickly as more highly structured 
stimuli. A significant Figure x Relation- 
ship interaction is a consequence of the am- 
biguous picture of the peer relationship 
eliciting a higher reaction time than the other 
ambiguous stimuli. This suggests that the 
ambiguous picture of the peer group is simply 


more ambiguous or difficult than the other 
ambiguous stimuli. Inspection of the responses 
supports such a view, as there are sever 
spontaneous comments about it being difficult 
to tell if the peer group is welcoming of 
rejecting the hero. The significant Figure X 
Sequence interaction is solely a result of the 
elevated reaction time of the schizophrenics to 
the peer figures in the MFP sequence, which 
has already been discussed, The only other 
significant finding is on frequency of pauses; 
where sequence is significant at the .05 level, 
with the highest rate of pauses occurring in 
the MFP sequence. 


Goodness of Response 


Following Rodnick and Garmezy’s (1957) 
conclusion that schizophrenics suffer from 4 
censure-cue behavioral deficit, it was ang 
ticipated that schizophrenics would obtain 
lower scores of Goodness of Response on the 
rejecting than on the ambiguous and nur- 
turant stimuli. The results fail to support 
expectancy. While the schizophrenics perform 
significantly more poorly (.001 level) than 
the controls on all pictures combined, with 
means, respectively, of 2.40 and 3.33, H 
differences associated with stimuli groupe 
either according to rejection, ambiguity, an 
nurturance, or mother, father, and peer figures 
are not significant. There is a tendency (.10 
level), however, for the schizophrenics to tell 
their poorest stories to the nurturant stimuli 
and their best stories to the ambiguous 
stimuli. Thus, it is the nurturant cues that 
are more disruptive for the schizophrenic, a 
the nonemotional ones which he responds tO 
most effectively. The controls do poorest 0? 
the ambiguous cues, which is understandable 
in that ambiguous cues offer the least 
material for a story. 


Press Nurturance 


While Goodness of Response provided a” 
evaluation of the disruptive effects of cues 
on cognition, p Nurturance provides an ap 
proach to the dynamics and response biases 2 
the schizophrenic, Figure 3 presents 14 
results grouped according to cues of nurtur” 
ance, ambiguity, and rejection. Two observ? 
tions are immediately apparent. One is th? 
the schizophrenics produce a less steep 
gradient than the controls; they attribute 125 
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CONTROLS 
GOOD PREMORBIOS 


NON 
ou 
.  ' 


TAT p NURTURANCE 


45 
4.0- 


— 
REJECTING 


at 
NURTURANT AMBIGUOUS 


RELATIONSHIP DEPICTED 


Fig i 

and R Press Nurturance as a function of diagnosis 

a ationship. (A neutral response is represented 
Score of 6.) 


hu 
teen? to the nurturant stimuli, and less 
is signif, to the rejecting stimuli. The effect 
e p ant at the .01 level, as indicated by 
Table lationship x Diagnosis interaction in 
Turtur, 4, The other is that the denial of 
denial ofS is considerably greater than the 
ing, rejection, which corresponds to the 
b hrenic on Goodness of Response that schizo- 
tespond; tend to exhibit greater deficit in 
"election © to cues of nurturance than of 
that the. In Figure 3 it can be seen 
tura oe schizophrenics obtain lower p Nur- 
stimu; Scores than the controls on the pooled 
© for he overall mean p Nurturance 
Pateq the schizophrenics is 5.7 as com- 
‘categ NA for the controls. Table 2 in- 
ne 05 ES t the difference is significant at 
e ; el. A comparison of the groups on 
K e Suous stimuli, which should be free 
ane rs of avoidance of affect laden cues, 
a WoUs f while the controls describe the 
‘zo gures as primarily nurturant, the 


r i $ . . 
igure 2, Nics describe them as rejecting (see 
3) J 5 


` Signi 
Action C acant Figure X Diagnosis inter- 
level) in Table 2 indicates that 


the schizophrenics and controls differ in the 
relative nurturance they attribute to the 
mother, father, and peer figures. The inter- 
action is illustrated in Figure 4, where it can 
be seen that the groups differ most on mother 
figures and least on peer figures. The schizo- 
phrenics describe the mother figure as highly 
rejecting, while the controls describe her as 
highly nurturant. 

In order to determine whether the differ- 
ences between the groups on p Nurturance 
are a result of differences in story telling 
ability, the groups were subdivided into the 
best 3 and poorest 4 on Goodness of Response 
within each sequence. The mean Goodness of 
Response scores for the 12 poorest and 9 
best schizophrenics, respectively, are 1.81 and 
3.14. The corresponding means for the con- 
trols are 2.96 and 3.91. Thus the better 
schizophrenics outperform the poorer con- 
trols. In-view of the small and unequal num- 
bers in each subdivision, and the considera- 
tion that levels of performance were not 
orthogonal with respect to diagnostic groups, 
statistical analysis was not attempted. Rather, 
the data were inspected to determine whether 
the relationships already noted held across 


TABLE 2 


ANALYSIS OF VARIANCE OF p NURTURANCE 


Source df MS F 
Between subjects 
Diagnostic group (D) 1 24.90 4.47* 
Sequence (Q) 2 8.94 1.60 
2 2.07 <1.00 
Subjects (Ss) (within groups) 36 5.575 3.35 9" 
Within subjects 

Figure (F) 2 6.69 3.98* 
FXD 2 5.93 3.52* 
FXO 4 2.54 1.51 
FXDXQ 4 ui <1.00 
FXSs 72 1.68% 1.01 
Relationship (R) 2 478.85 146.43% 
RXD 2 23.52 7.19" 
RXQ 4 6.34 1.93 
RXDXQ 4 2.52 <1.00 
RXSs 72 3.278 1.96 
FXR 4 9.26 5.570" 
FXRXD x 1.95 Ok 
FXRXQ 8 3.33 1.88 
FXRXDXQ 8 2.80 1.68 
F XR XSs 144 1.66% 


above it to next error term. FXR 


Error term for terms 
sEm for other error terms. 


Ss was used as error term 
a * p <.05. 


wp Z 01. 
weep < 001. 
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MOTHER FATHER PEERS 


Fic. 4. Press Nurturance as a function of diagnosis 


and figure. (A neutral response is represented by a 
score of 6.) 


levels of performance. In Figure 5, it can be 
seen that the differences between schizo- 
phrenics and controls on Ż Nurturance for 
mother, father, and peer figures cannot be 
explained away by differences in story telling 


7.3F 
7.2 

7al 
7.0 
6.9 
6.8 
6.7 
6.6 
65> 


6.4 
63 [ 


6.2 
eit 


(N=12) 


POOREST CONTROLS 


ability, as the best and poorest performing 
schizophrenics produce parallel curves, E 
tributing least nurturance to the moti 
figure and most to the father figure. The PoS 
controls, on the other hand, describe the pee 
figures as least nurturant. Pa 

There are two further observations of in 
terest in Figure 5. One is that within bo x 
diagnostic groups there is an inverse relation 
ship between adequacy of performance an 
$ Nurturance. This is not an artifact 


U 
ness of Response failed to reveal a tendeno% 
for nurturant pictures to obtain higher scor® 
on Goodness of Response than rejecta 
pictures. The second observation concerns F; 
distribution of Ż Nurturance among the pn 
figures. Whereas the best controls descri 
the mother figure as the most nurturant, 
poorest controls describe her as a relative 
neutral figure. Thus, on both overall 
Nurturance and pattern of $ Nurturance, “af 
poorest controls fall between the best co 
trols and the pooled schizophrenics. het 
further interest, responses to the mos g 
figure provide the greatest discrimination a 
tween the better and poorer performing C° i 
trols, while responses to the father figure D 
next, and responses to the peer figures are 
least effective discriminators. This is the $4” 
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with groups divided on 
by a score of 6.) 


THEMATIC AND COGNITIVE RESPONSES TO CUES 31 


Pattern as was found for schizophrenics rela- 
tive to controls. 
š When p Nurturance is examined as a func- 
tion of levels of nurturance in the stimulus, 
the best controls obtain the steepest and 
the poorest schizophrenics the least steep 
Sradients, Of more interest, the best schizo- 
eats, despite higher Goodness of Response 
Cores, produce a less steep gradient than the 
Poorest controls. Thus, the relatively flat 
gradient of the schizophrenics cannot be 
attributed to a general factor of poor re- 
anding, but is indicative of an avoidance 
nat to cues of nurturance and rejection. 
ithin diagnostic groups, the poorest and 
te performing subjects differ most on the 
im stimuli and least on the rejecting 
ance i. Thus, again the pattern of p Nurtur- 
Scores of the poorest controls places them 
a the best controls and the pooled 
zophrenics, 
eae are some significant findings of 
Geen interest only, in Table 2, which 
e ant brief reference. As might be expected, 
$ = Niilus dimension accounts for a major 
ance of the variance for scores of p Nurtur- 
Cene The mean score for the nurturant 
Nes is 7.90, for the ambiguous scenes, 6.10, 
le the rejecting scenes, 4.00. Pooling 
tan gies of nurturance, there are oo 
evel) ft a among the figures ie 
ighes ith the father figure obtaining 
est score of p Nurturance, with a mean 
of 20, the mother figure next, with a mean 
me 5, and the peer figures the lowest, with 
tionshi of 5.75. A significant Figure X Rela- 
s ms interaction (.001 level) is a result of 
Sree oh gradient of p Nurturance for peer 
Whic than for mother and father figures, 
Bures may indicate that responses to peer 
Sticg a determined by stimulus character- 
Mother. a greater extent than responses to 
are and father figures, which presumably 
Ore influenced by personality factors. 


each Discussion 
action Time 


iq 
Dhreng,Clevated reaction time of the schizo- 
Sequen, to the peer figures in the MFP 
Wenge. Suggests the possibility of a se- 
The y cUe behavioral deficit in schizophrenia. 
Which te sequence was the only one in 
he mother figures preceded the peer 


figures, so that it appears that either the 
maternal cues by themselves, or in combina- 
tion with paternal cues, interfered with re- 
sponses to the peer cues that followed. It is 
hypothesized that the sequence-cue behavioral 
deficit is indicative of a conflict between relat- 
ing to others and being dependent upon the 
mother, i.e., thoughts about relating to the 
mother are incompatible with thoughts about 
relating to contemporaries. The findings on 
reaction time are in particular need of verifi- 
cation, as a sequence effect had not been 
anticipated, and the results are not supported 
by other measures of cognitive deficit. A 
study is currently under way in which se- 
quence effects are being more thoroughly 
investigated in a complete Latin square de- 
sign, and in which a measure of GSR is used 
as an alternate index of conflict. 


Press Nurturance 


There were three major findings on p 
Nurturance. First, the schizophrenics pro- 
duced flatter gradients of p Nurturance as a 
function of nurturant cues than the controls, 
ie. they told less rejecting stories to cues 
of rejection and less nurturant stories to cues 
of nurturance. Second, the schizophrenics ob- 
tained an overall lower score of p Nurturance 
than the controls across all figures. While 
the controls tended to perceive others as 
nurturant, the schizophrenics described them 
as rejecting. Third, the schizophrenics de- 
scribed the mother figure as particularly re- 
jecting, while the normals described her as 
highly nurturant. The first finding is, of 
course, consistent with the widely held view 
that the most fundamental defense in schizo- 
phrenia is emotional withdrawal. In light of 
this, the second and third findings are of 
particular interest, as emotional denial should 
make the schizophrenic view others as neutral 
rather than rejecting. Why should the schizo- 
phrenic then be prone to describe others, 
particularly the mother figure, as rejecting? 
The most obvious explanation is that the 
schizophrenics were, in fact, rejected by their 
mothers, and the thematic responses reflect 
their past experiences. An alternate explana- 
tion, not incompatible with the first one, is 
that denial of nurturance serves as a defense 
mechanism, which reduces the possibility of 
personal involvement and rejection. If others 
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are rejecting there is little reason to attempt 
to relate to them, and should one make the 
attempt and be rebuffed, this simply confirms 
expectation and one is prepared. Thus, a 
proclivity to perceive people as rejecting, and 
to deny nurturance, in particular, serves the 
same function as emotional withdrawal, 
namely, protection of the schizophrenic from 
rejection. 

Evidence supporting the interpretation of 
denial of nurturance as a defense mechanism 
is provided by some recent validity studies on 
thematic responses (Epstein, 1962; Fenz & 
Epstein, in press). It was found, for example, 
that novice parachutists who acknowledged 
that they were afraid, and were attempting to 
control their fear, told stories about TAT 
heroes who were described as unafraid (Fenz 
& Epstein, in press). The stories, thus, re- 
flected the defense of the parachutist rather 
than his emotional state. Moreover, as will 
be seen later, there is considerable evidence 
apart from the present study that nurturant 
cues are disruptive to schizophrenics, which 
is consistent with the assumption that they 
arouse defensive behavior., 

For the mother figure, whom the schizo- 
phrenics described as particularly rejecting, 
denial of nurturance may well be a means of 
coping with a conflict over dependency. In 
this respect, it is important to recognize that 
the schizophrenics are all good premorbids, 
who would be expected to actively cope with 
their problems. Such an interpretation is con- 
sistent with the findings on reaction time, and 
is supported by several other studies. In an 
unpublished study by Ussery, cited in Rod- 
nick and Garnezy (1957), good premorbid 
schizophrenics told TAT stories characterized 
by themes of dominance and submission, with 
heroes attempting to establish independence 
from mother figures. In a study by Harris 
(1957) it was found that while poor pre- 
morbids overestimated the size of pictures 
of mother-child relationships, good premor- 
bids underestimated them. Harris interpreted 
underestimation as indicative of control. In a 
study on projective responses of paranoid 
schizophrenics, Lane and Singer (1959) re- 
ported, as one of their most clear-cut results, 
that schizophrenics revealed greater depend- 
ence on the mother figure than controls. A 


final reason for interpreting low p Nurturancé 
scores for the mother as a defensive reaction 
rather than simply as a biographical de- 
scription is that the schizophrenics atribi 
considerably higher p Nurturance to i 
father than the mother figure, whereas bas 
evidence there is indicates that the fathers i 
good premorbids are more rejecting than ja 
mothers (Rodnick & Garmezy, 1957). T 3 
schizophrenics’ thematic responses may Ha 
not only reveal an attempt to cope Wi 

dependency by devaluating the mother sgua 
but may represent an attempt to find a s \ 
able model for masculine identification z 
favorably evaluating the father figure. It Wa 
be interesting to determine whether poor pre 
morbids, who presumably have experienc 
greater rejection from their mothers tha 
their fathers (Rodnick & Garmezy, 1957); 
will show a reverse pattern of p Nurturan¢ 
for mother and father figures. Presumably 
they would be seeking the dependent relation — 
ship with a mother figure which the g00 i 
premorbid is attempting to reject. 


Goodness of Response 


The results on Goodness of Response fail 
to support the hypothesis that schizophreni® 
exhibit increasing cognitive deficit as a fur? 
tion of decreasing cues of nurturance. Rathe! 
there was a strong tendency for the schizo 
phrenics to tell their Poorest stories tO 
nurturant cues and their best stories to a 
ambiguous cues. Thus, if anything, the schizo” 
phrenics find nurturant as well as rejecti?? 
cues disruptive, and the former more per 
Rather than a specific censure-cue def je 
(Rodnick & Garmezy, 1957), the schizophre™ 
exhibits a general deficit for cues associat? 
with emotional involvement, whether of J 
positive or a negative nature. While this oe 
clusion is supported only tenuously by ef 
results on Goodness of Response, there is ot? 
evidence in the Present study and in oF 
studies which strongly supports such a va $ 
In the present study, the flat gradient ° Hi 
Nurturance produced by the schizophrenies 
is indicative of a failure to vary respons, 
to correspond with changes in the stima - 
and can thus be interpreted as poor respe at 
ing. In this respect, it is noteworthy r | 
responses to the nurturant cues were P°° | 
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a ew to the rejecting cues. Turning 
(1961), 1 studies, Atkinson and Robinson 
Rhine in an investigation of paired-associate 
ocd concluded that not only does social 
for schi a serve as a positive reinforcer 
a debilitat, renics, but it may actually have 
gating ating effect. Turbiner (1961), investi- 
varyin oe are discrimination of stimuli 
and = ong dimensions of positive, neutral, 
phrenic gative affect, found that schizo- 
ctiminati were poorer than controls on dis- 
and po ing pairs of stimuli on both negative 
on Mid inpast; but did not differ 
signed f stimuli. In a particularly well-de- 
eT , Johannsen (1961) compared 
ation an and normals on a double alter- 
Nonsociel lem under conditions of social and 
er oe bae Be found that normals 
Schizoph better with social feedback, and 
ae tenics with nonsocial feedback. Thus, 
veral picture supports the conclusion 

es schizophrenics demonstrate deficit for 
Sener] social and emotional involvement in 
È ge than only for cues of censure. 
„ĉason that some studies find greater 

Or en Cues of negative affect and others 
diffe. Ve affect may be that the stimuli 
in features other than the affect 


“pict, 
Positive: - In the present study, for example, 
Socia affect for the mother figure was as- 


deni mth feeding and negative affect with 
Scenes al of food. It is possible that if other 
have n nad been used, nurturant cues would 
l Sak been more disruptive and avoided 
for f e ng cues, There is obviously a need 
rej er work in which cues of nurturance 
Contei, Hon are presented in a variety of 
E se S, before it can be established that 
“ther, of cues is more disruptive than the 
So 
merged Particularly interesting observations 
E when Goodness of Response was 
t Was subdivide subjects within groups. 
betw n Ound that the poorest controls fell 
oes best controls and the pooled 
s ome in both magnitude and pattern 
orm g turance scores. While the best per- 
8S mor, ONtrols described the mother figure 


e 
fure nurturant than any of the other 
ab ; tae poorest performing controls de- 
hizo, T as relatively neutral, and the 


tenics as highly rejecting. These find- 


ings suggest that perception of p Nurturance, 
particularly in regard to mother figures, may 
be related to a very broad dimension of social 
and intellectual competence, with schizo- 
phrenia at one end. It will be interesting to 
test this hypothesis by obtaining  Nurtur- 
ance scores from mental defectives and under- 


achievers. 
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TRANSFERENCE IN CO-THERAPY GROUPS 
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Literature on co-therapy groups, 


for patients of both sexes to identify 


opportunity to work out anxieties wi 


th 


led jointly by male and female therapists, 
is discussed. Their advantages, against grow 
include: (a) reproduction of the original 


ps led by 2 like-sexed therapists, 
family situation, (b) opportunity 
with a like-sexed therapist, and (c) 
a therapist of the more-dreaded sex. 


Case histories describe a Don-Juan type of man whose adjustment improved 
after working through defenses against Oedipal feelings in co-therapy, and 


an immature borderline wom 
when reassured by the sim 
transference phenomena 
therapy are discussed. 


Co-therapy groups, led jointly by a man 
and a woman therapist, seem logical in view 
of the wide agreement that a basic value of 
group therapy is that the patient can experi- 
ence and work through multiple transfer- 
ences; indeed, many writers believe also that 
another basic value of the group is its 
resemblance to the family. Yet, with some 
notable exceptions, literature on co-therapy 
with groups, except for reports on its use in 
training, usually focuses on its utilization 
as a kind of parameter in group therapy, a 
deliberate variation of technique to solve a 
specific problem or achieve a limited goal. 

i sually done by a prearranged 
Structuring of the Situation so that each 
therapist plays a given role. Thus male co- 
therapists, one behaving authoritatively and 


toward adults (Adler 
social worker leaves 


representing the reality Principle (Klapman 
& Meyer, 1957). A 


to resemble the 
stereotypes of masculinity and 
femininity. Thus two therapists offer them- 
selves as ideal substitute parents to the 
mothers and fathers of children under treat- 
ment at an agency, and find that questions 
about feeding and weaning are addressed to 
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an who overcame panic reactions to both sexes 
ultaneous presence of 2 parent figures. Special 
» research possibilities, and possible problems in co- 


the woman therapist, questions about educ 
tion to the man (Grunwald & Casella, Bee 
Three co-therapists, representing fat le! 
mother, and older brother, attempt to pa‘ jp 
the family constellation of the cultur’ 
treating hospitalized Porto Rican a ie 
Phrenics; the “father” therapist is aut ivi 
tarian; the “mother” therapist is submis 
and offers food; the “brother” is a a 
companion (Maldonado-Sierra, Trent, us 
nandez-Maria, Flores-Gallardo, Viger 15 
Rivera, & De Colon, 1960), The therapi 6 
wife, an assistant rather than a peer, provi in 
nourishment and comfort to the patients e 
stress situations, appearing as the ideal W 
and mother (Fink, 1958). 
Besides the special advantages of mle 
taking, the co-therapy situation may &4 p! 
one therapist to “supplement and complem® y 
the resources of the other” (Demarest 4 
Teicher, 1954); makes it easier for the A 
therapists to sustain group hostility 5° ye 
tense that it might be difficult for a ee 
therapist to bear (Kassoff, 1958); alee 
blind spots in one therapist to be cou?” g 
acted by the other’s insight (Loefil ae! 
Weinstein, 1954); and possesses various Oat 
advantages as recently enumerated by Mo): 


and Sangiuliano (1960). Slavson Ovi 
however, questions the desirability of & "ce 
Sion of the positive and negative transfer 
upon two distinct individuals,” believing re 
a real resolution of ambivalence cann© 
sult from a fragmentation of transfere 
This viewpoint, in the writer’s opinion, 7 gl 
well be opposed on grounds that the “no” 


Je 
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oad is developed, and ambivalence 
tked out, in a two-parent situation. 


Co-THERAPY as A FAMILY SITUATION 


tnt Prestructured co-therapy groups as 
therapist TETA above, in which the two 
VAC ae plan their respective roles in ad- 
whick he essentially different from groups 
cedure of ai tor basic psychoanalytic pro- 
tences etting each patient develop trans- 
conflicts ee to his own needs and 
ulse L ach an approach is reported by 
stein” ulow, William, Rindsberg, & Ep- 
p eterogeneous leadership represent- 

parental figures seemed to have 
the reproduction of the patients’ 
nflicts.” Demarest and Teicher 
also report that “the presence of 
: d female therapists made it possible 
Which oe to structure a family group 
Conficts A the acting out of family 


palp an 


tien 
Patien if both therapists are male, the 


tles an structure the situation them- 
ists tre react differently to the two thera- 
decide o dess of whether or not they have 
One the m advance to play definite roles. 
the dher et may seem less threatening than 
May Seem (Loeffler & Weinstein, 1954); one 
tewart good” and one “bad” (Cameron 
that atin’ 1955); indeed, some writers state 
3S to a oe may react to one male therapist 
T to a ather and to another male therapist 
The tater ther (Lundin & Aronov, 1952). 
“rapists writers find that, though both 
are male, there is 


a sim 
9 ‘ul, 
; vo Faia family setting created by the presence 
tistics EAA figures. . . . The physical char- 
the therapists become less important 
Psychological differences. . . . One thera- 
“ther ee as more aggressive and masculine, 
More protective and feminine (p. 77)- 


This 
elie nding is in line with the general 
1Ps i. individual psychoanalysis, the 
Bin) Writ 3 Seldom crucial; as Fenichel 
moths Can a Both men and women pa- 
Pt Sears do develop both father and 
erences toward their analysts, 


Tey he M 
Dore. Writ 
t on Pi has not found in the literature any 


Women serving as co-therapists. 


» (1956), who found that “double, 


whether male or female,” though he cites 
some types of patients as exceptions. Mullan 
and Sangiuliano (1960) also hold that 
“regardless of whether the therapists are of 
different sex or not, the patients will conceive 
of them unconsciously as being of different 
sex.” 

Group therapy, however, does not, in this 
writer’s opinion, really duplicate the classical 
psychoanalytic situation in which the impact 
of the analyst’s real personality is minimized, 
so that an important part of his function is 
to serve as a screen for the patient’s fantasies 
and projections. The real personality of the 
therapist is relatively more important in 
group treatment. Consequently, many values 
usually considered characteristic of group 
therapy are enhanced by the presence of a 
man and woman as co-therapists, but would 
presumably be lessened if both therapists 
are of the same sex. 

If group therapy offers an especially good 
opportunity to work through transference 
distortions originating in relationships to 
parents and siblings, the presence of male 
and female parent figures would seem to be 
particularly helpful in eliciting and clarifying 
these distortions. But if the patient reacts, 
consciously or unconsciously, to one man as 
a father and to another man as a mother, 
the unraveling of transference distortions be- 
comes needlessly difficult. 

If we believe that the patient can utilize 
the group as a kind of second family, more 
accepting and more growth stimulating than 
his original family, it seems desirable to 
have both a father and a mother figure 
present, but a family of like-sexed parents is 
not acceptable. 

And if the group is of value in continually 
confronting the patient’s perceptual distor- 
tions with reality, then therapists of op- 
posite sexes offer a unique opportunity for 
reality testing. But an appreciation of reality 
can hardly be fostered by a situation in 
which it is tacitly accepted that, of two male 
therapists, one will unconsciously be regarded 
by the patients as a woman. 

Indeed, it may well be that confusion about 
sex identification and sex role, almost always 
found in people seeking therapy, could well 
be heightened by a situation in which like- 


36 ELIZABETH E. Mintz 


sexed therapists are seen as playing op- 
posite-sexed roles. But in a group with male 
and female therapists, both of whom pre- 
sumably are clear about their sexual identity 
and reasonably happy with it, a patient of 
either sex has an especially good chance to 
introject this sexual self-acceptance. 


Co-THERAPY FOR AN OEDIPAL PROBLEM 


The illustrative cases to follow are based 
on the writer’s 3 years of experience with 
two male co-therapists, comprising three 
groups in all. This way of working seems a 
trend among therapists in private practice, 
usually not because of practical considera- 
tions but as a quest for the optimum thera- 
peutic situation. The therapists work on a 
peer basis, with a group drawn from the 
private practice of both. Often a patient 
enters the group after a period of individual 
treatment, and may continue to see his 
therapist individually while in group. 

The case of “Jack” demonstrates how a 
patient may use the co-therapy situation to 
work through a series of transference re- 
actions, which in this case seemed to parallel 
very clearly the vicissitudes of Jack’s rela- 
tionship to his mother, and also the current 
difficulties he experienced with women. As 
with the other case to be described, only 
those aspects of treatment relevant to co- 
therapy can be discussed. 


after brief and un- 
oman, entered treat- 
who placed him in a 
functioned 


is first year as a Patient 
in the co-therapy group, Jack’s attitude toward 


2 For convenience, male and female therapists will 
be called, respectively, Dr. M and F., Thomas E. 
Tierney was the male therapist in the first case 
reported; Samuel Shrut in the second case. Grateful 
acknowledgments are due to both. 


Dr. M remained predominantly positive. He ide 
ized and respected Dr. M, adopting the aa 3 
of a boy toward a revered father, thouknm 
tended to be petulant and demanding 
financial arrangements. , d DeF 
In sharp contrast, Jack’s attitude towar ‘th 4 
showed dramatic fluctuations. He began bee 
markedly flirtatious attitude, a brilliant s a 
openly appraising glances. Group mamit Fe 
both therapists all perceived the contemp it 
by Jack’s seductiveness. To Jack himah dged it 
barely below consciousness, and he acknowle! $ 
quite easily when it was pointed out to him iil! | 
currently, he recognized the contempt and ion Í 
which he felt toward his endless process 
girl friends. i 
> Snort afterwards, Jack began to express otf 


t 
men 
contempt for Dr. F. He found her Eo grat? 
stupid, and did not understand why ot con 


members seemed to take them seriously. 


a 


taking Jack with him. He recognized this fa” py 
as deriving from an early childhood A 
his father would get fed up with his him. J 
mother and walk out, taking Jack with, h to 
this time, Jack was not aware that his wis a inst 
alone with father was essentially a defense amt 
direct Oedipal feelings for his mother, as 
clear later in treatment. orons y 
Now Jack began to attack Dr. F bases A 
“sloppy sentimentality.” He felt she Bae al 
own patients in the group, especially | w 
patient whom he considered her favorite- cb 
attitude seemed to derive from one O 
major childhood problems: a younger favo 
charming but crippled, had apparently been 5 va 
by the mother. There was a period of hom ja 
longing for his brother figure in the group» f 
Jack was able to acknowledge. He possesse pe 
cient ego strength to see almost without 
peutic intervention that the homosexual sal 
had an unreal quality and were essen? 
defense against his wish for exclusive por o 
of Dr. F’s attention. He now recalled that * cle! 
homosexual impulses, which he had never qi 
upon, were likely to occur when he vom y 
appointed in obtaining the affection of a WO™ ge 


Now, for the first time, Jack began ie 
directly with his Oedipal attraction tow’ uP ad 
mother. Seated next to Dr. F in the 8t0\ gf 
lit her cigarette (contrary to group custome 
was surprised to find himself experiencing his i 

the fear and excitement he had felt on iih 
date. He related this to early sexual red É 
toward his mother, for which he had fea 
father’s rebuke, as he now feared Dr. it y 
was aware also that frequently he still sE "y 
little boy pretending to be a man. Throve sd 


i ; itat? 
experience, Jack realized that he still besit 
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ee with male authorities, as he had feared 
Ree with his father. 

k i a new attitude toward Dr. F appeared. 
el (unwillingly, then openly, he expressed 
3 - “She does say something good some- 


tim 

aver a I hate to admit it.” The contemptuous, 
Eora attitude vanished. About this time, Jack 
: a change in his social life. Sexuality for 


its ow, a 
wn sake seemed less important, and he began 


to . 
Sareea regularly with a girl whom he liked 
M he ed, Concurrently, Jack’s attitude toward 
eae rage less juvenile, more. man to man. 
ack is no end of his second year of co-therapy, 
Problems ae one to deal with pre-Oedipal 
nthe O ai had been repressed even more deeply 
Wish to k par conflict. He sees that an infantile 
dependence ependent, and a concomitant fear of 
rom ei contributed to his pattern of running 
Often ae girl to avoid an involvement. He 
reco ty for Dr. F’s sympathy and affection, 
Ongings, a zes this as an expression of infantile 
Motherly = actuality, he does not try to obtain 
Crapist y ie from Dr. F, but accepts her as a 
Setbac] ii h friendly appreciation. With occasional 
ave steadily business dealings with male authorities 
Sitls have be, improved, and his relationships with 

ecome warmer and more stable. 


Co- 
THERAPY FOR A FRICHTENED PATIENT 


Te oe of “Lois,” like Jack’s, demon- 
Nee reg emergence of a series of transfer- 
ine related to levels of feeling for 
Specigy cont Parent. It also illustrates two 
Patient is alues of co-therapy: it enables a 

usl O relate to two parent figures simul- 
la king } where this opportunity had been 
Securit ìn real life; and it may afford extra 


i 
Y for a patient almost paralyzed by 


At ay Lo 
treatment Lois came to Dr, F after 3 years of 
go tse o with two male therapists, undertaken 
he Severe attacks of agitated depression and panic 
PSbitalizatin to carry her to the very brink of 
aa TOs] ee The first therapist had resorted to 
"alyticay, 22d advice. The second, @ psycho- 
Z om psychologist, succeeded in 
ois © hold jobs intermittently, but finally 
nae work with a woman because she 
ae ed by the sexuality and hostility of 
S haq ae feelings toward him. 
family nown no men. Her father had deserted 
mopa ter, When Lois was five, and died shortly 
Re Was ig ny sibling was a sister. Her 
near-psy aS immature immigrant woman. 
tateq be otic attacks had apparently been 
oe the marriage of the sister, which 
espe and her mother alone together, 
Lost rel: ‘ately to one another, with almost 
Ois da ationships, 
Occasionally, but became panicky and 


Oi 


furious at any sexual gesture from the man. She 
could not hold a job for more than a few months, 
because she grew terrified as soon as she, became 
sexually aware of her boss or any other man in 
the ofñce. 

In 2 years of treatment with Dr. F, there 
was some improvement. She held jobs longer, 
seemed further from a psychotic break. But she 
still found it difficult to endure the presence of men 
at work or on dates, and she could not break 
away from the frantic dependency on her mother, 
now extended to Dr. F also. “Let me come home 
and live with you . . . I won’t make any trouble. 
Tl do all the housework.” 

To offer Lois a relationship with a strong male 
figure in a situation where she could still feel 
protected by her mother-therapist, Dr. F placed 
Lois in a co-therapy group. At once, she began to 
see Dr. M as strong, aloof, unfeeling, and inhuman. 
Simultaneously, her attitude toward Dr. F began to 
change. She saw Dr. F, a vigorous woman in the 
early forties, as pitifully old and frail. Although 
she knew that Dr. F had a family and a full 
practice, she saw her as dependent. “When a 
patient leaves, you just go home and cry and 
cry... . You don’t want me to go on a vacation 
and have fun. You don’t want me to get well and 
leave you. . - - You will die when I leave.” 

Not until much later in treatment did Lois 
seem to realize fully that, in fact, Dr. F did not 
depend on her. Both therapists interpreted to 
Lois that she was projecting upon Dr. F the same 
pathological dependency which existed between her 
mother and herself. They tried to help her see that 
she herself had contributed to the mutual depend- 
ency, and had never been able to admit that her 
mother could survive without her. 

However, insight with Lois did not ever seem as 
clear or as important therapeutically as it was with 
Jack. For her, actual experience seemed far more 
important. Still insisting that Dr. F would die at 
Lois’ departure from treatment, and that her 
mother could not survive without her, Lois, now 
almost 30, succeeded with great apprehension in 
taking an apartment away from her mother. The 
therapists speculated that this had become pos- 
sible because, although Lois had blurred Dr. F 
and her mother into a single image, she also 
recognized unconsciously that Dr. F was really 
independent, and was able to use this recognition 
to correct her distortion of her mother and move 
out without feeling like a murderess. Indeed, the 
mother, now in supportive therapy, responded to 
the separation very favorably. 

With Dr. M, Lois, who had been quiet and 
frozen with her previous male therapists, became 
gradually more at ease. Though she still saw him 
as inhuman, she became able to joke and argue 
with him in the group. She asked for occasional 
individual sessions with him, granted with the 
consent of both therapists. And for the first time, 
she had dates with the same young man for a 
period of several months. 
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Perhaps because Dr. M’s presence gave security, 
Lois now began to express frankly a death wish 
toward Dr. F: “You look tired tonight. Your work 
is too much for you. Maybe you belong in an 
old-age home.” This death wish reached its peak 
when | she began to talk, only half playfully, of 
what a pleasure it would be to poison Dr. F 
“because there isn’t room on earth for both of us.” 

Here co-therapy was terminated by scheduling 
difficulties, and Lois was offered the choice of 
working with either therapist. After considerable 
conflict, Lois chose Dr. M, as both therapists had 
hoped she would. “I neéd to learn to get along 
with men.” 

A few months thereafter, while working with 
Dr. M, Lois still sought occasional individual 
sessions with Dr. F, as if to assure herself that 
she had not destroyed her therapist mother. In 
the last of these sessions, she reported gleefully 
that she no longer found Dr. M aloof and strong. 
“He’s just as weak as you are. Maybe worse. 
He’s only human.” 

Outside treatment, also, males became human. 
Lois could now hold a job, because she did not 
fear her bosses any longer. She began to permit and 
enjoy physical contact with the man she had been 
dating, apparently without serious conflict, She 
developed new interests, traveled on vacations. A 
year later, Dr. M reports that Lois has finally 
relinquished her transference distortion, and now 
sees Dr, F simply as her ex-therapist. 


THEORY AND OBSERVATION 


Both these cases illustrate familiar phe- 


nomena: the emergence in transference of a 
series of attitudes 


toward an important parent 
figure, each attitude serving as a defense or 
disguise for a more deeply repressed feeling; 
concomitant improvement in real-life relation- 
ships; lessened anxiety; and the final de- 
velopment of a comfortable, fairly realistic 
relationship with the therapist. Equally 
striking cases from co-therapy groups could 
be described, as well as cases less successful. 

But patients improve, or do not improve, 
in various situations, What special contribu- 
tion can co-therapy make? 

A striking feature of both cases is that 
co-therapy confronted these patients with a 
therapist of the sex which they particularly 
feared. Lois had avoided men, and had made 
but little progress with two male therapists. 
Jack had run from genuine relationships 
with women, and had left his first female 
therapist. Yet both patients profited by 
dealing with therapists of the more dreaded 
sex. Similarly, the writer has found co- 
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therapy groups especially valuable for ceria 
types of homosexual male patients who i 
partly worked out their fear of dealing wi ; 
heterosexual men and are now ready to K 
a masculine authority figure. Here, t 
therapist of the less-dreaded sex serves, 
merely by being present, as a reassurance. 

As to the actual vicissitudes of transferencé, 
they do not always follow what might i 
been predicted a priori. A series of comp A 
mentary transference reactions to the a 
therapists might well have been predicted; 
perhaps a patient would alternately like we, 
dislike both therapists, or perhaps a pers 
would feel toward both therapists what hi 
had felt toward father and mother at an 
earlier age. This latter type of complementary 
transference does sometimes occur, as one 
lified by Jack’s fear that Dr. M would rebuk 
him for relating as an adult man to Dr. F 
involving both therapists in a repetition 0 
Oedipal feelings. 

Often, however, a patient appears to foc 
transference reactions predominantly on on 
therapist at a time. Sometimes a patiens 
uses a positive, stable relationship with Ca 
therapist as a background or an anchor } 
reality, which enables him to experience a” 
express vehement transference feelings towa" 
the other therapist. Thus, for the most part 
Jack used his relationship with Dr. M as t 
background against which he could work ou 
his vivid transference attitudes toward Dr. ©’ 
The transference reactions had an intensit 
and immediacy seldom found in patient? 
whose reality testing is as good as Jack’s. . P 

With Lois, co-therapy seemed effecti” 
partly because she could relate simultaneous! 
to a father and a mother figure, an oppos 
tunity denied her in childhood; and part 
because she was able t 


o use each therapi? 
as a shelter or protection against her tra” 


ference fears of the other. In Dr. F’s pien 
ence, Lois could learn how to deal with a a 
authority without panic. And with Dr- pe 
in the group, she could finally experience t 
paralyzing death wish toward her mothe". 
Thus, in these patients, the co-theral, 
Situation facilitated two aspects of pe 
chotherapy generally considered effecti 
with Jack, the development of insight iy 
emotional patterns derived from €? 
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oe relationships, through transference; 
ae is’ case, a “corrective emotional 
Xperience.” 
—_ from its therapeutic values, the co- 
sittatien sane provides a natural laboratory 
nowled hs ich may well contribute to our 
he ar of psychotherapeutic processes. 
ingly ie of the therapist, increas- 
events, ig owledged to affect therapeutic 
observatio easily held constant in co-therapy 
ists alten e In private practice, both thera- 
© the oe ami groups singly in addition 
e aua a group. This situation could 
arisons c a easily in clinics. Thus com- 
Ms solo ould readily be made between Dr. 
thelh eoo E Up; Dr. F’s solo group, and 
k no ae group. 
pi Ai — to comparisons as to thera- 
answered rad specific questions might be 
tients bri rough recorded sessions. Do pa- 
emotional S up Oedipal conflicts with more 
situation? po orema in the co-therapy 
an the }© they speak more to the woman 
Material ins in dealing with pre-Oedipal 
the Writer’ oes further observation confirm 
actions ioa finding that transference Te- 
Cothen expressed with special intensity 
Necessarily D? If so, does this intensity 
S reflected imply more therapeutic progress 
inally 43. behavioral changes? 


rapist fa or use a relationship with one 

i fo S a defense against experiencing 

r the other. Unresolved tensions 

Pi therapists would heighten such 

t $ es Co-therapy comes to be used more 

fhnic, Uther study of these aspects of the 
ue Is needed. 
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LEVEL OF ASPIRATION AND SOCIAL DESIRABILITY 


IN CHRONIC SCHIZOPHRENICS? 


RUSSELL P. NORMAN 
Queens College, Charlotte, North Carolina 


The relationship between response to social desirability (SD) and level of 
aspiration behavior (LA) was studied in a chronic schizophrenic population. 
107 hospitalized schizophrenics were given a modified form of the Edwards 
SD scale and were rated as to mental health (MH) status. 4 experimental 
groups of 12 Ss each were formed representing the high and low extremes 
on each variable. All Ss were given 10 trials on a letter-digit substitution task 
and their LA was obtained after each trial. Results indicated that the 
performances of the high SD Ss were superior (p <.05), and discrepancy 
scores (LA minus performance) were significantly related to the MH variable 


(p<.01), with low MH groups having negative and high 
positive discrepancy scores. The use of MH in combination w 
valuable in demonstrating the existence of several major behavior: 


in a group of chronic schizophrenics, 


With the increased use of objective person- 
ality inventories, psychologists have become 
aware of test-taking attitudes or response 
sets which affect individual performances on 
these instruments. The response to social 
desirability (SD) has been the subject of 
much study in both normal and psychiatric 
populations. Clinicians have found that in 
working with psychiatric patients, many ap- 
pear strikingly normal when responding to 
the objective type personality inventory 
(Feldman, 1956; Friedman, 1953; Hillson 
& Worchel, 1957; Meehl, 1946). Leary 
(1957) adds emphasis to these findings when 
he comments that many psychotics, especially 
those with a paranoid diagnosis, seem to 
present a two-layer picture; a facade of 
normality covering intense feelings of rage 
and despair. 

Some rather strong evidence has been 
gathered linking SD with self-concept, 
(Cowen & Tongas, 1959; Norman, 1961; 
Zuckerman & Monashkin, 1957). Subjects who 
achieve high SD scores appear to have more 
favorable self-attitudes than subjects with 
low scores. If it is true that psychotics as 


1This paper is based on a dissertation submitted 
to the faculty of the University of North Carolina 
in partial fulfillment of the requirements for the 
degree of Doctor of Philosophy. For their helpful 
assistance, the author would like to express his 
appreciation to W. G. Dahlstrom, E. L. Streuning, 
Harold Wilensky, and Leonard Solomon. 
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MH groups 
ith SD was 
al variations 


individuals do not all appear to be psychoti¢ 
on objective psychodiagnostic tests, that ÍS: 
if their scores are distributed with some high 
degree of variance, one might postulate 
certain behavioral correlates of these 1° 
sponses. It should follow that if an individua 
is suffering from a severe depletion of self- 
esteem, this would be revealed by means ° 
a task calling for goal-striving and achiev 
ment behavior. On the other hand, if th? 
individual were able to erect some defenses 
against this unfavorable self-concept, 
might give quite a different picture on t 
same tasks. 

The level of aspiration technique (LA) P$ 
been used frequently in the experiment’. 
study of personality. Most investigation, 
have concentrated on the use of this meth® 
in terms of achievement motivation. Howeve’: 
it has been suspected by many that LA i 
an excellent measure of defensive behai 
(Harris, 1955; Holt, 1946; Sears, 1940 5 
Several investigators have used the LA tec fi 
nique in the study of schizophrenia, the mO t 
noteworthy studies being . those of on 
(1955), Miller (1951), and Winder (1952 
The most consistent findings concern 
extreme variability of LA behavior in 
schizophrenic groups, where LA tended , 
be unusually high or low in comparison W' 
normal subjects. However, the problem , h 
discovering which schizophrenics set 


the 
the 
to 


OS. < 
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alata ones set low levels, and which 
Tt sée pond normally is yet to be resolved. 
has a = one of the main drawbacks 
terms Hn tendency to classify subjects in 
Fitis t e standard nosological categories. 
iin rue that schizophrenics as a group 
of Bhs deal of variability on a variety 
classif hee other methods of selecting and 
dealing g subjects may be more useful in 
t a Ban these particular variables. 
contro] ms that an important dimension to 
evel of in research of this type is actual 
show so mental health. Since schizophrenics 
it is as ee variability in degree of illness, 
reflect yar e that SD differences will not only 
sive test ying predispositions toward defen- 
Sociateq responses, but also may be as- 
ence of with variations in the actual pres- 
e ie acilar symptoms or undesirable 
esirabilit This “reality” factor in social 
investigate, has been mentioned by several 
Perber ors (Crowne & Marlowe, 1960; 
Seems wo Spanner, 1960) and certainly 
a tthy of consideration. 
father Present study was designed to test one 
aspiration howe proposition; that level of 
response 4 chavior would be associated with 
Chizoph © social desirability in chronic 
D a SS More specifically, that high 
Spiratio ects would show higher levels of 
(differen. and positive discrepancy scores 
L : caer, between actual performance and 
uti ile low SD subjects would be more 


us A 
ettive demonstrating very low or even 


healt 'S, high SD subjects who are relatively 
ne, h J 


S, an 
$ d 


lveness, 


A METHOD ” 


Mod: 
8 Odificat; 
Sile wag ution of the 39-item Edwards (1957) SD 
ea ed. The change represented an attempt 
Tn ee evenly balanced as to responses 
Situs Y dro and “False.” Ten False responses were 
ed from He and 11 True responses were sub- 
Si the original 79-item SD scale (Fordyce, 
deg Nee É 
Sig the ori 


Partien only 


ginal study was more extensive in 
fr those methods pertaining to this 
Port will þe cited. 


1956). Thus, the final balanced scale contained 40 
items. 

An independence of functioning scale developed by 
Wilensky and Solomon (1960) was used to rate the 
mental health of all subjects. This scale concerns the 
patient’s ability to assume independent responsibility 
in caring for himself, and ranges from ability to live 
and work outside the hospital (score of 1) to requir- 
ing maximum supervision for daily activities (score 
of 6). Each subject’s final rating was an average of 
three independent ratings gathered from ward 
personnel (ward psychologist, psychologist trainee, 
ward physician, or head nurse). Product-moment 
reliabilities for the ratings ranged from .73 to .82. 

The SD scale was administered to 107 chronic 
schizophrenics at the Veterans Administration hos- 
pital, Montrose, New York. Mean length of hospitali- 
zation was approximately 5 years. Those subjects 
who could not respond relevantly to the scale were 
eliminated from the study. High and low cutoff 
points were established for the SD scale and the final 
mental health ratings were dichotomized. Four 
experimental groups of 12 subjects each were estab- 
lished, representing the four possible high-low 
combinations of social desirability and mental health. 

A simple letter-digit substitution task was used in 
the experimental situation. This was patterned after 
the well-known digit-symbol subtest of the Wechsler- 
Bellevue scale. The task was explained to the sub- 
ject and he was given 10 30-second trials, his score 
(number of substitutions attempted) being reported 
to him after each trial. Following each trial, the 
subjects were asked to estimate their performance 
on the next trial. This estimate, which served as the 
level of aspiration measure, was given in numerical 
form. Thus, the experimenter would say, “You did 
of them that time. How many are you going 
to try to do this time?” In general, the subjects 
had little difficulty understanding the level of 
aspiration concept. At the conclusion of the task, 
all were told that they had done very well. 


RESULTS 


Table 1 shows mean performance, level of 
aspiration, and discrepancy scores (LA minus 
performance) for the four experimental 
groups. One notices that the high SD groups 


TABLE 1 


Mran PERFORMANCE, LEVEL oF ASPIRATION, AND 
DISCREPANCY SCORES FOR THE Four ExPERIMENTAL 
Groups over 10 Letrer-Dicir SUBSTITUTION 


TRIALS 
Mean Mean 
perform- Mean discrep 
Group ance LA ancy 
I High SD, high MH 23.35 25.82 2.47 
HERD fa B9 TM 
y SD, high i K $ 
IIT Low ig TRE 19.98 ae 


IV Low SD, low MH 
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TABLE 2 


ANALYSIS OF VARIANCE OF MEAN DISCREPANCY SCORES 
FOR THE Four EXPERIMENTAL Groups OVER 10 
Lerrer-Dicit SUBSTITUTION TRIALS 


Source SS df MS F 
Social desirability 2.30 1 2.30 44 
Mental health 55.66 1 55.66  10.68** 
Interaction 13.66 1 13.66 2.62 
Within groups 229.39 44 5.21 

Total 301.01 47 
p <01. 


tend toward superior performances, but that 
the discrepancy scores seem to reflect even 
more striking differences, particularly as re- 
lated to the mental health variable. The high 
mental health (MH) groups (I and III) 
maintained a positive discrepancy score 
throughout the series of 10 trials, although 
the Group III subjects seemed more restricted 
and stayed closer to their actual performances. 
However, the low MH groups (II and IV) 
show negative mean discrepancies. This effect 
was most noticeable in Group II which showed 
positive discrepancies on the first two trials 
followed by gradual increase in the negative 
direction. In other words the low MH sub- 
jects, by and large, tended to predict scores 
that were lower than those they had just 
achieved. 

Two separate double classification analyses 
of variance were performed on these scores. 
Table 2 depicts the analysis of discrepancy 
scores. Although these scores do not seem to 
be related to the SD variable, the MH dimen- 
sion is significant beyond the .01 level. 


TABLE 3 


ANALYSIS OF VARIANCE OF MEAN PERFORMANCE SCORES 
FOR THE Four EXPERIMENTAL Groups OVER 10 
Letrer-Dicit SUBSTITUTION TRIALS 


Source SS df MS F 
Social desirability 231.44 1 231.44 5.01* 
Mental health 11.43 1 11.43 25 
Interaction 7.86 1 7.86 17 
Within groups 2030.51 44 46.15 

Total 2281.24 47 
a b <05. 


However, in examining the performance scores 
(Table 3) something of the reverse seems to 
be true. The high SD groups tend toward 
superior letter-digit performances (p < 05); 
although there is no discernable relationship 
between performance and MH. 


Discussion 


To initiate discussion, it would seem 
most helpful to examine each of the four 
experimental groups in terms of the main 
independant variables and actual behavioral 
observations. 


Group I: High SD and High MH 


The responses of these subjects approxi- 
mated those obtained from normal subjects 19 
other studies. The LA was always positive, 
with discrepancy scores gradually decreasing 
as performance neared an asymptotic level. 
These subjects were reserved but cooperative, 
and it did not seem as if they could be 
appropriately called “symptom deniers.” They 
appeared to be individuals who had made a? 
excellent adjustment to the hospital com- 
munity, and symptoms, which are a primary 
referent for the nonhospital community, may 
not have been crucial. These seemed to be 
chronic hospital citizens who had become ad- 
justed to a simple and regimented existence: 


Group II: High SD and Low MH 


In this group, LA behavior was strikingly 
inconsistent with the picture of symptom 
denial presented on the SD scale. Many ° 
these subjects showed signs of intense anxiety: 
and some varied from extremes of near mute 
ness to almost manic speech, Although on¢ 
must interpret their high self-concept (SD. 
as a rather shaky and defensive one, their 
ability to approximate normal test response? 
allows them to function more adequately 2” 


with fewer restrictions within the hospital 
environment, 


Group III: Low SD and High MH 


These subjects were similar to those ot 
Group I, but their general LA, although 4 
ways positive, was never as high as that ° 
Group I and their predicted improveme? 
were uniformly small. Miller (1951) fon 
this reaction in his group of paranoid schi? 
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aes subjects. It appeared to the writer 

A = individuals demonstrated firmly en- 

negative self-concepts. Their inci- 

hes verbalizations contained numerous self- 

toate remarks, and although they were co- 

an they felt compelled to constantly 
e examiner of their shortcomings. 


Group IV: Low SD and Low MH 


on group was characterized by very small 
A pany scores which changed little over 
subjects o of 10 trials. Behaviorally, these 
in the wi more disturbed than those 
mute and a three groups. Many were nearly 
Eata had difficulty communicating with 
and eee They seemed easily threatened 
close iE of their illness, and were upset by 
eir nterpersonal contact with a stranger. 
aspirations varied little from what they 
e iyon on the previous trial. Although 
emed the original subjects in this group 
Were unable to understand the task and 
Cony ay replaced, the writer was 
is me that the others, although obviously 
Xperiny, 2 were responding relevantly to the 
Tead — situation. In addition, they had 
n Canin oe their ability to respond 
at ully to a personality questionnaire. 
a pg of interesting questions seem to 
Quite gy as a result of these findings. It seems 
My “lear that the degree of illness in the low 
br groups is reflected in their cautious ap- 
of actu to the experimental task. Regardless 
ang tUa] performance (on which Groups II 
Willin did differ) few of these subjects were 
Meir Mee anticipate any improvement in 
M Gro res. In fact, the tendency, especially 
as — H, was to become more pessimistic 
8eneray trials progressed even though the 
Drover, trend of performance was one of im- 
this ne ent. As has already been pointed out, 
interest discrepancy score is even more 
denig ¿8 in light of the picture of symptom 
formance eo SD) and relatively superior per- 
Ptobeq > Tt is possible that the LA task 
Where a much deeper emotional level, 
Onfray “true” self-concept was revealed 
4 sented to the rather grandiose picture 
o ites the SD scale. Although it is pos- 
Ym om ag the Group TI pattern as one 
ona nial, lack of insight, etc., there 
ibt that whatever the process might 


Se 


have been, it has at least allowed these indi- 
viduals to be more effective within the hos- 
pital setting. In some ways they seemed like 
Group I patients in that they often engaged 
in productive activity and were able to move 
about with fewer restrictions. 

Another way of interpreting the data would 
be that the low SD subjects were performing 
in tune with their own expectations, in that 
their ability at the letter-digit task was not 
equal to that of the high SD subjects. With 
no defenses to bolster their sometimes obvious 
lack of self-esteem, these individuals may 
have been relating to the experimental situa- 
tion in the one way that would have been 
consistent with their self-image. Undoubtedly, 
a major question remains as to the possibility 
of specifying more directly the relationships 
between SD and self-concept. If low SD is 
equated with a negative self-concept, why do 
Group III subjects react more like Group I 
subjects in their LA behavior, but actually 
perform less adequately on the task? More 
research is clearly needed to untangle a 
possible complex interaction between these 
factors. 

The results of this study would seem to 
have implications for psychodiagnostic test- 
ing as well as for a better understanding of 
the symptomatology of schizophrenia. While 
some would interpret these findings as indi- 
cating the doubtful validity of objective test- 
ing procedures in dealing with this type of 
patient, the writer feels that an individual’s 
ability to respond as a normal person in the 
presence of a severe psychosis is certainly a 
valuable piece of information. In many cases 
it may be better to interpret the apparent 
normality of a test response as an achieve- 
ment, rather than a denial or an indication of 
some imperfection in the test itself, Tt would 
seem, however, that in many cases this in- 
formation could not be properly evaluated in 
the absence of additional data gathered 
through the use of more penetrating tech- 
niques. The SD-LA discrepancy in Group II 
was a clear demonstration of this fact. 

There is a remarkable similarity between 
these findings and those of other investigators 
in the area of schizophrenic symptomatology 
(Guertin, 1955; Guertin & Zilaitis, 1953; 
Kleinmuntz, 1960). Since traditional nosolog- 
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ical categories fail to take normality factors 
into account, an SD-MH comparison or 
some other such breakdown might be more 
useful. Guertin’s four categories of schizo- 
phrenic behavior (socially normal, grandiose 
and deluded, evasive well-integrated, sensi- 
tive, inadequate and withdrawn) are highly 
similar to those categories developed in the 
present study. Although, the former were 
derived from MMPI responses by factor 
analytic methods. 
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A method for measuring the speed of the spiral aftereffect is described. 
Results of measurements on a group of 51 brain damaged and 37 control 
Ss are presented. Both binocular and interocular transfer conditions were 
used. It was found that brain damaged Ss varied over a much greater range 
than did controls. Also, the brain damaged S's aftereffect varied more from 


time to time. Among controls, 
be as great, if not greater, tha 
damaged Ss, 


a patterning which was significantly di 


the interocular transfer and the bin 
fierent from that of controls. Measure- 


ment of aftereffect rate by the method 
promise as an indicator of presence or absence of brain damage. 


V 
aitaa cmuth (1911), in writing on visual 
p = motion, said “. . . it is evident 
e enee aal increase or decrease in 
Some yal y of the aftereffect may afford 
Nosig» uable aid in (brain damage) diag- 
investiga? 115). This suggestion was not 
Josey ( ea directly until Freeman and 
Mi A 49) found greater memory impair- 
aftereffe patients who failed to report the 
foung Cct Later Price and Deabler (1955) 
Port at Patients with brain damage re- 
frequent] Spiral aftereffect significantly less 
Nethoq Y than control subjects. Their 
Of the an! determining presence or absence 
Spira] T aftereffect has been named the 
Methog yecretiect Test (SAET). In this 
: © elicie, weiect views a rotating spiral 
Ud y Ing stimulus). The spiral then stops 
leet as ind the subject reports the after- 
Action of T apparent expansion or con- 
s) € now stationary spiral (the test 
: This work was replicated by Page, 


and 
< (1958 
rable sh 
Tsensys 
~ Valu 


e : ; 
es rejp, S & screening device. Other 


1tetiect oe presence or absence of the 
(agate (19 brain damage are reported by 
the 8), Th 53) and Goldberg and Smith 
8 ese studies will be referred to as 

l Studies, 
Saia at 


Veterans i 
Hospital. 
orth Carolina. f 
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interocular transfer aftereffect was found to 
n the binocular aftereffect. Among brain 


ocular aftereffect displayed 


described (Metamegethograph) shows 


While the SAET is brief and easy to 
administer, it does suffer from certain dis- 
advantages. First, the test result is dependent 
on the ability of the patient to put his sub- 
jective experience into words. London and 
Bryan (1958) and Holland and Beech 
(1958) found that by using more specific 
inquiries they could obtain reports of the 
aftereffect from virtually all subjects whether 
brain damaged or not. These findings sug- 
gested that the differences between brain 
damaged patients and normals may have 
resulted from differences in the ability to 
organize verbal responses rather than dif- 
ferences in the aftereffect itself. 

A second disadvantage is that the method 
yields numerous false negatives (Gilberstadt 
et al, 1958; Stilson, Gynther, & Gertz, 
1957). An untested but possible explana- 
tion for the false negatives is that some 
brain damaged patients may have abnormally 
intense aftereffect. Since the SAET is based 
upon reported presence or absence of the 
aftereffect, the method identifies as abnormal 
only those who fail to report it. Using the 
method of Price and Deabler, patients with 
abnormally great aftereffect would be grouped 


with normals. 

A third difficulty in this area has been a 
of standardization of test conditions in 
laboratories. Indeed, there has 
thod of deciding upon 
f test conditions might 
brain impairment. Such 
of spiral, illumination, 
fixation time, texture, 


lack 
different 
existed no easy me 
which combination 0 
be most sensitive to 
ameters as type 


par ` 
f rotation, 


speed © 
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adaptation, and distance of the subject from 
the spiral have rarely been constant between 
any two experiments. Several of these para- 
meters have been shown to influence the 
aftereffect significantly (Cann, 1961; Durup, 
1928; Scott, 1962; Scott & Medlin, | 1960, 
1962; Singer, 1959; Stern, 1959; Wohlge- 
muth, 1911). 

Persistence time of the illusion has also 
been studied in connection with brain damage. 
This measure has been used by Eysenck, 
Holland, and Trouton (1957), Spivack and 
Levine (1957, 1959), and Truss and Allen 
(1959). These studies will be referred to as 
the persistence time studies. The first of 
these did not include brain damaged sub- 
jects, but both studies by Spivack and Levine 
and the study by Truss and Allen found 
significantly longer durations in the brain 
damaged subjects, Suggesting that brain 
damaged subjects have more aftereffect than 
normals. The SAET results, on the other 
hand, suggest that brain damaged subjects 
have less than the average amount of after- 
effect. The persistence time studies, then, 
would at first glance seem to contradict the 
SAET studies under the apparently reason- 
able assumption that an aftereffect which is 
stronger will also persist longer, 

Perhaps the difficulties and apparent 
contradictions mentioned above could be 
clarified by the perfection of a technique for 
measuring the speed of the aftereffect in a 
quantitative manner. Several methods for 
measuring speed are Suggested in the non- 
clinical literature. Durup (1928) used a sub- 
jective rating on a four-point scale for judging 
“intensity” of the aftereffect, Wohlgemuth 
(1911) had the subject make comparisons 
of recently experienced aftereffects with 
actual motions of visual stimuli presented 
after the aftereffect had completely dis- 
sipated. Wohlgemuth also had the subject 
make an adjustment of the test stimulus for 
measuring rotational aftereffect. He did not 
make extensive use of the method, however. 
This last method has been extended by 
Singer (1959) and Cann (1961) for measur- 
ing spiral aftereffect using an adjustable iris 
diaphram as a test stimulus. Scott and Medlin 
(1962) have developed a more refined method 
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of measurement utilizing an electronic test 
stimulus and a psychophysical method. A 
The purpose of the present study is fa 
measure the speed of the spiral aftereffect E 
an objective, quantitative manner which d 
relatively independent of verbal ability a 
which is sensitive to abnormally fast as Ta 
as abnormally slow aftereffect. Previously 
obtained psychophysical measurements wis 
used in choosing the conditions of the a 
ment so as to maximize the aftereffect. Spee 
of the aftereffect is operationally defined E 
that rate of change of size of the test stimulu: 
which exactly cancels the aftereffect, thu 
appearing subjectively of constant size. 


METHOD 
Apparatus 


A technique for making psychophysical measure 
ments of the rate of change of perceived size by 
the spiral aftereffect has been briefly described 
Scott and Medlin (1962). With the exception 0 
few minor modifications, the same method ie 
used in the present study. The apparatus, cal of 
the Metamegethograph (MMG), consists of a me 
tachistoscope which alternately presents a spiral a 
a test stimulus. In this study the spiral was Pot 
sented for 10 seconds and the test stimulus i 
2 seconds. These two stimuli were recycled pe: 
tinuously with no delay between presentation 
The spiral used in this study was a fourth aeg 
single Archimedes spiral, drawn counterclock w, 
outward with black India ink on a white ag 
disk. It was illuminated by two NE-34 neon wi 
operating at 150 volts dc. The spiral was roti 
counterclockwise at 160 rpm. The test stimt 5 
was a circular figure produced on a cathode The 
tube (5CP1) by a special signal generator.? a 
circular test stimulus always appeared with f 
initial diameter of 3 inches and then expan 


The 21 settings 
from 20% 
per second expansion, incl 
no change in circle size, test 
The rate of expansion or contraction of the ing 
stimulus was selected by a stepping relay hav st 
21 positions, corresponding to the 21 settings ibet 
described. The subject could step the relay in n of 
direction, one step at a time, by pressing either 


s veut diagr” 

>A 3-page description and one circuit diag ys, 
have been deposited with University Micro 1m 
Inc, Ann Arbor, Michigan. Order Special ive 


eda k; it 
$-294, remitting $2.75 for microfilm or pos 
copies, 


tam for 
uding a middle setting 
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S buttons. He could thus adjust the rate of 
po e of the test stimulus by fixed, equal 
e ea . An automatic graphic record was kept of 
throw eg Positions of the stepping relay 
e the experiment. The push buttons were 
the, a3 a = the stepping relay so that pressing 
ETA ae button stepped the relay in the 
die sth it on the next cycle, would produce a 
expanding | was objectively contracting more (or 
left-ha ne less) than on the preceding trial. The 
nd button had just the opposite effect. 


Procedure 


seat entering the testing room the patient was 
eyes wer, ront of the tachistoscope so that his 
landum he Š feet from the spiral. The manipu- 
Placed in wa containing the two push buttons was 
turned ar te subject’s lap. With the spiral motor 

» he was given the following instructions: 


: ei a to look at the center of this spiral. 
exact ¢ et your eyes wander away from the 
iape In a few seconds the spiral will 
e cir i and a circle will appear. Sometimes 

ee will be expanding (or getting larger), 
smaller) pees it will be contracting (or getting 
You to | Each time the circle comes on I want 
Setting ha me whether it seems to you to be 
8Ppears arger or getting smaller. If the circle 
and h to be getting larger, press the right- 
Press ey If it appears to be getting smaller, 
Buess, e left-hand button. If you are not sure, 


The 
Still aa was then darkened, and with the spiral 
Un to ionary a series of discrimination trials was 
instructio? sure that each subject understood the 
po could No subject was included in the study 
contracti d not correctly identify 4% expansion or 
10 i a Of the circle. Most subjects were able 
action minate 2% per second expansion or Con- 
a expl { the subject appeared confused his task 
= © sur ained again. Great care was taken to 
i al re that the subject understood the task and 
train: e to respond correctly. Following this 

S eem the spiral motor was started 
it On 4 S pa test was begun. 
ch Middle rst trial the relay was always set to 
ange «© Position (the setting producing zero 


ng i 
mbject m rde size). Thus, for example, if the 
poes an aftereffect on the first trial which 
th, 8s the stationary circle seem to expand, he 
this Telay to 8ht-hand button. This response steps 
w Produs, the position which will, on the next 
Der q trial 2% per second contraction. If on the 
but, Cond Se subject’s afterefiect exceeds 2% 
expo sign, E will again press the right-hand 
tea od Tiling that the circle still appears to 
Sea to the Will result in moving the stepping 
aft cont, Position which will produce 4% per 

®reffe, faction. If, for example, the subject’s 


press the left-hand button. This response will 
return the stepping relay to the 2% per second 
contract setting. The subject continues in this 
manner, thus constantly readjusting the rate of 
change of size so that it moves back and forth 
around a value at which his aftereffect exactly 
cancels the objective motion of the test stimulus. 
If the objective shrinkage of the test stimulus is 
thus canceled by the subject’s aftereffect rate he 
will experience subjective size constancy. In this 
case the subject is instructed to guess. It can be 
scen that this procedure produces a sort of track- 
ing behavior which bears some similarity to the 
classical method of limits. 

Subjects were required to verbalize their responses 
as well as to depress the buttons, and the experi- 
menter kept careful check to be sure that the 
subject’s verbal responses agreed with his button 
responses. The subject was allowed to make 75 
consecutive responses under each of three con- 
ditions. The binocular condition was one in which 
the subject viewed both the spiral and the test 
stimulus with both eyes. There were two inter- 
ocular transfer conditions during which a shutter 
was placed in front of the subject so that the 
spiral was viewed by one eye and the test stimulus 
by the other. The interocular condition in which 
the spiral was viewed by the left eye will be called 
the left-right transfer condition, the opposite case 
being the right-leit transfer condition. The order in 
which these three conditions were presented to 
each subject was varied systematically so that 
approximately equal numbers of subjects were 
run under each of the six possible orders. 

The particular values for the number of turns 
of the spiral, its speed and direction of rotation, 
the number of trials, and the timing of the ex- 
posure period and test period were chosen on the 
basis of psychophysical data which had been 
previously obtained (Scott, 1962). 


Subjects 

In all, 107 hospitalized male veterans were tested 
by the procedure outlined above. These subjects 
were selected on the basis of admitting symptoms 
which are likely to be found in patients with brain 
syndromes Or in patients with peripheral neuro- 
pathy. Of these 107 subjects, 51 were chosen for 
the brain damaged group. Before classifying a 
patient in this group, the diagnostic criteria were 
critically examined. When any reasonable question 
existed as to the presence of brain damage the 
patients were not included in this group. A control 
f 37 patients was selected. The diagnostic 


group 0 j 
data on the control subjects was also carefully 


examined to eliminate doubtful cases in which brain 
damage did not seem to be thoroughly ruled out 
on the basis of neurological and laboratory findings. 
Nineteen patients could not be satisfactorily classi- 
fied by the above criteria and consequently were 
not included in the study. Twelve of these were 
alcoholics in whom presence or absence of brain 
damage could not be definitely established. 
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RESULTS 


As mentioned above, a graphic record was 
kept of the successive positions of the step- 
ping relay. Each position of the stepping 
relay represented a particular rate of change 
of circle size. In the subsequent discussion 
an average of the 75 successive rates ob- 
tained from a subject under a given con- 
dition will be used as a measure of that 
subject’s aftereffect rate under that con- 
dition. Positive values indicate aftereffects of 
perceived expansion. Negative values in- 
dicate perceived contraction. 

Figure 1 shows the frequency distributions 
of aftereffect rate for controls (black bars) 
and brain damaged subjects (open bars). The 
results for the three conditions for each sub- 
ject were included in preparing these dis- 
tributions. Thus, the number of measures 
represented in the control group distribution 
is 3 X 37 = 111, and the number of measures 
represented in the brain damaged group dis- 
tribution is 3 x 51 = 153, Figure 1 shows 
that the aftereffect rates for the control sub- 
jects had a comparatively narrow range, while 
the range of aftereffect rates for brain dam- 
aged subjects was much greater and extended 
both above and below the control group 
distribution. The modal aftereffect rate for 


brain damaged subjects was somewhat greater 
than for controls. 
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The spiral and direction of rotation we 
would be expected to elicit aftereffect © 
perceived expansion (indicated by positive 
aftereffect rates in Figure 1). On a number 
of occasions the brain damaged subjects evi- 
dently perceived aftereffects of contraction 
(designated by negative rates in Figure 1). 
These results suggest that some of the brain 
damaged subjects occasionally experienced 
aftereffect in the same direction as the elicit- 
ing motion. Aftereffect of this kind has been 
reported by Szily (1905, 1907), Wohlge- 
muth (1911), and Fuchs (1928), although 
there has been some controversy on this point 
(Stern, 1959). 

Figure 2 compares the aftereffect rate for 
the two groups broken down also according 
to the three conditions: the left-right transfer 
condition (L), the binocular condition (B) 
and the right-left transfer condition (R): 
The white bars indicate a range of +! 
standard error. As was evident in Figure 1, 
the variance is heterogeneous, and a con- 
ventional analysis of variance would not be 
appropriate. It is apparent that the average 
aftereffect rate for the brain damaged p% 
tients was somewhat higher than for the con- 
trols, but the meaning of this difference i5 
not clear in light of the extremely big 
variability in the brain damaged group. 12 
the control group, slightly more aftereffect 
was produced by the transfer conditions 


60) 
A controls 
(N=3x37 
Brat 
> 40 228 a damaged 
2 N=3x5)) 
wW 
=I 
o 
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a 
m 
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perceived contraction 
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Fic. 1. Frequency distribution of aftereffect rate base 
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AFTEREFFECT RATE (% per sec.) 


L B R L B R 
Mean oo BRAIN DAMAGED 
SE 55 541 6.00 572 734 581 
E Re Bt gy Lo 77 124 


Fig, 

control = Aftereffect rate in brain damaged and 

interoout, ubjects shown separately for the left-right 
ar transfer condition (L), the binocular 


conditi 
dition (R) (B), and the right-left transfer con- 


tha 
fain the binocular condition; while in the 
Produced breve group more aftereffect was 
Makes by the binocular condition. Table 1 
I we a chi square test of this effect. 
lett, 3 le 1 a subject was classified in the 
tects S column if both of his transfer after- 
Subject, exceeded his binocular aftereffect. 
Vales), Whose binocular aftereffect had a 
val es termediate between the two transfer 
l bo Were classified in the second column. 
the in transfer aftereffects were less than 
Sign ered aftereffect, the subject was as- 
pera © the third column of the table. The 
ond a square of 16.20 is significant 
lence is e .001 level of confidence. The evi- 
therefore very strong that the brain 
subjects responded to the transfer 
e contro a manner different from that 
Wo rol subjects. It is remarkable that 
A bin of the 51 brain damaged subjects 
Pediat Ocular aftereffect which was inter- 
a mo the two transfer values. 
the isctepa, appears to increase markedly 
bin Banser 7 between the binocular and 
Cage ar af conditions. In some cases the 
& the o tereffect is greater, and in some 


8 

gecti of aster aftereffect is greater. In- 

ites ae diagnoses in these two sub- 
Ste a to reveal any systematic 


The data of Table 1 can also be analyzed 
for the separate groups. In the brain damaged 
group, for example, one would expect that 
17 subjects would be grouped in each column 
under the null hypothesis that the transfer 
conditions produced no differential effects. 
Stated in slightly different form, this null 
hypothesis is that the two transfer measures 
and the binocular measure were drawn 
randomly from the same population. Under 
this assumption, the probabilities of each 
of the six permutations of L>R> B are 
equal. Each column of the table represents 
a pair of these six permutations and thus 
represents a probability of 1%. The chi square 
associated with this hypothesis is 20.59 which 
is significant beyond the 001 level of con- 
fidence. A similar analysis can be performed 
for the control subjects. The chi square for 
the control group is 3.29 and has a prob- 
ability slightly less than .20, thus failing to 
provide strong statistical evidence for the 
differential effects of transfer in normals. 
The significance of the overall chi square is 
thus seen to be almost entirely attributable 
to the departure from chance expectancy of 
the brain damaged group. It is interesting to 
note that all previous investigators who have 
observed transfer effects have held that the 
magnitude of the transfer aftereffect is less 
than the binocular aftereffect. The present 
study provides the first quantitative informa- 
tion on this question and indicates that in 
normals transfer aftereffect is not smaller 
and is possibly larger than the binocular 


aftereffect. 
TABLE 1 


DISTRIBUTION OF BRAIN DAMAGED AND CONTROL PA- 
TIENTS ACCORDING TO RELATIVE MAGNITUDE OB- 
TAINED UNDER TWO TRANSFER CONDITIONS AND 
A BINOCULAR CONDITION 


B<T B~T B>T Total 


Group 
Brain damaged 22 2 27 51 
Control 17 12 8 37 
Both groups 39 14 35 88 
x? = 16.2035* 
Poren: =Binocular less than transfers; B~T =Binoc- 
Note-—B ST, inocula ranslers; BST =Binocular greater 


ular intermediate be 
than transfers. 
*p <.001. 


TABLE 2 


MEAN VARIABILITY OF AFTEREFFECT 
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L-R Transfer Binocular R-L Transfer 
condition condition condition 
Group M +1 SE M +1 SE M +1 SE 
5 58 
i 7 5 69 +2.41 17.59 +2. 
B damaged (N = 51) 23.62 +3.50 17.6 zi 
Control (= 37) 3.23 + .28 3.13 =+ .29 4.14 E 


In addition to the magnitude of the after- 
effect, the technique of this study makes 
possible the measurement of variability of 
the aftereffect within a given series of trials. 
The measure of variability used was the 
variance of the 75 rates within a condition. 
Table 2 shows the average variance obtained 
from both groups under three different con- 
ditions. Inspection of Table 2 reveals at once 
that the brain damaged subjects displayed 
considerably more variability of aftereffect 
rate than did the controls, Again, there is 
heterogeneity which prevents the use of a 
parametric statistic. Statistical tests of the 
significance of differences in Table 2 will be 
approached in the discussion of diagnostic 
utility which follows. 

On the basis of data from a smaller group 
of subjects, three criteria for diagnosing 
presence or absence of brain damage were 
set up. Criterion A classifies a subject as 
brain damaged if his aftereffect under any 
one or more of the three conditions exceeds 
a value of 10% per second. Criterion B 
classifies as brain damaged a subject who, 
at any time after the first trial, adjusts the 
stepping relay on the “wrong” side of the 


TABLE 3 


DISTRIBUTION OF BRAIN DAMAGED AND CONTROL 


zero position, i.e., in a way suggesting ara 
effect in the same direction as the elicit i 
motion. Criterion C classifies a subject k 
brain damaged if his variance under a 
condition is greater than 10. Tables Si 
5, and 6 contain contingency tables whi 
show the number of subjects correctly a 
incorrectly classified by each ont 
separately and by combining all Ri 
criteria. All chi squares are highly significa 
showing that departure from normal ar 
on any of the criteria is associated W! 
presence or absence of brain damage. 


TABLE 4 i 
~oNTRO” 
DISTRIBUTION OF BRAIN DAMAGED AND Ao A 
SUBJECTS ACCORDING TO CRITERION B: OCGURR if 
or AFTEREFFECT IN THE SAME DIRECTION 
Eticirinc MOTION 


Hit 
Group Absent Present Total rate 
Control 36 1 37 oF 
Brain damaged 21 30 51 5 
Both groups 57 31 88 5 
x? =23.17 
** p < 0005. 
TABLE 5 


DISTRIBUTION OF 


ob 
TR 
BRAIN DAMAGED AND Cor ee 
SUBJECTS ACCORDING To CRITERION A: MAGNITUDE SUBJECTS ACCORDING TO Criterion C: VARIAN’ 
Less More Less More 
than than than than 
10% 10% 10% 10% it 
per per Hit per per Bi 
Group second second Total rate Group second second Total as 
00 
7 36 1 37 97 Control 37 o 37 ae? 
Huai damaged 25 26 51 St Brain damaged 11 40 51 oe 
Both groups 61 27 88 -70 Both groups 48 40 88 8 
x? =21.279* x? =50.09%* 


** p <.0005. 


™* p < .0005. 
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TABLE 6 


Dis = 
aang OF Brain DAMAGED AND CONTROL 
Ts Accorpinc TO ALL CRITERIA COMBINED 


Gr Hit 
oup Negative Positive Total rate 

Control 
Boe 

Tain damaged 3 48 Ši 3 
3 a 

Oth groups 38 50 8s 94 

x? =65.19%* 
> < 0005, 
DISCUSSION 


top pults of the present study confirm 
and fxs findings based on persistence time 
Sistence based on the SAET. The per- 
7 ioe studies (Spivack & Levine, 
larger aai Truss & Allen, 1959) predict 
su ects ees among brain damaged 
the bras able 3 shows that roughly half of 
fect okt damaged subjects did have after- 
Y virtu S h was greater than that experienced 
e S ly all of the controls. 
Sect ig AET studies suggest that the after- 
Amageq normally slight or missing in brain 
Ptoximat Subjects. Table 4 shows that ap- 
Jets ately half of the brain damaged sub- 
Slight ad aftereffect which was either so 
Ocean Ot Of opposite direction, that they 
tte in ally, or frequently, adjusted the 
thus ara on the “wrong” side of zero, 
i Th Gee the SAET results. 
Sults q Päring MMG results with SAET 
aE thre should be remembered that there 
techni © Major differences between the two 


Non” With elicited by the SAET (Scott, 
ihe Y sh moe to subjects having ab- 
Ciggtion ght aftereffect, it may be that 
atte Decan ails in a number of cases pre- 
this fect the the MMG does elicit more 
the Possibili an does the SAET. However, 
afte, Duch ity is offset to some extent by 

"elec, Ager sample of the subject’s 
afforded by the MMG and the 


greater variability of the brain damaged 
subject. 

Thus far it has been assumed that the 
sometimes highly erratic performance of the 
brain damaged subject reflects irregular 
changes in his aftereffect rate. Another pos- 
sible interpretation is that the brain damaged 
subject becomes confused, makes “errors,” 
or perseverates. Such an interpretation could 
account for most of the findings reported 
above. There are clear reasons for doubting 
that this interpretation is correct. First, no 
subject was included in the study who failed 
to give conclusive evidence during the dis- 
crimination trials that he understood and 
could perform the task correctly. The pos- 
sibility remains, of course, that the subject 
became confused or began to perseverate 
later, during the actual test. The subject’s 
response was both verbal and motor. If the 
subject became confused, it might be ex- 
pected that his push-button response and 
his verbal response would disagree. This 
very seldom happened, and a correction was 
always made whenever it did occur. In addi- 
“there is the significant interaction of 


tion 
groups by conditions (Table 1). This inter- 
action might be interpreted to mean that the 


introduction of the shutter for the transfer 
conditions simply increased the confusion of 
the brain damaged subjects so that their 
transfer results were deviant from their 
binocular results. If this were so, the variance 
measures obtained on the transfer conditions 
for the brain damaged subjects ought to be 
significantly inflated above those obtained 
under the binocular condition. Inspection of 
Table 2 does not suggest that any significant 
inflation of variance occurred. 

The data seem to indicate, therefore, that 
aftereffect rate can be measured in brain 
damaged subjects and that the findings of the 
present study are based on perceptual impair- 
ment rather than general confusion or per- 
servation. The hit rates obtained with the 
MMG compare favorably with the highest 
hit rates reported for the SAET and are con- 
siderably higher than those obtained in the 
majority of SAET studies. 

Eysenck, Holland, and Trouton (1957) 
have recently put forward a theory based 
on the concept of reactive inhibition. Accord- 
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ing to their theory, the brain damaged pa- 
tient should have more reactive inhibition 
than the normal person. Reactive inhibition 
is supposed to conteract the aftereffect 
process. They predict, therefore, less after- 
effect and shorter persistence of aftereffect 
for brain damaged subjects. The present 
findings fail to support this prediction if 
it is to be applied to brain damage in general, 
since the average aftereffect rate for brain 
damaged subjects was actually greater than 
for normals. 

An important controversy in the area of 
motion aftereffects has always been the locus 
of the mediating structures. Wolgemuth 
(1911), for example, thought that the phe- 
nomenon was cortical, while Hunter (1914, 
1915) advanced a purely retinal theory. 
Actual measurement of speed of transfer 
effects and the discovery that they are at 
least as great as the binocular effects casts 
much doubt on theories of retinal mediation. 
Also, the greater degree of disruption of 
transfer effects in brain damaged subjects 
than in controls seems more consistent with 
theories of central mediation than with 
retinal theories. 

Methodologically speaking, 
study reiterates the superiority 
quantitative methods over more 
approaches. It illustrates the hazards in- 
herent in the popular practice of relating 
gross behavioral observations to clinical 
variables. Often, as in the present case, find- 
ings based upon gross observations may be 
misleading because the finer detail is screened 
from the investigator’s view. 

In conclusion, a word of caution seems 
appropriate. Studies like the present one 
should perhaps be thought of as exploratory 
studies aimed at improving understanding of 
brain function rather than simply attempting 
to diagnose brain damage. In fact, since the 
brain mediates behavior of all kinds, it would 
not be surprising to find that almost any 
behavioral measure discriminates between 
brain damaged and normal individuals. The 
importance of studies of this kind lies rather 
in the fact that different behavioral measures 
are affected to different degrees and in dif- 
ferent ways by damage to the central nervous 
system. Only through the systematic explora- 


the present 
of objective, 
observational 
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tion of such behavioral measures can sig- 
nificant contributions be made to the under- 
standing of brain function. Clinical a 
plication should be accompanied by cauti A 
and the realization that much further wor 
is needed in this area. 
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AN APPLICATION OF WITKIN’S HYPOTHESIS ? 


GERALD M. FISHBEIN 2 
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The problem of tying personality formulations described by Witkin to be- 
havioral correlates was investigated. The Rod and Frame Test and Embedded 


indicated that rate of 


experimental situation, 


the asthmatics does lend some credence to bot 


asthma Personality and 


Witkin, Lewis, Hertzman, Machover, 
Bretnall-Meissner, and Wapner (1954) pub- 
lished the results of a long-term study dealing 
with the relationship of Personality traits to 
perception of the vertical. Depending on the 
accuracy of perception, the subjects were 
classified along a continuum called perceptual 
dependence-independence, The extreme groups 
had projective personality examinations ad- 


ministered to them (Witkin et al., 1954, p. 
469). 


The present study was concerned with this 
problem, especially the passivity-activity 
dimension. 

It was assumed, for the Purpose of research, 
that perceptually dependent individuals “tend 
to be characterized by passivity in dealing 
with the environment.” If the assumption is 
tenable, then a psychosomatic symptom ex- 
hibited by perceptually dependent subjects, 
which owes its genesis at least in part to 
environmental stimuli, ought to be partly 
alleviated when the enviromental stimuli gen- 
erating the symptom are removed. On the 


1Based on a dissertation submitted in Partial 
fulfillment of the requirements for the Doctor of 
Philosophy Degree at the University of Denver, 
1958. 

2 Formerly, Jewish National Home for Asthmatic 
Children, Denver, Colorado. 
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other hand, independent or analytical pe 
ceptual performers who are more independen’ 
in relation to their environment, and who hav? 
the same and relatively equally severe psycho, 
somatic symptom, ought to be less ater 
when a similar change from the pathogen! 
environment is made. b 
The symptom chosen for study was proni 
chial asthma. This disease has been recog” 
nized from the time of Hippocrates as wa 
heavily influenced by emotional factors. a 
though in recent years the basic immunolog! 
mechanisms in asthma have been describe 
current medical opinion acknowledges th 
importance of Psychological factors. T” 
subjects utilized in this study were chrom 
Severe, asthmatic children in whom psyche 
logical factors were thought to be primar} 
responsible for the chronicity of their asthm® 
A further hypothesis considered was t” 
contention that mode of perceiving is a att 
character trait, and, therefore, probably ”} 
sistant to change. In the subjects studiot 
strong pressures toward emotional matt 
exist as a result of separation from the fami 7 
of group living, and Psychotherapy. The fi- 
fects of length of time in this combined me 
cal and Psychological residential treatme” 
Program were considered. tal 
The final question concerned developmen. 
changes described by Witkin, He has a 
cated that a progression from percept", 
dependency to independency, reflected 
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experimental tasks, is a part of normal 
evelopment, characterized by a predictable 
en There is reason to believe, however, 
i N children might represent a 
af cted range of the distribution of modes 
ee, ar The particular symbiotic rela- 
i between the asthmatic child and his 
(Fine often described in the literature 
Mille, Foci French & Alexander, 1941; 
fice Baruch, 1951) should predispose to 
fet ms dependency. The problem for this 
Wales ch was to determine whether the de- 
a ae changes described by Witkin for 
sample children obtained for the present 
mod » and whether the range of perceptual 
ode differs in the asthmatic as compared 


wi 
ith normal subjects. 


METHOD 


qT 
af aad and Frame and the Embedded Figures 
0 Were administered to 60 children, ages 9 to 15, 
sthmat; Patients at the Jewish National Home for 
They liad Children (JNHAC) Denver, Colorado. 
ih been removed from their home environ- 
e su an attempt to alleviate their chronic asthma. 
Selecteg 0*5 in residence from 9 to 18 months were 


In 
Selected, total group from which subjects were 
ew day 0% were free of asthma upon or within a 
Progresi after admission. In another 30% there was 
ANd fre ve lessening of asthma during the first year 
last venom from asthma during the second and 
asthma t of residence, An additional 10% lost their 
uring ae admission but the condition recurred 
Pationts < Pollen season. In still another 3% of the 
Mterfere asthma continued unchanged but did not 
eat mäst usual activities. 

aning 5% continued to have asthma un- 
to be Dri Y institutionalization. The factors thought 
ment Wi marily responsible for the patients’ improve- 
Boy, Yete the separation from their parents, the 
indepen INg experience which appears to foster 

cnce, and brief individual psychotherapy- 


e 
Afecteg, 
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The subjects were assigned to one of three groups 
depending on the severity of their conditions. 
Group I included those patients who had rapid 
change of symptoms following arrival at the Home. 
There was no asthma present except for infrequent 
attacks which were easily controllable. Group II 
included those patients whose improvement following 
arrival was slower than Group I. There was con- 
tinued asthma intensified by hay fever, pollen asthma, 
upper respiratory infection, or a combination of 
these. They occasionally required hospitalization. 
Group III comprised the most severe group; almost 
all of the children in this group were taking con- 
tinued steroid (Sterane: Pfizer & Co., New York) 
medication. Hospitalization was often required. 

Selection of the groups was made by utilizing as 
an index of severity cutoff points on a distribution 
of total required nebulizations, an aerosol therapy 
used to abort an asthmatic attack. The cutoff points 
adhered to the natural breaks in the distribution as 
closely as possible. In addition, opinions of the at- 
tending physician, based on severity, duration, and 
seasonal pattern of the attacks, and frequency of 
hospitalization were considered. The three groups 
were matched for age, sex, intelligence, and length 
of time, in months at the JNHAC. An analysis of 
variance computed for each of these variables failed 
to reveal significant difference. Table 1 presents these 
matching data. 

The Rod and Frame apparatus and the Embedded 
Figures Test used in the research were supplied by 
Witkin’s laboratories. With the exception of two 
changes, the procedure followed was that described 
by Witkin, The changes made were the elimination 
of the Rod and Frame body tilted trials, and the 
shortening of the Embedded Figures Test to 12 
figures in order to maintain the subject’s interest 
and motivation. Since each part of the Rod and 
Frame Test correlates highly with the total, the 
body erect position alone adequately reflects the 
subject’s mode of perception. A study by Jackson 
(1956) reported that the 12-figure form of the 
Embedded Figures Test correlated .99 with the full 
test in Jackson’s groups. It was, therefore, felt that 
shortening of the tests was justifiable. 

A pilot study was run to determine whether those 
children taking steroid therapy perceived the Rod 


TABLE 1 
= S GROUPS 
a Varranres Usep IN MATCHING SEVERITY 

= Length in 

months in 

a AC 

Sex Age IQ JNE. 

Grop y M F u SD M SD M SD 
k 101.94 16.51 13.11 431 
n o3 na 126) for 10172 19.12 13.69 3.66 
w E 1300 205 10755 2118 1500 525 


Echag 
ler Intelligence Scale for Children or Wechsler Bell 


evue Form I. 
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and Frame differently from those not on steroids. 
All subjects who had been previously selected and 
who had not taken steroids during their residence 
were pooled. From this group 20 subjects were 
selected using a table of random numbers. Ten sub- 
jects were then assigned at random to either an 
experimental or control group by another staff psy- 
chologist of the Home. Tabulation of the data 
revealed that the experimental and control groups 
did not difer in age, sex, intelligence, or length of 
institutionalization. im 

The experimental group received 10 milligrams of 
the steroid medication, orally administered by the 
charge nurse, in the same manner as other drugs in 
the Home. The dosage represented the mean amount 
received by members of Group III. The control 
group received a placebo. Retesting after a mean 
time of 8.1 and 8.2 days, respectively, for experi- 
mental and control groups was done only with the 
Rod and Frame. It was felt that practice effect after 
a short time on the Embedded Figures Test would 
make the results difficult to evaluate. 


RESULTS 


The results of the pilot study failed to show 
a significant effect of one administration of 
10 milligrams of steroid on Rod and Frame 
performance. The £ values between means for 
experimental and control groups before steroid 
or placebo, after steroid or placebo, and be- 
tween the means of experimental and control 
stoups were not statistically significant. 

Since it was also desired to determine 
whether continued steroid dosage has an ef- 
fect on Rod and Frame performance, a group 
of subjects from Group III was selected and 
matched on the relevant variables with the 
experimental group of the pilot study. It had 
already been observed that the administra- 
tion of 10 milligrams of steroid did not 
materially affect Rod and Frame performance. 
If Rod and Frame presteroid performance did 
not differ significantly from the performance 


TABLE 2 


EFFECT or 10 MILLIGRAMS STERANE ADMINISTRATION 
on ROD AND FRAME SCORES 


Pre-Sterane Post-Sterane 


Group N M* SD M SD 
Experimenta] 9 15.93 13.12 11.73 10.08 
Control 10 14.85 14.28 7.79 7.14 


a Means are mean error in degrees in setting the rod to the 
vertical. 
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of a matched group who regularly welg 
treated with steroid, then the probability 
would be very high that steroid medication 
continued for 1 year also has no significant 
effect on the Rod and Frame score. The $ 
of .58 for the comparison failed to reach 
significance (.60 < p < .70, df = 16). F 

Hypothesis I. The results essentially fail 4 
support the hypothesis that following remova 
from the home environment, the asthmatic 
symptoms of perceptually dependent chil- 
dren will be alleviated more rapidly than 
perceptually independent children. 

Each raw score in the Rod and Frame and 
Embedded Figures Test distribution was con- 
verted to a Z score following Witkin’s preg 
cedures. The two Z scores for each subject 
were averaged, and a rank in the total mode 
of perception distribution assigned (V = 58)- 
Since the distribution of these scores were 
skewed, the assumption of normality of the 
population was suspect. Therefore, a Kruskal- 
Wallis analysis of variance following the pt 
cedure outlined in Siegel (1956) was com 
puted. The Æ value of .029 failed to indicate 
significant differences (.80 < p < .90) in the 
obtained ranks on the mode of perception 
distribution for those subjects who were diag 
nosed as being different in their symptom 
atology. However, the mean scores for eac t 
severity group on the Embedded Figures Tes 
were in the predicted direction, On the RO 
and Frame, two of the three means were als? 
in the predicted direction, These data aE 
presented in tabular form in Table 3, For th® 
Rod and Frame, the mean represents the 
mean error in degrees in setting the rod d 
the vertical. The mean for the Embeddé 
Figures Test is the mean number of secon 
required by the subjects to locate the simpl@ 
figure in the complex one, 4 

Since the data was generally in the pe 
dicted direction and since the variability “we 
greater than expected using Witkin’s subje? 
as a frame of reference, an attempt to lowe 
the variability was made. This was done t 
analyzing means based only on the last fo" 
tials of the Rod and Frame, so that the frst 
four trials could logically be considered Pie 
tice trials for each subject. This also fa! 
to result in significance. Similarly, an attemP 
to dichotomize, instead of the initial atte™? 
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TABLE 3 


COMPARISON OF SEVERITY GROUPS ON THE ROD AND FRAME AND 
EMBEDDED FIGURES TESTS 


Change in Symptoms 


I: Rapid II: Intermediate TII: Little 
Group M SD M SD M SD 
Rod > waji 
Ebena Frame 12.88 10.70 11.39 10.45 15.14 10.00 
£ aes 143.27 30.68 136.80 85.90 128.64 82.80 
2 27 23 10 


—__ 


4 oe the severity groups also failed 
Stoups, n significant differences between the 
A pap bothosis II. This prediction stated that 
dentia] D exists between length of resi- 
pera treatment (milieu therapy) and modes 
ready ie The subjects, all of whom al- 
two be been tested, were assigned to one of 
dence ae depending on their length of resi- 
dence f e mean number of months in resi- 
Was 9 oe Group A, the more recent arrivals, 
Por ty with a standard deviation of 1.97. 
number second group, Group B, the mean 
with a of months in residence was 16.21, 
Stoups standard deviation of 1.93. The two 
ia Were matched for severity of illness, 
nd intelligence. They differed signifi- 
» however, in age. The children of 
Stoup than nere almost 1 year older as a 
aan those of Group A. 
taineq pe ois and standard deviations ob- 
Frame, % Group A were as follows: Rod and 
Figures M = 13.9; SD = 10.83; Embedded 
Means Test, M = 141.64; SD = 92.02. The 
Group 204 standard deviations obtained by 


PD Si were: Rod and Frame, M = 11.79; 
136.99. 30; Embedded Figures Test, M = 
these 4) SD = 75.23. In the evaluation of 


Pute mee an analysis of covariance was COM- 
factor X ich statistically controlled the age 
Mee. Owing the procedure described by 
: Probapi (1955). The resulting F yielded 
die, Ude ility greater than .05. It cannot be 
ee eae therefore, that there are significant 
ihe Stou S between the perceptual scores of 
0 the He oe a function of length of residence 
y = 

develo thesis IIT, This prediction was that the 

mental change described by Witkin 


obtained for the present sample. Also, that 
these asthmatic children would be signifi- 
cantly more field dependent than the normals. 

To answer these questions the subjects were 
divided into four age groups, similar in intel- 
ligence. Comparisons of means and standard 
deviations for each age group on the Rod and 
Frame and Embedded Figures Test were 
made. This analysis suggested a progressive 
decrease in the extent of field dependence 
with increasing age. The obtained data were 
then compared with data reported by Witkin 
et al. (1954, pp. 122, 124). The similarity in 
the shape of the two distributions for both the 
Rod and Frame and the Embedded Figures 
Test was striking. The distributions of the 
obtained data, compared with the normative 
data, only tended to differ significantly 
(04<p< 10) in the extent of field depend- 
ence. The asthmatic subjects were more field 


dependent. 
DISCUSSION 


While it appears that asthmatic children 
tend to be more perceptually dependent than 
nonasthmatic children, the fluctuation of their 
symptoms appears to be independent of their 
mode of perception. The hypotheses were 
based on Witkin’s description of modes of 
perceiving as a “coping mechanism.” That is, 
it represents an individual’s characteristic way 
of coping with the stress of his environment 
as well as the way in which he manages his 
“nner life.” In a somewhat similar manner, 
the case has been made for asthma as a 
coping mechanism. The psychogenic aspects 
of the disease have been described as the 
asthma-prone individual’s reaction to various 
types of environmental stress ranging from a 


58 GERALD M. FISHBEIN 


nonspecific variety to the stress resulting from 
fear of separation from a rejecting mother. 
If the coping mechanism reasoning is accurate, 
the hypothesized relationships should have 
been revealed. 

A possible explanation for the negative re- 
sults is suggested by Gibson (1952). He 
asserts that the individual differences found 
by Witkin are probably a function of the 
experimental situation, rather than demon- 
strating that individuals differ in the basic 
ways they perceive space. 

Gibson’s explanation could also be applied 
to the failure to achieve significant differences 
concerning Hypothesis II. Although it cannot 
be ruled out that no basic personality change 
has occurred during residence, there is very 
strong evidence that the children do increas- 
ingly manifest more traits of greater psycho- 
logical maturity with length of residence. 
They assume responsibilities far removed from 
any they had assumed in their home environ- 
ment. They often change from timid, whining 
individuals to independent, assertive children. 
Symptoms such as manifest anxiety, enuresis, 
and infantile speech frequently improve 
noticeably. However, the evidence that asth- 
matic children are more perceptually depend- 
ent than nonasthmatics is in agreement with 


the general personality structure drawn for 
the asthmatic. The trend toward greater 
perceptual dependence in the asthmatics 
would lend credence to Witkin’s hypotheses. 
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MANIFEST ANXIETY, SOCIAL DESIRABILITY, OR 
RESPONSE SET 


HENRY E. ADAMS anp ALBERT C. KIRBY 
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The present investigation tested Edwards’ hypothesis that the SD scale is 
not confounded with response set and determined the extent that response 


set contaminates Taylor’s MA scale. 


true and false scales for each personality 
procedures were then utilized to determine response s 
for each scale, The results indicate that the MA scale is relatively 
response set and that the SD scale is measuring 
desirability. It should not be assumed au 


items stated in a single direction is 


By reversing the items of these scales, 
scale were obtained. Statistical 
et and content scores 
free of 
both response set and social 
tomatically that a scale which has 
measuring response set or that a scale 


with items stated in both directions is not. 


ma to items of personality inven- 
Vidual’s q requently determined by the indi- 
light. Ed esire to place himself in a favorable 
Shey pe (1953, 1957) calls this tend- 
Tension Social desirability-undesirability di- 
Ocial a has constructed a scale (the 
easures tac bility scale) (SD scale) which 
Rive re gg tendency of the individual to 
Mventori y desirable response in personality 
th p a He has presented evidence that 
Other Scale correlated quite highly with 
negatip orality scales, either positively or 
sure y depending on whether the trait 
Sate is socially desirable or undesirable. 
Sonay swt Edwards has stated that all per- 
Soci Scales are contaminated with the 
as ability variable. 
1959 S been pointed out by Cronbach (1946, 
“Onto ñ at Personality inventories are also 
se ed with response sets. Two of these 
ond Sets are the tendency to acquiesce 
0 true”) or to dissent (respond 
possible re the basis of this hypothesis it is 
8 Espo at the SD scale is actually measur- 
lity M set instead of the social desir- 
e gp able since 30 of the 39 items 1m 
(957) peale are keyed “false.” Edwards 
Po Si S discussed this possibility and has 
T Sting a these two hypotheses. 
equ Ption’ Edwards seems to have made the 
ptal e all the personality scales were 
dikt bably aminated with response set. This 
Spo, enti not the case. Thus, he failed to 
between content scores and re- 


late 
Nse 
Scores of each scale. Webster 


(1958) and Helmstadler (1957) have pre- 
sented a statistical procedure for obtaining 
separate set and content components of a 
scale. The present investigation is an attempt 
to further test between the response set and 
social desirability hypotheses by using these 
statistical procedures and by reversing the 
statement of the items of the SD scale in 
order to obtain a true and false scale for social 
desirability. This procedure is described by 
Bass (1955) in his investigation of the F 
scale. Further, this procedure will also be con- 
ducted with the Manifest Anxiety scale (MA 
scale) (Taylor, 1953) in order to determine 
if the relationship between the SD scale and 
the MA scale is due to response set, social 
desirability, or an interaction of these two 
variables. 
METHOD 

Scales. The 39-item SD scale and the 50-item MA 
scale were utilized. The statement of the items in 
both scales were reversed so that items stated in 
the positive direction were stated in the negative 
direction and vice versa. A negative and a positive 
scale for Manifest Anxiety and Social Desirability 
were then constructed, making a total of four scales. 
These scales are not identical with the original MA 
or SD scale in that the original scales of both con- 
tained positive and negative items. The reversals 
were not particularly difficult to construct since the 
items in these scales are fairly concise and directly 
stated. This appeared to be particularly true of the 
MA scale. The criterion for adequate reversal of the 
meaning of an item was agreement of three judges. 

Subjects and procedures. The subjects were 22 
male and 29 female college students at Louisiana 
State University. They were given an SD scale 
(either positive or negative) and an MA scale (either 
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positive or negative) in the initial session and re- 
versed scales 2 weeks later. A counterbalanced de- 
sign was used to control effects of order of presen- 
tation of MA and SD scales and the effects of order 
of presentation of positive and negative scales. The 
male and female subjects were pooled since sex dif- 
ferences were not relevant to this study. Further, 
Edwards (1957) and Taylor (1953) both found no 
significant differences on these scales between sexes. 
Data analysis. Correlations between all scales were 
computed from the raw scores. Response set (S) and 
content scores (C) were also computed for SD and 
MA using essentially the formula described by 
Messick and Frederiksen (1958, p. 689) which is 
derived from the simplified ratio formula of Helm- 
stadler (1957). The equations are as follows: 


C = Frrr + Urine iJ 

S = Fonz — Ure = (2 — C) [2] 

Nr and Ne are numbers of items keyed true and 
false respectively. Fr is number of true items agreed 
with by the subject. Ur is number of false items 


disagreed with. The content score varies from 0 to 2 
while the set scores vary from +1 to —1. 


RESULTS AND DISCUSSION 


Table 1 shows the means and standard 
deviations for various scales. The response 
set mean for the MA scale is close to zero, 
indicating that response set in this scale ap- 
pears to be mainly random error. This does 
not seem to be true for the SD scale where 
the mean of the response set scores is much 
higher. It is also apparent that reversing the 
original scales tends to systematically increase 
the means and standard deviations of raw 
scores, 

As may be seen in Table 2, the correlation 
between the MA true and false scales is .62, 
which is significant at the .01 level; however, 
the correlation between the SD true and 


TABLE 1 


MEANS AND STANDARD DEVIATIONS For RAW CONTENT, 
AND RESPONSE SET Scores or SD AND MA SCALES 


Scales N M SD 
MA true 51 14.76 7.00 
MA false 51 19.22 8.14 
SD true 51 22.90 4.52 
SD false 51 30.08 5.57 
MA content 51 68 at 
MA set 51 —.08 A3 
SD content 51 1.33 29 
SD set 51 —.37 -22 


TABLE 2 


CorrELATIONS OF Raw Scores OF SD 
AND MA SCALES 


MA SD SD 
false true false 
MA true 62" —.63** —.73"* 
MA false = —,.52** —.s0** 
SD true — = 32* 
jg o5. 
*p <.01. 


false scale is only 32, which is significant at 
the .05 level. The correlation for the SD true 
and false scales is significantly smaller tha? 
the correlation obtained by Edwards and indi- 
cates that the SD scale is confounded w! 
response set. These results indicate that the 
SD scale may not be measuring a unidime? 
sional trait. 

The correlations between the various sD 
and MA scales are significant at the .01 eve 
and are in the direction predicted by the 
social desirability hypothesis. However, thet 
are 22 overlapping items in these scales whic 
spuriously inflates the correlations, Furthet 
these correlations are not contradictory to the 
response set hypothesis in that the IZA scales 
appear to be content scales, whereas the d 
scales appear to be measuring both set aP? — 
content. Thus the response set hypothes!§ 
could not predict the direction of the correl4” 
tion of SD set scores with the MA scales sinc? 
response set in the MA scales appears to 
largely random error. 

As may be seen in Table 3, the MA set 
scores are not significantly related to any ° 
the other variables. These results would te” 
to support the conclusion, that response set E 
the MA scale is largely random error. How 
ever, the SD set scores are significantly 1° 


* TABLE 3 


CORRELATIONS OF CONTENT AND SET SCORES 
OF THE SD AND MA SCALES 


p 
MA set SD content SD set 

MA content —.02 —.86** .66** 

MA set = .00 —.05 

SD content = = —.68** 


**p <01. 
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a 7 the SD content scores and the MA 
port See This data would tend to sup- 
SD scal conclusion that response set in the 
varianc es 1s not random error but systematic 
e. The high negative correlation be- 

tween the MA and SD c SEEN: 
ue to the and SD content is as expected, 

f the ‘hash a items. ; 

two scale ems which do not overlap in the 
Parent “ are examined it seems fairly ap- 
With res y the SD scale is contaminated 
e sa se set while the MA scale is not. 
and Biret in the IMA scale tend to be concise 
at tle the items in the SD are some- 
appears ambiguous. Ambiguity of items 
mining ‘he. be the major variable in deter- 
ality scale extent of response set in a person- 
4 scale ing As the ambiguity of the items of 
is ound the probability that a scale 
Other im ed with response set increases. An- 
ability aanl variable is the individual 
(Adam © discriminate or tolerate ambiguity 
i ‘glee Response set is likely to be 
Siven to ant factor in a scale when it is 
Ctiminate individuals whose ability to dis- 
2 con rch structure the items is minimal. 
SD sca ies usion, Edwards’ hypothesis that the 
ÎS not. 9 ROt counfounded with response set 
Pur ae ported by the present evidence. 
; Sessa response set hypothesis does not 
‘cales Sc y predict positive correlations for 
A e kaaa in the same direction and nega- 
rections ations for scales scored in different 
Scale: This prediction is true only when 
S are measuring response set. Prob- 


ably the best procedure for determining 
whether a given scale is confounded with re- 
sponse set is the statistical techniques de- 
scribed by Helmstadler (1957) and Webster 
(1958) utilizing original and reversed items. 
This technique is a direct measure of response 
set in personality scales. 
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A FURTHER APPRAISAL OF THE BODY 
BOUNDARY CONCEPT * 


SEYMOUR FISHER 


Upstate Medical Center, State University of New York 


A series of studies were reviewed which cross-validated and extended previous 
findings concerning the relationship of two measures of body image boundary 
definiteness (barrier and penctration scores) to various levels of behavior. 
Support was found for the view that the more definite an individual’s 
boundaries the more likely he is to manifest relatively higher physiological 
reactivity in body exterior as contrasted to body interior sectors. There was 
substantial evidence too that with increasing boundary definiteness there is 
greater ability to adjust adequately to disablement of one’s body, to maintain 
normal ego integration, and to be effectively communicative in small group 
settings. Finally, the rationale was discussed for interpreting the barrier and 
penetration scores within a body image framework. 


It is known that one assigns qualities (e.g., 
size, attractiveness) to one’s body in terms 
of personalized standards which bear little 
relationship to actual body characteristics 
(Cleveland, Fisher, Reitman, & Rothaus, 
1962; Fisher & Cleveland, 1958b; Lhermitte, 
1935; Secord, 1953; Witkin, Lewis, Hertz- 
man, Machover, Meissner, & Wapner, 1954). 
When an individual perceives his own body 
he seems to become uniquely ego involved 
Beloff & Beloff, 1957; Wolff, 1943) and he 
introduces systematic biases which may reveal 
a good deal about him. The term “body im- 
age” has been adopted to designate the atti- 
tudinal framework which defines the indi- 
vidual’s long-term concept of his body and 
also influences his perception of it. Investi- 
gators have explored many body image di- 
mensions with the intent of predicting from 
them to other behavioral variables. 

Fisher and Cleveland (1958b) proposed that 
a fundamental aspect of the body image has 
to do with the manner in which an individual 
perceives his body boundaries. It was sug- 
gested that there is variation in how definite 
or firm one perceives one’s body boundaries 
to be. Thus, the individual may view his body 
as clearly and sharply bounded, with a high 
degree of differentiation from non-self objects. 
But contrastingly, he may regard his body as 
lacking demarcation from what is “out there.” 
The view was taken that the process of 


1 This review was partially supported by United 
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learning to separate one’s body from n of 


virons is fundamental in the establishme” 
identity and that therefore the characte! p 
the body image boundary should provide w 
portant information about adjustment stra” 
gies. In translating the concept of body me 
boundaries into operational terms, Fisher of 
Cleveland developed a method which involi 
scoring the properties of the boundary rege j. 
of percepts elicited by ink blot stimt 
Boundary definiteness was found to be equi” ê; 
lent to the degree to which definite struct!” d 
substance, and surface qualities were assig” Z 
to the periphery of ink blot images. Respon? : 
such as the following were considered to €P"_. 
sent an expression of definite boundary 
cave with rocky walls, man in armor, any 
with striped skin, turtle with shell, me 
wrapped up, woman in fancy costume. — ta 

percepts in which the boundary is positiv se 
highlighted in some way are labeled “Þar” 


go 
responses.” A second boundary index 


t 


3 J0 
also formulated which concerns ink Pa 
percepts emphasizing the weakness, 1a¢ pi 


substance, and penetrability of persons | js 
objects. The term “penetration response wi 
applied to them, and some examples follo 
mashed bug, person bleeding, broken 
torn coat, body seen through a fluoros® 
The barrier and penetration scores are 
significantly correlated with the total 1U™ 
of responses given by the subject to 4 Ti 
ink blot stimuli (e.g., Rorschach or Holt ost 
blots). Therefore, it is necessary to Tê 


| 
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oa subjects produce a uniform number 
abiy fe for each blot. The interscorer 
on Da for evaluating barrier and penetra- 
bose n TA varies from .82 to .97, with 
= ues clustering in the high .80s and 
mii Th (Fisher & Cleveland, 1958b; Holtz- 
ner ares Swartz, & Herron, 1961, Maus- 
Ta ;* Ramer, 1961). 
sonality penk entitled Body Image and Per- 
at the es her and Cleveland demonstrated 
Predict a tidy image boundary scores could 
Prisingly %4 riety of behaviors. Rather sur- 
cantly Tak ese scores proved to be signifi- 
g Seen with such a range of phenomena 
in body an of body sensations: differences 
reactivity. erior versus interior physiological 
Small grou psychopathology; and conduct in 
mage Pe situations. Since 1958 when Body 
any studi Personality was first published 
origina] à D3 have been undertaken by the 
validate Ae and also others to cross- 
Dtaineq "Th extend the findings that were 
arize and le present paper seeks to sum- 
circums integrate these studies. A series 
Se P cribed topical areas will be pre- 
, Past findings pertinent to each will be 


Wr n followed by accounts of more 


Psy 
YCHOPHYSIOLOGICAL PATTERNS 


arti 
among ae physiological patterns were 
nitenesg first correlates of boundary defi- 
Noted th © be observed, It had been initially 
Neuro fs ar Patients with rheumatoid arthritis, 
PVolving aatis and conversion symptoms 
È ig oo musculature were characterized 
han >t barrier and lower penetration scores 


., Pat : E 
Colitis Ents with stomach ulcers or spastic 


act “ih Such findings the notion evolved 
tens Dsyck with definite boundaries who 
ae d i Aosomatic symptoms under stress 
deg, i ti in the exterior body layers (viz. 
in th jee whereas persons with in- 
Ta ite mies manifest such symptoms 
ten the ony regions (viz., stomach, 
in th Mod i internal organs). This exterior- 

noel was later extended to persons 
3. Ma mal range by findings which indi- 
Nad meet unpublished progress report entitled 
2835 Mal Tn Studies of Social Interaction,” 1961, 


si 
itute of Mental Health, Grant No. M- 
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cated that normal subjects with definite 
boundaries manifest relatively high reactivity 
in the muscles and skin (e.g., in terms of GSR 
and EMG) and low reactivity at interior sites 
(exemplified by heart rate); but with just 
the obverse pattern appearing for those with 
indefinite boundaries. 

Since the first presentation of these formu- 
lations a variety of studies has been under- 
taken to evaluate them further. Cleveland and 
Fisher (1960) and Fisher and Cleveland 
(1960) reported a replication of the original 
differences they observed in barrier and pene- 
tration scores between patients with rheuma- 
toid arthritis and patients with stomach 
ulcers. Arthritics (V = 26) exceeded patients 
with stomach ulcers (V = 34) in barrier re- 
sponses (.001 level) and in turn were ex- 
ceeded by them in penetration responses 
(.10-.05 level). It was also found that the 
arthritics were characterized by a significantly 
lower heart rate (interior reactivity) and 
higher number of GSR responses (exterior 
reactivity) than the ulcer patients under 
stress conditions (both at 001 level). 

Williams (1962) evaluated arthritic 
(N = 20) and ulcer patients (V = 20) and 
reaffirmed that the former had higher barrier 
(.10-.05 level) and lower penetration (.001 
level) scores than the latter. In addition, he 
found significant trends for heart rate to be 
higher, and muscle potential lower, in ulcer 
as compared to arthritic patients under certain 
conditions. However, predicted differences in 
GSR were not observed. 

Fitzgerald (1961) investigated children 
with Legg-Calvé-Perthes (LCP) disease. This 
& disease,” whose cause is unknown, results in 
damage to the hip joint which is perhaps 
analogous to some phenomena encountered in 
rheumatoid arthritis. Fitzgerald examined 
LCP from the perspective of the body image 
boundary theory. He compared 20 children 
with LCP to 15 controls. As hypothesized, 
the LCP children proved to have significantly 
ndary definiteness than the con- 
the LCP subjects as compared to 
ects were significantly more 
e and displayed relatively 
than verbal tests 


higher bou 
trols. Also, 
the control subj 
motorically expressiv 
greater skill in performance 
(WISC). 


Eigenbrode and Shipman (1960) are the 
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only investigators to date who have failed 
completely to replicate the boundary distinc- 
tion between psychosomatic patients with 
interior as opposed to exterior symptoms. 
They extracted from their clinical files the 
Rorschach protocols of 54 patients with 
“psychosomatic skin disorders” (exterior) 
and 29 patients with internal disorders (e.g., 
stomach ulcer, genito-urinary disease). Their 
scorings for the barrier and penetration 
variables revealed only chance differences 
between the two groups. 
Malev (1961) extended the exploration of 
the exterior-interior hypothesis to normal 
children. His design involved 30 male 6-year- 
old and 30 male 8-year-old subjects. The 
subjects’ mothers were interviewed to as- 
certain the frequency with which each sub- 
ject had been characterized by exterior versus 
interior symptoms. In addition, GSR, heart 
rate, and blood pressure were recorded from 
the subjects under conditions of rest and 
stress. The data indicated that at both ages 
6 and 8 the greater the boundary definite- 
ness of the subjects the more likely they 
were to manifest a significant predominance 
of exterior over interior symptoms. Analysis 
of the physiological data demonstrated that 
in the 6-year-old group heart rate was signifi- 
cantly negatively related to boundary definite- 
ness, but GSR bore only a chance relationship 
to it. In the 8-year-old group, all the physio- 
logical measures proved to be positively cor- 
related with boundary definiteness. This pat- 
tern differs from that characterizing adults; 
and it led Malev to question whether the 
physiological correlates of boundary definite- 
ness may not systematically differ at certain 
age levels from those found in adults. How- 
ever, he did point out further that the symp- 
tom patterns reported by mothers in both 
ó- and 8-year-olds are correlated with the 
barrier and penetration scores in the same di- 
rections as they have been found to be in 
adults. He speculated on this basis that 
symptom reports by mothers about their chil- 
dren may be better indicators of long-term 
autonomic patterning than brief samples of 
physiological reactivity obtained in artificial 
laboratory situations. 
Brown (1959) applied the barrier and 
penetration indices to the discrimination of 


20 college students with contact dermatitis 
from 20 controls. He found a borderline 
tendency (.20-.10 level) for the derma- 
titis cases to have higher barrier and lower 
penetration scores than the controls. He 
noted that the experimental and control 
groups were “impure” insofar as the former 
probably contained people “exposed to power- 
ful allergies” and the latter “young adults | 
who may yet develop contact dermatitis. 
Cleveland and Johnson (in press) com- 
pared the boundary attributes of 25 young 
men with coronary disease with a matche 
group of 25 men awaiting surgery. They 
discerned no differences in barrier scores 
between the groups. However, penetration 
responses were significantly higher in the 
coronary than in the presurgical patients. , 
Davis (1960) investigated reactivity ” 
25 men with unusually high barrier score’ 
and 25 men with unusually low scores. 
obtained electromyograph, skin resistanc® 
blood pressure, and ballistocardiograp 
measures under rest and stress conditions 
There were four differences during rest wh 
were in the predicted direction but only E 
(EMG, stroke volume) were statistical 
significant. An analysis of changes in H 
sponsivity from rest to stress indicated he 
as predicted, EMG was significantly great? 
in the high than the low barrier group an 
conversely that heart rate, stroke volur 
and total cardiac output were of lath 
magnitude among low than high barrier sv) 
jects. A borderline tendency (.06 levied 
appeared for total peripheral resistance b 
be greater in high than low barrier si 
jects. Although no predictions had been m4 d 
about blood pressure, it was found that du 
ing both rest and stress mean systolic blosa 
pressure was more elevated in the low, i 
high barrier groups. David considered his ” t 
sults to be supportive of the hypothesis | er 
high and low barrier persons react difer 
entially at exterior and interior body a 
Fisher (1959b) sought to replicate Dele 
findings in a population which was quite jec” 
ferent in sex, age, and barrier score $€% 
tivity. Thirty girls with a median age ° js 
were studied whose barrier scores were 
tributed over the entire range rather 
representing extremes. GSR was take? 
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2 poa response index and heart rate as 
iri S index. Recordings were secured 
me p rest and stress. The barrier score 
related ka be significantly positively cor- 
frelaté à 2 GSR frequency and negatively 
Aer > with heart rate during stress. 
Positive ese conditions only a borderline 
ine oe (.10-.05 level) between 
appeared, score and GSR_ frequency 
a Eprprits at this point to refer to 
1950) ormulation (in Rubinstein & Parloff, 


that i 
ski á y% i 
n conductance increase is excitatory, 


Whereas ; 
Tansaction ee of cardiac rate is inhibitory of . . - 
Ne patter the organism with the environment. 
ski resista, of response obtained when recording 
ions en tk and heart rate may reveal | oc- 
nt and Sealy individual is “open” to his environ- 
individu y to react to it, or conversely, when 
Mentally « tal is not “open” and indeed, instru- 
rejects’ the- environment (p. 205). 


Lace 
Exper ae eeSted on the basis of a review of 
(1957) nts by Darrow (1929), Davis 
lotr, fos Lacey (in Rubinstein & Par- 
and ree ) that an attitude of being open 
t incr €ptive to the world is accompanied 
tion in €ased skin conductance and decelera- 
‘YDical eart rate, with the converse pattern 
This a a closed unreceptive orientation. 
the inside ation is obviously analogous to 
tasting i “Outside model which proposes con- 
fate as aie of skin conductance and heart 
me ana olary definiteness varies. Indeed, 
Be toner y becomes even more precise if 
qrison Te that the definite boundary 
EN Cond aracterized by tendencies to high 
cn ou uctance and low heart rate) has 
sive p be be more “open” and less 
ee the fa dealing with the environment 
Cleve definite boundary person (Fisher 
S The i 1958b). 
ction 2OY Of the material presented in this 


c 
Op Pt a Dad supportive of the boundary 
dy, Yo end versus interior reactivity. 
Heat shalt & Shipman, 1960) of 
bestt ate udies contradicted this model. 
Eten m nd muscle potential have fared 
hay, Nor teaei es, respectively, of interior and 
teag Pe ninety: GSR and skin resistance 
Wit consistent as indices of exterior 


and need to be evaluated further. 


Also, in two exploratory studies blood pres- 
sure has proved to be significantly but vari- 
ably linked with boundary definiteness (viz., 
positively in children and negatively in 
adults). 

RESPONSE TO STRESS 


The possession of definite boundaries was 
shown in Body Image and Personality 
(Fisher & Cleveland, 1958b) to permit the 
individual to deal relatively efficiently with 
stress. There was particular evidence that 
reaction to the stress of body disablement 
was likely to be less severe in the definite 
than indefinite bounded individual. For 
example, adequacy of adjustment to polio 
disablement and also amputation were posi- 
tively correlated with the barrier score. 
Landau (1960) has provided support for 
these findings. She studied 40 paraplegic men 
with spinal cord injuries. Their adjustment 
to their disablement was evaluated by ob- 
servational ratings and by means of a 
sentence completion test. It was found that 
the higher a patient’s barrier score the better 
was his adjustment as defined by behavioral 
ratings (.01 level) and sentence completion 
responses (.01 level). The penetration score 
had only a chance relationship to the criteria. 
Landau noted that the barrier score was not 
related to the duration of time the patient 
had been disabled. This is congruent with 
earlier studies (Fisher & Cleveland, 1958b; 
Ware, Fisher, & Cleveland, 1957) in which 
the barrier score seemed not to be influenced 
either by the amount or duration of damage 
sustained by the body. Indeed, more recently 
Fisher (1959a) found that the decline in 
physique accompanying advanced aging 
does: not result in a decrease in barrier 


responses. 

McConnell and Daston (1961) considered 
the responses of 28 women to the stress 
heir own pregnancies. Subjects 
and _postdelivery. Initially, 
each was given the Rorschach; the Osgood 
Semantic Differential to be applied to her 
own body; and a structured interview. Post- 
delivery, only the Rorschach and the Osgood 
Semantic Differential were repeated. The 
favorableness with which the subjects viewed 
their pregnancies turned out to be positively 


occasioned by t 
were seen pre- 
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manipulated by the experimenter so as to 
contradict the subject’s reports. Each sub- 
ject’s responses to the contradictions were 
evaluated. It was found that the higher the 
barrier score the greater the likelihood that 
the subject would consider his own judg- 
ments, rather than his partner’s, to be cor- 
rect. Also, the barrier score was positively 
linked with the degree that the subject felt 
satisfied with his own performance and 
negatively related to his level of anxiety. 
Incidentally, the barrier score was signifi- 
cantly and negatively related to the L (Lie) 
score on the MMPI. This suggested that 
the individual with definite boundaries has 
less need to present an exaggerated favor- 
able picture of himself than does one who is 
vaguely bounded. There was no relation be- 
tween the barrier score and degree of yield- 
ing behavior. Mausner ? concluded: “To sum- 
marize, this preliminary analysis tends to con- 
firm the generalized personality description 
derived from the original Fisher and Cleve- 
land work” (p. 124). 
Ramer (1961) hypothesized that the high 
barrier person would exceed the low barrier 
person: 


> 1. In initiating interpersonal communica- 
tions. 

2. In communicating committal, directive, 
and disagreeing statements rather than self- 
depreciating ones, 

He studied 96 female subjects (in groups of 
four) ina setting in which each was isolated 
from the others. Instructions were given to 
write a story about several Pictures and then 
to communicate with a fictitious partner 
about the story by writing messages which 
would presumably be delivered by the ex- 
perimenter. One-third of the subjects were 
given no responses to the messages they 
sent; another third were given unfriendly 
replies from the fictitious partner; and the 
remaining third were given friendly replies. 
In this setting high barrier subjects sent 
more messages (.01 level) and more units of 
communication (.05 level) than the low bar- 
rier subjects. A significant difference in the 
same direction occurred with respect to num- 
ber of committal messages as opposed to those 
asking for orientation. There were nonsig- 
nificant but consistent tendencies for the 
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barrier score to be negatively related to such 
variables as number of self-depreciating E 
passive accepting messages but positively 
with the number of direction giving messages. 
Borderline evidence was obtained that the 
“barrier style of behavior” was most eviden 
under threat conditions. Ramer considere 
most of his hypotheses to have been con- 
firmed or at least supported by the results. 
Differences in degree of communicativi 
expression in group situations between hig 
and low barrier subjects has been observ 
by Hornstra and McPartland’ to hold pE 
also in a schizophrenic population. They 1° 


RAR naton ai 
port that there are significant distinctions ™" 


the ward behavior of schizophrenic patients 
with relatively high and low barrier score 
Patients with relatively high scores are rata 
by ward personnel as displaying behiai 
which is restless, acting out, and initiatory 
interaction. This is in contrast to patien 
with lower barrier scores who are depicted *; 
showing “retarded and withdrawn behavior: 
The studies just summarized affirm 
boundary characteristics play a meaning 
part in an individual’s group behavior. 
more clearly articulated an individu A 
boundaries the greater the probability t 
he will seek to communicate with other gro 


fu! 
h 
als 


members and also that his communicati? 
will be direct and active rather than pasa 
or self-depreciatory. With regard to the ™ 
ter of suggestibility, it should be noted w 
the Cleveland and Morton results were SW 
Portive, and those of Mausner not SuPer 
tive, of previous findings. Whether this di 
ence is a function of the unlike methods " 
to measure suggestibility remains to be 5° 
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rê 
The barrier and penetration scores vic 
earlier found to discriminate schizopht? ot 
from nonschizophrenic subjects. They di tice 
distinguish between normals and neuro ie 
but roughly separated normals and neu 2i 
(high barrier, low penetration) from Sjon) 
phrenics (low barrier, high penetra” ine 
(Fisher & Cleveland, 1958b). Of cours® 


opto 
TR. K. Hornstra and T. S. McPartland, B P 


lished progress report entitled “The Relation tio" 
havioral Constellations to Drug Use,” 1961; 3308: 
Institute of Mental Health, Grant No. MY 
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ne of vague body boundaries with 
Marked oe had for some time been re- 
Schilder, ee clinical observers (e-g., 
anes (Holtzman et al., 1961) applied 
groups oa series to normal and pathological 
Was higher observed that the barrier score 
in normal T the penetration score lower 
OWever S than in chronic schizophrenics. 
Was the. wae in the case of penetration 
n additi ifference significant (.01 level). 
omtelations he factor analyzed the inter- 
Penete of 23 indices (including barrier 
est in 1 oe derived from his ink blot 
iscovereq ies samples of subjects. He 
aded high hat the barrier score consistently 
With Ra on a factor which he associates 
Wareness of organized, ideational activity, 
è ie eatoni concepts” (p. 171). 
Mts raara fen loaded high on several 
ticularly lated to disturbance. It was par- 
Maturit identifed with indicators of im- 
thopatk o bodily preoccupation, and psy- 
the barri ogy. These factor analytic results tie 
Netration Score to ego integration and the 
levela Score to maladjustment. 
Woulg nd (1960) inquired whether there 
Schizophr, changes in the boundary scores as 
Mhase af as recovered from the acute 
els chizo i disorganization. Twenty-five 
ducting re Tenics were evaluated upon first 
ensiona] hospital with the Lorr Multi- 
1s. The Scale for Rating Psychiatric Pa- 
— evaluated again after 5 and 
AS ent involving tranquilizers. 
a rion of the patient’s response to 
f E whether he attained sufficient 
B ere ne the hospital. Holtzman Ink 
nif Weeks afk ministered predrug, and 5 and 
me idt r er treatment had begun. A sig- 
tht i © of .60 was found between decre- 
he + ™ penetrati ound betwe £ 
fto Lorr mo ee i scores and decrement 1M 
The t e oS idity rating during the period 
ho na of treatment to the fifth week. 
ine the same relationship from 
here coe to the thirteenth week was 
®vel) fa also a significant trend (.05- 
t patients judged capable of 
hospital to have declined in 
Soaps contrasted with nondis- 
Š s who tended to increase their 
Cores, The barrier score failed 
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to be related to any of the criteria of pa- 
tient change. 

A second phase of this study concerned 
45 schizophrenics who had been administered 
the Rorschach on admission and again upon 
leaving the hospital. Each patient was psy- 
chiatrically rated upon admission and again 
at time of discharge. It was established that 
patients rated as, improved or markedly im- 
proved showed a significant decline in pene- 
tration (.01 level). For barrier the only sig- 
nificant change was an increase from first 
to second testing in the markedly improved 
group. Cleveland (1960) considered that his 
results demonstrated in the recovering schizo- 
phrenic patient a “dramatic firming up and 
defining of the body image boundary” (pp. 


259-260). 
Pankow (in Burton, 1961) observed in 
: with schizo- 


her psychotherapeutic work 
phrenics that their art productions are char- 
acterized by disrupted boundaries. She spe- 
cifically compared this phenomenon to the 
boundary indefiniteness depicted by low bar- 
rier and high penetration responses char- 
acteristic of schizophrenic populations. 
Reitman (1962) studied the body image 
changes in neurotics and schizophrenics fol- 
lowing sensory isolation. Twenty neurotics 
and 20 schizophrenics were exposed to sen- 
sory isolation conditions. Holtzman Ink Blot 
Tests, measures of tactile sensitivity, and 
estimates of body size were obtained before 
and after isolation. A control group received 
the pre- and postbattery of tests, but with 
isolation not intervening. No changes Oc- 
curred in the scores of this group from pre- 
to postevaluation. However, there were sig- 
nificant changes in the experimental groups. 
The neurotics manifested decreased barrier 
and increased penetration scores following 
isolation.® The schizophrenics, contrastingly, 
obtained higher barrier and lower penetra- 
tion scores. Tt was speculated that sensory 
isolation by minimizing stimulating input has 
a disruptive effect upon nonpsychotics which 
8 Using an entirely different methodology, Cam- 
pareri (1958) found that nonpsychotic subjects ex- 
posed to sensory isolation ina swimming tank made 
references tO loss of body boundaries (e.g, “I had 


a difficult time after awhile in the tank distinguish- 
ing where I left off and the surrounding water 


began.”). 
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decreases boundary definiteness. But in the 
case of schizophrenics the sensory isolation 
seemed not to be disruptive. It seemed to 
provide a nonthreatening pattern of stimuli 
which fostered reorganization and more 
realistic body boundaries. This latter finding 
was in keeping with earlier reports (Azima & 
Cramer, 1956; Gibby, Adams, & Carrera, 
1960) concerning the therapeutic effects of 
isolation on schizophrenic symptomatology. 
It is important to note that the results from 
the tactile threshold and body size estimate 
tasks were similarly reversed for the two 
experimental groups and in a direction 
congruent with the concept that isolation 
produces boundary alteration. Thus, the 
schizophrenics showed increased tactile sensi- 
tivity and a decreased concept of body size 
following isolation; but the nonpsychotics 
showed no change in tactile sensitivity and 
an increased concept of body size. There are 
previous studies by Wapner (1958) which 
suggest that the pattern of changes with re- 
gard to body size estimates in the schizo- 
phrenic group are related to increased aware- 
ness of the body periphery and those in the 
nonpsychotic group with lessened awareness 
of the periphery. It should be added that in 
estimating the sizes of non-self objects (e.g., 
baseball) the two groups did not shift their 
judgments from test to retest. Only judg- 
ments with regard to one’s own body were 
sensitive to the sensory isolation effects. 

In the area of psychopathology the 
boundary scores seem to distinguish grossly 
between schizophrenics and nonschizophren- 
ics. Furthermore, there is evidence that as 
an individual reorganizes following schizo- 
phrenic breakdown his penetration score de- 
clines. The barrier score is less successful in 


9 The potential of the boundary scores for meas- 
uring other kinds of maladjustment aside from that 
in the category of psychosis is pointed up by Miner 
and De Vos (1960) finding that urban Arabs who 
are conflicted about their identity produce signifi- 
cantly higher penetration scores (.05 level) than oasis 
Arabs who have not yet been exposed to such ex- 
treme identity conflict. This, by the way, is con- 
firmatory of previous data (Fisher & Cleveland, 
1958b) which demonstrated that Japanese-American 
men who were struggling to adapt to United States 
life had less definite boundaries than native Japanese 
not beset by such identity problems. 


this respect. Cleveland reported that it in- 
creased as recovery proceeded in one category 
of patients, but that it otherwise failed to 
correlate with the reorganization process. 
Within the context of the changes produced 
by sensory isolation one finds that the bar- 
rier and penetration scores vary in opposite 
directions in schizophrenics and nonschizo- 
phrenics. Schizophrenics react to isolation 
with an increase in barrier and a decrease D 
penetration scores, with the converse holding 
true for the neurotics. In the first instance 
isolation seems to decrease definiteness a 
body image boundaries and in the second ! 
apparently helps to re-establish them. 


DEVELOPMENTAL ASPECTS 


There have been scattered indications that 
aspects of the developmental process (ast 
& Cleveland, 1958b; Malev, 1961) might P 
related to boundary attributes. i, 

Fish (1960) scanned the boundary © 
relates of several developmental paramete™ 
in children. She applied a multiple-choi 
version ?° of the usual Rorschach techniq" 
to measure barrier and penetration responsi 
in boys at ages 7 (N = 21), 9 (V=2 a 
and 11(N = 25). She also secured measure 
tapping such diverse variables as concept , A 
time, resistance to perceiving aniseikonic } 3 
duced distortions in one’s mirror image, ou 
cept of one’s height, and ability to define t 
adult role. As in a previous study (Fisher i 
Cleveland, 1958b), there were no indicati? 
of progressive change in boundary ant 
with age. It was ascertained, though, eA 
the barrier score was positively (.05 lever 
correlated with ability to represent 4 150: 
qualities in figure drawings at age 7. A jp 
the barrier score was positively correlated p 

- (.01 level) and 9-year-olds (.09 ev Fe 
with a more “mature” mode of time per ie 
tive that involves the perception of fu 

adult events as distant from, rather t as 
close to, the present. The barrier score “ ht 
not related to judgments of one’s own be 
or to degree of distortion in self-imas® 


ya" 
10 There was no attempt to determine the compage” 
bility of the multiple-choice method to the spon riet 
ous response procedure usually used to obtain 
and penetration scores, 
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Salad aniseikonic lenses. In general, it 
ited mere that despite the use of an 
iia od of barrier measurement, the 
aoe results which were obtained in 
caton to the barrier score linked it posi- 


tiv : agit et 
ely with certain indices of developmental 
Maturity, 


D á 
ISCUSSION OF EMPIRICAL FINDINGS 


oe ae published since 1958 with 
Versatilit, he boundary scores point up their 
u a Re has been moderately good 
Vatiations on of the fact that boundary 

S are accompanied by certain pat- 


erns Pye i 
terms a physiological reactivity. Both in 
Ures of Psychosomatic symptoms and meas- 


emy pettonomic response the bulk of the 
t Ported data indicate that the higher 
me of boundary definiteness the 
th atid y tendency to channel excitation to 
£0 do so muscle and the less the tendency 
and heart at interior sites like the stomach 
adult These findings have appeared in 
Wilizine ae and children and also in studies 
tion, i different designs and instrumenta- 
temajn re true that certain inconsistencies 
{logs GSP be explained. For example, why 
somi _ sometimes correlate with barrier 
ĉuton ee imes not? Also, why do given 
fssure © Measures (e.g, GSR and blood 
Correlati differ in the direction of their 
3 opp On with the barrier score in adults 
lar e Osed to children? Of course, the even 
i he remains of clarifying the mechan- 
Actiyi Neteby body attitudes and body re- 
other, ”) Patterns become linked with each 
Pottance Owever, what is of particular im- 
“SPonsiys is that patterns of autonomic 
a eae have been demonstrated which 
ork rg within a body image frame- 
sy’ Dtion 1 apparent ties with the 
Path al autonomic categories (€-8+ 
etic ; 4 
enst: C ANd parasympathetic). Lacy (in 
tef, Stein 
3 Parloff, 1959) has already 


eq to m Š 
Con, ies the inadequacy of the conventional 
lees 


Steate 


T Xity of A 
e bou autonomic response patterns. 


effectively with difficult, disturbing experi- 
ences (especially those involving body dis- 
ablement). Even the findings about small 
group behavior have similar connotations 
with regard to adjustment effectiveness; for 
they portray the high barrier person as 
coping with group interrelationships in a 
more independent, realistic, and yet group 
integrative fashion than does one who is low 
barrier. In addition, of course, the results 
from the group studies point up that the 
way in which an individual delineates his 
body limits in relation to others plays a 
noteworthy role in his group conduct. It is 
still an unsettled matter as to which aspects 
of group interaction are most directly related 
to the boundary parameter. However, the 
trends cited suggest that high and low bar- 
rier subjects are best distinguished in their 
group behavior with respect to how much 
each tries to communicate with others for 
mutual facilitation and stimulation rather 
than for self-centered purposes. 

An unexpected fact that has emerged from 
the newer studies is that the boundary scores 
are indicators of certain kinds of change in 
the individual. Cleveland (1960) detected 
shifts in the penetration score as schizo- 
phrenics recovered from disorganization. Mc- 
Connell and Daston (1961) recorded mean~ 
ingful changes in the penetration scores of 
pregnant women from pre- to postdelivery- 
period. Reitman (1962) discovered the 
exciting fact that both barrier and penetra- 
tion scores are altered in neurotics and 
schizophrenics during sensory isolation. Fur- 
thermore, these alterations are accompanied 
by equivalent changes in light touch thresh- 
old and concept of one’s body size. Here one 
sees boundary score changes correlated with 
such widely different phenomena as person- 
ality reorganization, the completion of preg- 
and the impact of decreased sensory 
experience. Apparently, fluctuations in bound- 
ary attributes do offer some promise as in- 
dicators of certain modifications of the in- 
dividual. At this point one can only con- 
jecture whether boundary fluctations repre- 
sent initiating forces in change processes or 
whether they are subsidiary effects. 

It should be specified that the penetration 
e has been the index primarily correlated 


nancy, 


scor 
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with change phenomena. The barrier score 
was consistently related only to changes 
produced by sensory isolation. This tends to 
be congruent with results from earlier studies 
(Fisher, 1959a; Fisher & Cleveland, 1958b) 
which indicated that the barrier score was 
largely a measure of persisting attitudes 
rather than of short-term variations in state. 
The penetration score, by contrast, seems to 
be more sensitive to immediate situational 
conditions. However, this differentiation 
can only be considered to be a tentative one 
which waits further confirmation. 

Actually, the relationships of the bar- 
rier and penetration scores still remain to 
be worked out in detail. These relationships 
fluctuate in different populations (although 
often low and tending in a negative direc- 
tion). The relative usefulness of the two 
scores is an open question and admittedly 
has been neglected as a matter of inquiry. 
Beginning attempts to combine them into 
one index of boundary definiteness have been 
made with fair success (eg., Fitzgerald, 
1961; Malev, 1961). 

In general, one seems justified in saying 
that the studies which are reviewed are sup- 
portive of the major past findings concerning 


body image boundary definiteness which 
have been reported. 


INTERPRETATION OF BOUNDARY SCORES 


It would be well in closing this overview 
to consider a general question which has been 
raised concerning the interpretation of the 
boundary scores. Wylie (1961), Mednick 
(1959) and others have criticized the as- 
sumption that barrier and penetration scores 
represent measures of body image, as such. 
They are inclined to view these scores as 
more indicative of cognitive or perceptual 
operations than of anything derived from 


11 The sensitive response of the penetration score 
to variations in state is further exemplified in a 
report by Herron (1962). He found that when the 
Holtzman blots were administered under neutral 
conditions and under conditions designed to arouse 
achievement-motivation, the penetration score was 
the most reflective, among a variety of other indices, 
of the achievement condition. It was significantly 
lower in the achievement oriented than in the neutral 
state. 
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body experience. They imply that the ob- 


served network of empirical relations between 


the boundary scores and other phenomena 
could just as well be explained within a per- 
ceptual framework as via body image con- 
structs. Basically, they question whether there 
is adequate support for assuming that the 
properties assigned to the periphery of ink 
blot percepts reflect how the body boundary 
is experienced. Such criticisms cannot be 
easily dismissed. The complete absence © 
previous empirical work with regard to body 
boundary feelings has made it difficult t° 
find adequate body image criteria agains 
which to validate the boundary properties as 
sumed to characterize barrier and penetration 
responses. There are, however, certain lines o 
evidence which have encouraged the writer 
(and Sidney Cleveland) to persist in a body 
image conceptualization of the data wh! 
have accumulated: 

1. Primary among these is the fact that 
the boundary scores are correlated with bo 
phenomena to a degree unequaled by previo" 
measures.*? They predict various levels ye 
body behavior in the way that a body imag? 
measure would be expected to do. They jaw 
shown themselves to be meaningfully relate 
to ability to cope with disablement of om?" 
body; to patterns of phantom sensatio 
triggered by amputation; to experienc? 
changes in body size and also light o 
threshold following sensory isolation; to ‘ sf 
ferential size judgments assigned to the tig t 
and left sides of the body; to concern ab? e 
the vulnerability of the skin; and to dese 
of anxious concern about the body (Fish 
1960; Fisher & Cleveland, 1958b). Incider 
tally, the last of these cited relationships g 
particularly pertinent because the bound 
scores do not correlate with indices of 2° 


e 
3 In a just completed and unpublished study fat 
writer (and Rhoda Fisher) have found a signif odf 
relationship between the barrier score and the baier 
associations given by normal subjects. High mney 
subjects gave associations which indicated tha! the 
exceeded low barrier subjects (,02 level) odd 


degree to which their awareness of the pot 
boundary region (skin and muscle) exceede east)” 
awareness of the body interior (stomach and P 5p 


This study represents the most direct and convie the 
demonstration of the body image foundation 
barrier score to date, 


| 


Bopy BOUNDARY CONCEPT 73 


fal anxiety which are not specifically phrased 


i body terms. 
a further striking that the boundary 
which ae patterns of body reactivity 
nc volve a differentiation between the 
body, Wh Me nonboundary regions of the 
are reflect gy assumes that the scores 
ion at eid of differential levels of activa- 
or actual] ody sites (inside versus outside) 
activation E S a role in instigating the 
soliq oe ferences, the fact remains that a 
congruence ation exists. There is force in this 
Signed i between boundary properties as- 
reactivity me ink blot periphery and the 
‘phery A aracteristics of the body pe- 
lismiss it Sus body interior. One cannot 
Sidets the T mere coincidence if one con- 
ĉrior-interi act that the existence of an ex- 
jects ci activation pattern in normal 
Model pr as specifically deduced from the 
concep vided by the body image boundary 
Mage co here are few, if any, non-body- 
w Suc ee which would lend themselves 
raphy.” iction phrased in terms of body 
fou No dependable relationships have been 
tween the boundary scores and in- 
T nitive o might be considered to have 
“a oun r “Perceptual style” connotations. 
ated to ary scores are not consistently re- 
Oductivitg: variables as intelligence, verbal 
Plexity im: the Barron simplicity-com- 
dents: con asion, Gottschaldt figure judg- 
tin Dane o ora] individual Rorschach 
D, rigig Peed of figure-ground alterna- 
It jg ty, or authoritarianism. 


r h 
we o ognized that more work needs to be 
wt as tablish the body image rationale 

Ores been advanced for the boundary 


Severa] a a matter of fact, the author has 
tg strate ies under way which seek to 
tie Orts of tong an individual’s introspective 
in Ns (eg, Ody experience in various situa- 
tin Song Aer stress, in response to drugs; 
lar „to placebo) conform to the par- 
és of exterior versus interior 
at would be predicted from his 

| Penetration scores. 
of eo it may be added that not 
ant Petatig € attractions of a body image 
that «02 Of the boundary scores is the 
as led to the successful testing 


b G tions 
r 


of a variety of novel hypotheses. Is this not 
a major consideration in any research 
strategy? 
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SYMBOLIC LEARNING AND READING RETARDATION 
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It was predicted that advanced, average, and retarded readers would also be 
advanced, average, and retarded in both the number of trials required to 
master symbolic learning tasks and in their response latencies to visual and 
auditory symbols. Some confirmation of the prediction was obtained, but 
both the provision of a reward and the effect of practice increased the 


efficiency of retarded readers, 


especially on the auditory task. The data were 


interpreted as suggesting that retarded readers need an incentive in relatively 


difficult learning situations and also 
readers tend to be less attentive to 


th 
st 


difficulties in symbolic learning may 


ability to attend to stimuli. 


W. 
ret. alters and Doan (1962) suggested that 


bec. readers have difficulty in learning to 
ifficult symbols with responses and that this 
eme Tey confined to tasks in which 
Was sup ol is a printed word. This suggestion 
Ma Sst by a study of symbolic learning 
ey oe average, and retarded readers 
jects varg and nonreward conditions. The 
ic P Maa Grade 7 and 8 boys from two 
e aso entary schools. Boys in one school 
dren in te to the reward condition; chil- 
multi le second school were not rewarded. 
Pattment. €-choice apparatus with four com- 
as used. each opened by a separate door, 
Neutral si The subject was asked to find a 
Comparp ulus object hidden in one of the 
by ali ments, each of which was symbolized 
4 lag t of a particular color. His task was 
a n the associations between the colors 
tiong ret compartments. Under reward condi- 
e ie readers performed significantly 
Tage ently than did either advanced or 
readers, Provision of a reward pro- 
ead a a marked improvement for retarded 
s ra ut the performance of the other two 
Ure, = influenced very little by this pro- 
tions authors concluded that there are 
l s that the performance of retarded 
arin’ Particularly in the area of symbolic 
> Can be improved by the provision of 
Pe wish to express their appreciation 
id Sea Board of Education and the 
Ration g tat Of Clairlea Public School for their 
in this study, and to Christine Orbell for 


tarn 
ter A collecting the orthorater and audi- 
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at, unless highly motivated, retarded 


imuli. It was further suggested that 


be, in part, a function of reduced 


a reward; however, they suggested that their 
results might have been more conclusive had 
the reward been more elaborate and more 
appropriate for the subjects’ age levels. 

Since Walters and Doan were interested in 
several aspects of the perceptual and cognitive 
functioning of retarded readers, only a part 
of their study was devoted to symbolic learn- 
ing. Their use of only a visual stimulus limits 
the scope of any generalization that may be 
made on the basis of their study. Moreover, 
the use of students from two different schools 
in the reward and nonreward conditions, and 
the consequent possibility of confounding the 
influence of reward with influences of different 
teaching methods, socioeconomic status, and 
other school-related factors reduce confidence 
in their findings relating to the differential 
effects of motivational level on the three 
groups of readers. 

Walters and Doan used only a comprehen- 
sion test as an index of reading proficiency. 
Reading comprehension correlates highly with 
verbal intelligence test scores (Hage & Stroud, 
1959), and a composite reading score is a 
more valid index of reading proficiency than 
any one test alone (Reed & Pepper, 1957). 
Consequently Walters and Doan may not 
have adequately assessed the discrepancy be- 
tween the reading proficiency scores and the 
intelligence test scores of their children, i.e., 
the measure on which their division of pupils 
into different reading groups was based. 

The present study represents an attempt 
to extend and refine the procedures used by 
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Walters and Doan in investigating symbolic 
learning in retarded readers. It differs from 
the earlier study in the following respects: 

1. The inclusion of two types of stimuli 
(symbols)—auditory and visual. : 

2. The use of a composite reading score 
based on tests of reading speed, comprehen- 
sion, and vocabulary. 

3. The selection of rewards only after 
consultation with school authorities to deter- 
mine their appropriateness to the children’s 
age levels. 

4. The use of children from only one school 
which was located in a socioeconomically 
homogeneous suburban community, 

The design also permitted an investigation 
of transfer effects from one sense modality to 
another and consequently provided data rele- 
vant to the hypothesis advanced by Raab, 
Deutsch, and Freedman (1960) that retarded 
readers cannot shift responses as easily as 
good readers. More directly, it provided evi- 
dence concerning the extent to which children 
at different reading levels benefit from prac- 
tice in symbolic learning, 

There is some advantage, at least in pre- 
liminary investigations of reading retarda- 
tion, in confining attention to children re- 
ceiving education through the regular school 
curriculum, since placement in special classes 
or selection for remedial reading groups may 
produce emotional reactions which could influ- 
ence performance in selected perceptual and 
cognitive tasks. 

Reading disability is much more prevalent 
among boys than among girls. For this reason, 
and because the motivational patterns of girls 
in social learning tasks differ from those of 
boys (Crandall, 1963), only boys were used 
as the subjects in this study. 

Studies of development of perceptual skills 
in children indicate that performance on 
perceptual tasks improves considerably with 
age during early and middle childhood 
(Bender, 1949; Coleman, 1953). In view of 
this finding, boys under 11 years of age were 
excluded from the present experiment. 

In the present study the following hypothe- 
ses were tested: 

1. In a multiple-choice task involving sym- 
bolic learning, retarded readers perform more 
poorly than average readers, and average 


readers perform more poorly than advanced 
readers. ’ 

2. Retarded readers give slower reaction 
times to both auditory and visual stimuli in 
a multiple-choice task than average readers, 
and average readers give slower reaction times 
than advanced readers. À 

3. Provision of an incentive reduces dif- 
ferences in performance among the three 
groups of readers. 


METHOD 
Subjects 


The subjects were drawn from Grades 6, 7, and § 
boys attending a single suburban public school. 3 
The selection procedures which led to the clas 
sification of the subjects into advanced, averen 
and retarded reading groups were similar to thong 
described by Walters and Doan (1962). They a 
volved a comparison of each subject’s relative stand 
ing in reading (average reading grade) and bis 
relative standing in intelligence, ji 
The Dominion Group Test of Learning Capacity 
and Gates’ Reading Survey were administered to E 
boys in Grades 6, 7, and 8, Grades 6 and 7 tests We" 
scored by the experimenter, and Grade 8 tests 
the school principal. into 
The raw scores for each test were transformed in a 
z scores in order to estimate the discrepancy betwee’ 
each child's reading level and his intelligence. in 
Z scores were computed separately for each test A 
each grade in order to restrict the comparison e 
each pupil’s performance to that of his peers in ag 
and educational level. Boys whose z scores in read 
minus their z scores in intelligence were greater tha 
+.5, were classified as advanced readers, Those Wi 
a discrepancy score greater than —.5 were consider 
retarded readers, and those with discrepancy score’ 
between +.39 and —.39 constituted the averas 
reader group. s 
The IQ range of subjects used in this study T 
restricted to between 90 and 126 in view of the af, 
ficulty in interpreting discrepancies between inte 
ligence and achievement in children at extreme rang 
of ability. m 
Boys less than 11 years old were excluded ff 
study. Boys with known visual anomalies Ree 
hearing loss (as measured by a test of audit? z 
acuity) were eliminated from the sample, as bar 
those who had not been educated entirely in Cana fe 
Also excluded were boys reported by school auth? 
ties to have emotional or behavior problems. and 
Seventy-two boys—24 advanced, 24 average $ e 
24 retarded readers—who met all the above requ” 0 
ments were selected, Their age range was from 
to 15.8 years, with a mean IQ of 106 for Grade 
107 for Grade 7, and 104 for Grade 8. The ™ 
IQ for all subjects was 105. as” 
Advanced, average, and retarded readers were 
signed to reward and nonreward conditions in $ 
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a way eer 

r F to maintain, as far as possible, a balance 

oe the six subgroups in age, grade level, and in 
g-intelligence discrepancy scores. 


Apparatus 


2 akg consisted of a multiple-choice box 
and the pro —— by Walters and Doan (1962), 
Were S Are ures used for presenting visual symbols 
compartment. the same as in the earlier study. Each 
Particular color a randomly paired with lights of a 
e correct x or and the subject’s task was to learn 
or the color-compartment associations. 
isual ee of the auditory task the 
Pure-tone anism and buzzer were disconnected. A 
Sea tor with earphones, attached to the 
Symbols e box, was used to present auditory 
tone, Selec en the box was completely closed, & 
the boy ted by means of a manual presetting behind 
Each of oo audible through the earphones. 
Coupled at e four compartments of the box was 
quencies ys, ris with one distinct tone. The fre- 
1000 cps. Th were approximately 100, 175, 420, and 
Teadily di nese frequencies were selected as being 
e ee one from another. 
resp ie enables the latencies of the subjects’ 
(Walters k. each trial to be automatically recorded 
Doan, 1962). 


Pr Cedure 


Each sub; 
9 a subject was conducted by the experimenter 
ligh and room, which was illuminated by artificial 
1° the eo only a desk and chairs in addition 
stan, i ee equipment. The subject was asked to 
then j ro. ront of the multiple-choice box and was 
th iain to the test procedures. 
the Visual uctions for the administration of both 
the der and auditory tasks varied according to 
© Visual in which these tasks were presented. If 
Bave the task was presented first, the experimenter 
following instructions: 


Tam go; 
test an going to hide this nebbish (holding up the 
in one of these four boxes. (Demon- 
yo want you to see if you can fnd it. 

eMonstr. hear a buzzer, watch these lights. 
Pen the wt) As soon as you see a light go on, 
p bbis „door of the box in which you think the 
wily, yo hidden, If you watch the lights care- 
Ka tims can learn to choose the correct box 


Th 
Ang © experi 
the Procee erimenter seated herself behind the box 
Vi 


pider, Qus pistes place the nebbish in the back of 
Utto, 2 e: 
Thh, which 
Lo, Per} 
Trent” kibje 
tria, Were re 


I 
the 


partments in a predetermined random 
trial, she pressed the appropriate light 
also served to activate the buzzer. 
was continued until the subject had 
successive correct trials. The latency 
S response and his choice of compart- 
corded by the experimenter for each 


Or e 

r 
two A Separate in time the administration of 
bolic learning tasks, a coding task, 


taking approximately 15 minutes, was introduced. 
This task was chosen for exploratory purposes only. 

During the administration of the auditory task, the 
subject, wearing earphones, again stood in front of 
the multiple-choice box. Following a demonstration 
of the four tones, he was instructed as follows: 


Now I am going to hide this nebbish again in 
one of the four compartments of this box. This 
time I want you to listen to the tones in your 
earphones. As soon as you hear a tone, open the 
door of the box in which you think the nebbish 
is hidden. If you listen carefully, you can learn to 
choose the correct box every time. 


The latency of the subject’s responses and his choice 
of compartment were recorded for each trial. The 
trials were repeated until eight successive correct 
responses were obtained. 

When the auditory task was presented first and 
the visual task second, instructions for each task were 
appropriately modified. 

When all other testing was completed, each child 
was handed a color naming chart (Wells & Ruesh, 
1945) and asked to name colors in quick succession 
from left to right, in order to ensure that poor per- 
formance was not due to difficulty in identifying 
colors. No subject included in the study had a 
color-identification problem. 

Each subject was asked to refrain from discussing 
the experiment with other children until it was 
completed. 

Order of presentation. The order of testing of the 
subjects was determined by the alphabetical order 
of their names. One half of the children in each of 
the six subgroups was given the auditory task first, 
while the remaining children were given the visual 
task first. Assignment to one or other order of 
presentation was made at random. 

Since knowledge that other subjects were being 
rewarded for participation in the study could have 
influenced the attitudes and performance of the sub- 
jects in the nonreward group, all nonreward subjects 
were tested before testing of reward subjects began. 
An interval of approximately 3 weeks, which in- 
cluded the Christmas vacation, separated the testing 
of nonreward and reward subjects. 

Rewards. Reward subjects were shown an array 
of games and attractive objects (ie. miniature chess 
sets, pens, key chains, stamps, etc.). They were 
allowed to examine the display for a few minutes 
and then, before commencing the first task, were 
try hard and do well on these tasks 
which I will ask you to do, you can have one of 
Remember, you can get the prize only 


At the conclus 


had done well 
Oe ee of his choice when the experimenter had 
finished her work at the school. In an attempt to 
minimize communication between the subjects and 
to reduce the children’s curiosity concerning the 
nature of the experiment, distribution of prizes were 


withheld until the conclusion of the experiment. 
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Following the completion of testing, all subjects 
(reward and nonreward) who took part in the 
project were given a “prize” for their participation. 
Supplementary tests. Measures of the subjects 
binocular visual acuity were obtained on a Bausch 
and Lomb modified orthorator. Auditory acuity was 
tested with a Maico audiometer. These measures 
provided a check on the accuracy of school reports 
concerning absence of visual or auditory defects. 
There was no evidence of differences in visual or 
auditory acuity among the three reading groups. 


RESULTS 


Two measures of each subject’s perform- 
ance were secured from the symbolic learning 
tasks: number of trials to criterion and 
median reaction time of the subject’s re- 
sponses. Medians were chosen in preference 
to mean reaction times since for many 
subjects the initial response latency was 
disproportionately slow. 


Analysis of Trials-to-Criterion Measure 


Table 1 shows the mean number of trials 
to reach criterion for all subgroups of subjects 
on both the visual and auditory tasks. An 
analysis of variance, the results of which are 
given in Table 2, indicated that there were 
differences among the groups significant at 
the .05 level, and a Tasks x Orders effect sig- 


TABLE 2 


ANALYsIS OF VARIANCE OF NUMBER OF TRIALS TO CRITERION 
ON THE SYMBOLIC LEARNING TASK 


TABLE 1 


J z E 
MEAN NUMBER OF TRIALS TO CRITERION ON TH 
SYMBOLIC LEARNING TASKS 


n el se 
Visual Auditory 
Visual Auditory Visual Auditory 
first second second first 
Advanced readers Das 
Rewarded 93.67 66.17 35.50 6 er 
Not rewarded 71.67 47.17 25.17 ól 
Average readers 
Rewarded 60.67 41.83 21.83 10 ee 
Not rewarded 97.67 37.66 26.33 71 
Retarded readers 83 
Rewarded 95.83 48.33 45.07 oe 
Not rewarded 102.17 41,50 30.83 NHH 
Taaie T 


Note.—N =6 in each subgroup. 


nificant at the .001 level. This latter finding : 
may reflect a kind of transfer effect, 1e; 
subjects with previous training on the visua 
task did better on the auditory task are 
subjects without previous training, and VC 
versa. Subsequent ¢ tests of group mean di 5 
ferences showed that retarded readers p" 
formed more poorly than either advanced ° 
average readers (differences being significa? 
at the .01 level), but that advanced 4? 
average readers did not differ. 

In addition to the overall analysis of V4" 


j- 


Source SS df MS F 

Reading groups (G) 7,573.875 2 3,786.938 3.38* 
Reward conditions (C) 0.340 1 0.340 <1 
Order of presentation (0) 1,687.840 1 1,687,840 1.51 
exc 4,422.931 2 2,211.466 1.98 
GXO 4,241.514 2 2,120.757 1.90 
cxo 85.563 1 85.563 <1 
GACXO 4,182.123 2 2,091.062 1.87 
Error(b) 67,147.750 60 1,119.129 l 
Tasks (T) 3,164.063 1 3,164.063 2.21 
TXG 454.318 ž 227.159 <1 
TXO 87,172.562 1 87,172.562 60.82*** 
TC 0.007 1 0.007 <1 
TXGXO 7,114.348 2 3,557.174 2.48 
TXGX GC 6,854.652 2 3,427.326 2.39 
TXCXO 2,626.562 1 1,313.281 <1 
TXGXCXO 3,669.239 2 1,833.115 1.28 
Error(w) 85,990.749 60 1,433.179 


ae e eS 5 


*p <05. 
sp < 001. 
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TABLE 3 


ANALYSIS OF VARIANCE OF TRIALS TO CRITERION 


on Visuat Task ONLY 


Source SS df MS F 
Groups (G) = 
6 3 ; 3,678.250 2 1,839.125 2 
— of presentation (O) 56,560.056 ł 56,560. 056 sere 
R vard conditions (C) 0.222 1 ” 0.222 <1 
GXO 4,259.695 2 2,129.848 1.39 
GXCXO 221.027 2 110.51 <1 
Error(b) 1,578.583 2 789.291 gi 
92,093.666 60 1,534.894 
“> < o0, 
ance 
» $e 
Carried Parate analyses of variance were not uniform for the three groups of readers, 
; as indicated by a Groups X Orders interaction 


neither z the visual and auditory tasks. 
&ct related 2 was there a significant main 
k (Table © reading level. For the visual 
Pretrainin 3), the subjects who had had 
better than on the auditory task performed 
taining i, the subjects who had had no pre- 
n be nan no other effect was significant. 
significant Gra, task (Table 4), there was 
a <.05). U roups X Conditions interaction 
te vanced an pines nonreward conditions both 
T than reta average readers performed bet- 
i ant at a ed readers, differences being sig- 
a average 01 and .02 levels, respectively; 
ter, ef and advanced readers did not 
Ongiy; ctarded readers under the reward 


kt lon 5 
parde Performed significantly better than 


1 d 

S aa under the nonreward condi- 

“ing vanced the performance of average 

of tence b readers was not significantly 
y reward. In addition, the effect 


Preyj 
lous ci 
training on the visual task was 


LYSIS on y TABLE 4 
ARIANCE OF ‘TRIALS TO CRITERION ON 


S Av 
a “Vnirory Task ONLY 
Uree = _ 
Groups SS af MS F 
tde; 
Condi 3350.250 2 2175.125 210 
axes Hai 1 32,300.347 31.24** 
x 235 i 0.1 <1 
X S ao 583 2 3508.791 3.39% 
XC 1134.528 2 5567.264 5.38** 
orgs O 1830.125 1 1830.125 1-77 
b) 273.083 2  3,136.542 3-03 
0 


6 
> 2,044.833 60 — 1,034.081 
t <0: 


significant at the 01 level. In the case of 
average and retarded readers, the groups pre- 
trained on the visual task performed better 
on the auditory task than the corresponding 
groups without pretraining (p<.01 and 


p < .001, respectively). 


Analysis of Median Reaction Times 


The median auditory and visual reaction 
times for each subject were estimated, and 
these medians were then averaged to give the 
group means shown in Table 5. 

Table 6 shows the results of an analysis 
of variance of subject’s median reaction times 
for the multiple-choice tasks. There was a 
significant Groups X Orders effect, reflecting 
the fact that when the auditory task was 
given first, retarded readers responded more 
slowly than either advanced or average 
readers ($ < (001 in each case), whereas 


TABLE 5 


MEDIAN REACTION TIMES 1N THE 


Group MEANS OF 
AND VISUAL TASKS 


AUDITORY 
Visual Auditory 
a 
Visual Auditory Visual Auditory 
second second first 


first 


Advanced readers 


Rewarded 190.50 173.67 155.00 191,67 
Not rewarde 149.83 177.50 209.83 231.17 
Average readers 
Š ‘Rewarded 203.50 192.50 157.33 170.17 
Not rewarded 201.83 176.17 192.50 220.83 
tarded readers 
Re warded 166.67 174.33 239,33 321.17 
165.00 178.83 220,33 295.67 


Not rewarded 


Note.—1! /100-sec- units. 
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TABLE 6 
ANALYSIS OF VARIANCE OF MEDIAN REACTION Tres 
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TABLE 8 


oN 
ANALYSIS OF VARIANCE OF TRIALS TO —— 
SYMBOLIC LEARNING TASKS FoR NoNREWARD 
JECTS: FIRST ADMINISTRATION ONLY 


Source SS af MS F 
706.5 F 
Groups 35.413.097 2 17;706:548 2.13 Source SS df MS 
Conditions 1,750.028 1 ae sore Er 
5 4 1 1,680.44 ` 7 z S 
PRA mo e oe Groups 19,778.389 2 9,889195 51 
Eo 71,132,681 2 = 35,566.340  4.29* Tasks 148.028 1 s 149.02% 1.99 
cxo 8.804.695 1 8,804,695 <I GXT 6811.055 2 3,405.528 1 
GXOXC 13,865.180 2 6,932,590 <1 Error(b) 51,337,500 30 1,711.250 
Error(b) 497,857.333 60 8.297.722 
Tasks sarin © asseio. Sye =a 
TXG 11.634.042 2 5.817.021 310 o, 
TXO 16,986.778 1 16,986.778 9.06 
20.028 1 220.028 <i 
ae 20 eri 2 209.298 =< Supplementary Analyses 
3.029.346 2 1,514.673 <1 5 ee E tion í 
TORC 448.0971 448,027 <1 Since both pretraining and the introduc dif- 
TXCXGXO Sias e  PS+925S <i of reward appeared to modify group 
Error(w) 112,454.333 60 1,874.239 


*p <.05. 
ep <01. 


there were no differences among advanced, 
average, and retarded readers who were given 
the visual task before the auditory task. 
Visual and auditory reaction times were 
also analyzed separately. No significant dif- 
ferences emerged from the analysis of visual 
reaction times. The analysis of auditory re- 
action times (Table 7) yielded a Groups x 
Pretraining interaction significant at the .05 
level. Retarded readers who had had no pre- 
training on the visual task responded signifi- 
cantly more slowly than either advanced or 
average readers with no pretraining (p <.01); 


there were no differences among advanced, 


average, and retarded readers who had 
received prior training. 


TABLE 7 


ANALYSIS OF VARIANCE OF MEDIAN LATENCIES FOR 
AUDITORY Task ONLY 


Source SS df MS F 
Groups 41,430.778 2 20,718.890  4.00* 
Conditions 1,605.556 1 1,605.556 <1 
Orders 63,962.722 1 63,962.722 12.34*** 
GXC 3,640.777 2 1,820.389 <1 
GxoO 48,769.777 2 24,384.889 4.70* 
CxO 2,640.222 i 2,640.222 xi 
GXCXO 7,531.445 2 3,675.723 <1 
Error(b) 310,935.333 60  5,182.256 

LJ P 

we $ 00i 


ferences, further analysis were carried E 
in which only the data for the first y 
performed by nonrewarded subjects W° 
included. sis 
Table 8 shows the results of an analy’ ee 
of variance of trials to criterion for the ae 
groups of subjects on the task, auditory 
visual, which was first administered to a 
Differences among the three reading at 
were significant beyond the .01 level. ra 
there was no significant difference bet™ pe 
tasks and no interaction effect, data ton fot 
auditory and visual tasks were combine air 
subsequent ¢ tests of differences between Phat 
of reading groups. These tests indicated Jy 
retarded readers had performed more P°? r$ 
than either advanced or average Tea dif 
(p < .01 and p < .05, respectively). NO 
ference was found between the perform 
of advanced and average readers. 0 
A corresponding analysis of varianc? ig 
median latencies is shown in Table 9. 4 ‘ob 
ference between tasks significant at the 


TABLE 9 6 


rae 

ANALYSIS OF VARIANCE oF MEDIAN REACTION ae? 

FOR SYMBOLIC LEARNING TASKS FOR Noni 
Supyects: FIRST ADMINISTRATION ONY 


F 
Source SS df MS 
Zs af 
Groups 9,526,889 2 4,763444 <77 
Tasks 53,361,000 1 53,361.000 %48 
GXT  18788.667 2 9394334 
Error(b) 238,318.667 30 7943.956 


#2) <0; 


ance 
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al pts the fact that the children in 
fee ides more slowly to the auditory 

=... to the visual task. Both the dif- 

eee the reading groups and the 

evel n effect failed to reach an acceptable 
of significance, 


DISCUSSION 


Thi 
findin, Study generally confirms the earlier 


tet es and Doan (1962) that 
lic een, ers have difficulties with sym- 
long cia However, significant interac- 
taining g ing the effects of reward and pre- 
Statement a not permit an unconditional 
Stoups of E out differences among the three 
€ pro es, 
€ form <n of an additional incentive in 
e Performa, reward did not equally improve 
Performa ance of all subjects. Neither the 
teaders a ce of advanced nor of average 
N the hs Significantly affected by reward; 
tardeq Tea s r hand, the performance of re- 
R au o who were rewarded was, on 
at of a y task, significantly better than 
Nardeq Spe readers who were not re- 
p Varded A fact, retarded readers who were 
A th alee task did as well as children 
a reading groups. The auditory 
t, ay to have bean the more dif- 
te, than at took longer for all groups to 
tarq a id the visual task. It may be that 
readers, when a learning task is rela- 
F lt, need an additional incentive 
dise: S tear ret in the solution of the 
of “minatio - On the other hand, auditory 
it attention n may well require more effort 
hig’ Possihie "tee Part of the subject and 
Nd 0 fon that retarded readers, unless 
the he ig ated, are less attentive to stimuli. 
the first 4 Y significant transfer effect from 
Dropp Metal € second task suggests that once 
ne eee of the symbolic learning 
tions S Benen aetstood, the principles involved 
angst? oii ized to somewhat similar situa- 
SR ie ae the learning process. This 
D thatded ii Was most noticeable in the case 
ong Other aders, who performed as well 
Visad o two groups of readers on the 
€ tasks administered, whether 
itory, Moreover, retarded read- 
ad pretraining on the visual 
significantly more quickly to 


T aud 
h 


the auditory symbols than those who had had 
pretraining. 

On the basis of previous studies it was 
predicted that retarded readers would have 
slowest reaction times. This hypothesis was 
supported in the case of the auditory task 
and particularly when this task was admin- 
istered without pretraining. On the visual task 
no significant differences were found among 
the response latencies of the three reading 
groups. This finding is in direct contrast with 
the findings of earlier studies by Raab et al. 
(1960) and Walters and Doan (1962), in 
which it was found that retarded readers 
were significantly slower than more advanced 
readers in responding to visual stimuli. This 
contradiction may well be a function of the 
differences in the nature of the tasks employed 
in the three studies. Both Raab et al. and 
Walters and Doan used simple reaction-time 
tasks, whereas the symbolic learning tasks 
involved the additional elements of a multiple- 
choice situation. 

The findings of the present study thus sug- 
gest that in a complex task the speed of 
learning of retarded readers is more likely to 
vary with practice and motivational level than 
that of advanced or average readers. More- 
over, reaction times of retarded readers are 
also susceptible to the influence of practice. 
Thus, it seems that children who are average 
or better in reading perform in a more stable 
manner, from condition to condition, than do 
children whose reading achievement is poor. 

Since the effect of pretraining was greater 
for retarded readers than for the other two 
groups of children, it seems possible that, in 
addition to the fluctuations in motivation 
often associated with underachievement, other 
factors contribute to the learning difficulties 
of retarded readers. A deficit in attention may 
be one of these. In a recent paper, Gardner 
and Lohrenz ( 1961) suggest that individual 
differences in assimilation are in part a func- 
tion of characteristic differences 1n the degree 
to which individuals attend to stimuli. It is 
not inconceivable that the difficulties which 
retarded readers demonstrate in various types 
of associative and symbolic learning tasks are 
not primarily due to a deficit in their ability 
to form associations, but rather are a function 
of reduced ability to attend to stimuli. In 
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view of this, it may be advisable to investigate 
the role of attention in associative learning 
in relation to reading achievement. 
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THE USE OF AN OPEN-ENDED GROUP IN THE 
INTAKE PROCEDURE OF A MENTAL 
HYGIENE UNIT* 


ANDREW = 
V S. DIBNER, ROBERT D. PALMER,* BENJAMIN COHEN, np ARNOLD G. GOFSTEIN 


Veterans Administration Mental Hygiene Clinic, Boston, Massachusetts 


Several clinics have found screening in small groups on Intake to be beneficial 


in reducing waiting lists and evaluatin 
used small short-term groups to orien’ 


g patients for treatment. Others have 
t and educate patients to treatment. 


We have run a continuous, open-ended Intake group for these purposes and 


to provide nuclei 


elimination of the waiting list and de 
preparation for psychotherapy. During a 
referred to the open-ended group staye 
than a control group. A replication using 


result. 
As j 

hae, ae with many large clinics, the 
linie in dministration Mental Hygiene 
Problem of Boston was troubled with the 
eking a growing waiting list of patients 
Procedure sychotherapy. The usual Intake 
“table — assigning patients deemed ac- 
asis ae psychotherapy to therapists on 
e Often openings as they occurred was 
z riment resulting in a pile-up of cases. An 

Sgn to al group therapy procedure, de- 
aNd to fy Screen candidates for psychotherapy 
"em bere eee continuously as an open-ended 
z a a tinent group, was instituted to 

mte De list unnecessary. ; 
% Tee area ial group was expected to help in 
iting <@S—first, reduce or eliminate the 


Uns Peel i 
eqs. n lly a problem with child guidance 
Ntact Ţ Some the wait is between the first 


n an 
ana dice Intake evaluation; in some as 
Oy, Test ©) it is between Intake evaluation 


t A 
hag Study herapy appointment. At the time 
Wai Pou as initially started in 1958, we 

in, 


By, eee 
Yeho tion 
p logical s 


il f this paper was read at the Eastern 
No ól. 


Association Meeting, Atlantic City, 


ow 
Assay at 

a Wi 

husets orcester State Hospital, Worcester, 


for other closed therapy groups. The result has been 


monstrated improvement in patient 
period of 18 months, the 61 patients 
d significantly longer in treatment 
another therapist yielded a similar 


frustration to clients who are seeking help. 
Waiting has been claimed by many to 
augment ailments, to make resistance to 
therapy more obdurate, and to prolong the 
therapeutic process. For some patients a dis- 
order may have become chronic by the time 
he is seen for the initial interview. However, 
it should also be mentioned that in many 
cases a waiting list serves as a benefit be- 
cause it gives some patients time to solve their 
problem on their own. It often gives time for 
external situations to develop, thus relieving 
pressure, and in the cases of children who 
need help, it gives the child a chance to 
mature. In general, however, most agree that 
any technique which would reduce a waiting 
list would be beneficial. 

The second problem area of Intake at which 
our open-ended group technique was directed 
was that of screening for psychotherapy. 
Could this new procedure provide a fresh 
approach to social diagnosis and augment the 
impressions gained through interviews by the 
social worker and psychiatrist? Could it shed 
light on prognosis and indicate the patient’s 
capacities to benefit from individual psycho- 
therapy or group psychotherapy? 

The third Intake problem of concern is 
that of preparing the patient to accept psy- 
chotherapy- The high dropout rate in psycho- 
therapy after being screened for acceptability 
is evidence of patients’ unpreparedness. Many 
investigators have complained of the waste of 


professional effort and time that this causes: 
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the hours of psychiatrists’, psychologists’, and 
social workers’ time that go into Intake and 
the further hours that are scheduled for pa- 
tient appointments where the patient does not 
show up. In our clinic, where only about 50% 
of the applicants are offered psychotherapy, 
45% drop out by the fifth session. Kurland 
(1956) reports similar statistics at other 
Veterans Administration clinics. 

Various methods have been sought to end 
this waste, to better select and to motivate 
patients for psychotherapy. For example, 
Bitterman (1958) reports using a program of 
telephone evaluations to judge applicants’ 
tolerance for waiting, and scheduling first 
appointments appropriately. 

Taboroff, Brown, Kormer, Reiser, Tal- 
madge, Goates, and Stein (1956); Stone, 
Parloff, and Frank (1954); and Peck (1953) 
report doing Intake diagnosis in short-term 
groups. Taboroff found that diagnostic ratings 
derived from observations of behavior in 
groups were similar to those accomplished by 
psychiatric interviews and psychological tests. 
Peck found that patients could be oriented to 
psychotherapy while they were being evalu- 
ated for psychotherapy. 

Other types of short-term screening groups 
were described by Ginott (1956) and by 
Hotkins, Kriegsfield, and Sands (1958). 
These were for parents who applied to child 
guidance centers, It was found that appropri- 
ate referrals either within the agency or else- 
where in the community could be rapidly 
made and that the early dropouts could also 
be quickly and economically identified (Ginott 
& Blek, 1959), 


METHOD 
Subjects 


The experimental group (E) consisted of 61 
male veteran patients assigned to the open-ended 
therapy group by Intake psychiatrists during a 
15-month period. Although the group was intended 
to exclude patients diagnosed as psychotic, some 
were accidentally referred. The control group (C) 
was chosen to represent a typical cross-section of 
the usual Intake referral (directly to a therapist) 
during the time the experimental group was oper- 
ating. Two control subjects were selected for each 
experimental subject, one chosen from the Intake 
list of the day before the experimental case was 
referred and one from the day after. Thus a control 
group of 122 cases was formed. The average age of 
patients in both groups was approximately 39 years. 


Sixty percent of the cases in each group had aias 
noses of psychoneurosis, the rest formed appa 
mately equal groups of personality disorders, psy¢ i 
physiological disorders, or psychoses, borderline oF } 
remission. s 

At the time of the study all patients seek 
treatment were seen that day by a social WOrke 
and psychiatrist. The psychiatrist would decide k 
psychotherapy would be offered; then the pane 
would be assigned to an opening in some 
schedule or would be put on a waiting list. 4 
first batch of patients in the open-ended group Clea 
from the waiting list; subsequently they came fro 
Intake. 


Open Group Therapy Procedures 


The therapist had a brief individual interview 
with each new patient before his entry into cl 
ongoing group. He described the clinic’s treatm 
facilities and presented the open-ended group tO se 
patient as a “trial” situation which would be u pe 
by the patient and the therapist to decide which oe 
of treatment, if any, was most appropriate for ha 
The patient was told that, after a few meetings ig 
the group, he and the therapist would meet 38 y 
to discuss their decision as to further treatm aca 
The treatment proper involved weekly minn 
during which the therapist fostered group awaren® 
and discussion of their experience and fooling: N 
relation to symptoms. The therapeutic techna 
differed from that used in the usual “closed” 8" jve 
at our clinic in that the therapist was more acre 
in introducing new members and in orienting ”.. 
remarks toward describing what group therapy iy 
about. The therapist expressed clarification as pete 
as possible. He would also occasionally elie 
individual meetings with patients after a g! a 
meeting to discuss with them their behavior in 8" 
and their reactions to the treatment. there 

The group roster ranged from 4 to 12 and yhe 
would be an average of 6 to 7 at meetings. to 
the therapist felt comfortable with a decision 4 di- 
what further treatment would be best for the ing 
vidual case, he would seek out the proper apa jth 
and reassign the patient after first discussing it “4o 
him after a group meeting. When transferrin? in 
group therapy, he would always transfer patien ans” 
pairs or other groupings. On three occasions be t ew 
ferred five or more at a time as a nucleus of ĉ 30 
group. Patients stayed in the group from 1 
sessions, averaging 6.2 meetings. 


Procedures for Evaluating the Treatment 


In order to evaluate the effectiveness of grd 
open-ended group, attendance and discharge 16%, jp 
were reviewed to determine the length of i jgh! 
treatment for patients in E and C groups: nts e 
patients in the E group, (13%), and 13 pe on 
the C group, (11%), did not report for ovate 
therapy session. Since these patients could i hey 
said to have experienced any treatment at ar com 
were dropped from further study. All furthe 
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co o done with patients who attended at 
3 months nee session. An arbitrary cutoff date, 
Intake es the last patient was referred by 
attended by osen, and then the number of sessions 
either date” each patient from date of referral to 

e of discharge or cutoff date was counted. 


RESULTS 


aed Aa the Experimental group stayed 
oup and a 6.2 meetings in the open-ended 
5 furthe then went on for an average of 
tof dat r sessions in other treatment by the 
uring na totaling 21.7 therapy meetings. 
g on a mod, patients in the Control 
Ings, ee an average of only 13.9 meet- 
Y t test, means are significantly different 
oups 1 ested of comparison of the 
thay’) $ illustrated in Table 1. This shows 
<a percentage of E group than 
n pee in treatment over time. 
due to f to check if these differences were 
Variable actors other than the experimental 
evant the patients were compared on 
nt dimensions, 
Not 4, Patients in E and C groups were found 
iffer significantly on age, number of 
le «oF glad in psychotherapy, year of 
Neth of a psychotherapy, average previous 
cumulat stay in psychotherapy, and total 
thera ; €d months of previous psycho- 
PYchother hus, previous experience Wi 
quits apy could not have produced our 
pa nosis pi two groups were different in 
patents og a higher proportion of 
of Phality D e E group were diagnosed 
Patient. + isorder and a higher proportion 
Sin the C group were diagnosed 


"vious 


Perg TABLE 1 


ENT, 

ACE 

Langg Ov Parents m E anp C Grours RE- 
SYcHormeRapy UNTIL Curorr DATE 


Sessions 


1 5 40 20 30 40 50 


E 
GC 10 st a 0 26 7 § 
00 66 46 27 15 6 4 


© hy 
Urin Suto ber 
a the date Sessions a patient could theoretically attend 
‘Or 39 Curse Shire depending on when he began therapy 
es in %40, or athe study. Thus the low percentages indi- 
or 50 sessions is partly due to the small num beg 
ions 2, "2° had entered early enough to attend 
nthe Cutoff date, These percentages are not 
erapy stay-rate. 


Psychotic. However, since patients with diag- 
noses of personality disorder had a lower 
attendance record than the group average, 
and patients with psychotic diagnoses had a 
higher attendance record than the average, 
the group differences on these dimensions 
could not have accounted for the experimental 
findings. Rather they work in the reverse 
direction. 
Discussion 


First of all one might ask, “Does this sort 
of procedure have any therapeutic effective- 
ness, or is it wholly diagnostic and educa- 
tional?” Although this question was not 
formally explored, it is the opinion of those 
consultants involved in the experiment that 
it was therapeutic. Three patients were dis- 
charged as improved directly from the open- 
ended group, and after several meetings many 
other patients yerbalized lessening of anxiety 
feelings or reduction in physical complaints. 
From the standpoints of content, activity of 
interaction, and extent of patient participation 
tape recordings of most sessions of the group 
would be indistinguishable from recordings of 
other “closed” therapy group meetings. In 
most ways the therapist’s technique in a 
typical meeting is similar to that generally 
used at the clinic. He stimulates interaction, 
reflects expressed feelings, and compares and 
summarizes the various experiences, attitudes, 
and feelings expressed by different members. 
Where possible, he connects related material, 
including cause and symptomatic effect. Al- 
though he is generally inactive during the 
group discussion, he demonstrates continual 
interest and acceptance. 

However, in several ways the therapist’s 
technique differs from that used in closed 
groups. The therapist is more active in 
introducing new members, and in describing 

the group membership 


(repeatedly because ; 
changes) what group psychotherapy is about. 


In most closed groups the therapist purposely 
avoids such orienting remarks. Because the 
patients are not expected to remain in the 
group for very long, focus is mostly on what 
sustains, impairs, and improves individual 
adaptation, and the relationship to symptoms. 
There is less activity and depth in interpreta- 


tion than would be the case in a closed group. 
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One other difference lies in the therapist's 
willingness to see any patients for brief in- 
dividual contacts before or after group 
meetings if he deems it advisable. Patients 
seek out the therapist, perhaps to tell him of 
some recent disturbing event in their lives. 
The opportunity is then taken to accept this 
as a problem and encourage the patient to 
bring the problem to the group for dis- 
cussion, emphasizing that this is exactly 
the purpose of the group. The therapist, too, 
will ask one or more patients to stay after 
a meeting to discuss with them individually 
how they see treatment progressing, and to 
tell them some of his observations of their 
behavior. In this way, silent members in 
particular are recognized and their difficulties 
in participating are discussed with them. 
Often exploration of their feelings and en- 
couragement to participate has resulted in 
lessened guilt, stronger group feeling, and 
increased participation in meetings. 

Aside from the special effects of the 
therapist’s activity, the effects of two other 
factors should be considered. For one thing, 
the open-ended group meetings were pre- 
sented to the patient as a trial situation. No 
immediate commitment to psychotherapy was 
promised or demanded, whereas this is 
often the case in direct referral to a therapist. 
A new patient applies for treatment at the 
clinic at a time when his anxiety and self- 
concern are often at their highest points, 
defenses are fluid, and he may have mobilized 
himself to do something constructive about it. 
This is the best time to initiate treatment. 
At the same time the patient (at our clinic, 
at least) is a person of limited acquaintance 
with psychotherapeutic practice, is not psy- 
chologically minded, and sees no special 
merit in any kind of extended psychological 
treatment. He wants help but at this time 
could only want an impermanent arrange- 
ment. That is exactly what is offered by 
the open-ended group. Also the prospect is 
offered that if the trial treatment is successful, 
he and the therapist together can make the 
most suitable arrangements for more per- 
manent treatment. , 

This atmosphere of no immediate com- 
mitment can have a beneficial effect on the 
therapist as well as on the patient. It relieves 
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him of the subtle pressure to retain a cast 
in treatment once it has been screened by 
Intake and assigned to him as appropriate 
for treatment. The therapist’s acceptance 0 
the appropriateness of psychotherapy for pS 
new patient often exceeds the patient s a 
ceptance of this procedure which is alien to 
him. 

A feature unique to the open-ended group 
which undoubtedly added to its efiectivenc® 
was its method of transferring patients k 
pairs or other groupings to ongoing or M 
formed groups. These men were proba he 
helped in adjusting to transfer by E 
strengthening presence of their old group E 
sociates, although the groups assuming m 
patients then had problems involving @ 
similation of clique subgroups. : of 

As a result of the special conditions | 
high group turnover and minimum screenih 
of new patients for the open-ended grouk; 
we were able to make some observations |” 
the effects of such turnover and of the ne 
lems resulting from heterogeneous gro 
membership. 

Only one element of variability 
the group members was found to © 
a clear-cut obstacle to group progress: 
was including in the group at the same 
psychotics with neurotics and patients w 
character disorders. This difficulty had ; J 
anticipated and Intake personnel were jent 
structed to exclude the psychotic pate a 
from referral to this group. However. 5° nis 
were mistakenly admitted. Invariably n 
led to severe difficulty. The psychotic me to 
bers’ distortions and projections serve ye- 
stampede neurotic patients with weak pe 
fenses into distorted thinking and made 
others distinctly uneasy and silent. ont 

While other elements of variability 2” oč 
group members, such as age, educatio”: orf 
cupational level, ethnic background, his jp 
of criminalism, or previous experien ues 
psychotherapy, created problems Or 15 Fa 
these served as useful group-discussio” pe 
terial and did not seriously hampe"! 
group work. grot 

As for the frequent turnover in ts E 
membership, this special group, with P 
pressed policy of turnover, did not p 
the same intensity of angry feelings 


tim? 


ee —_ 
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ao were | admitted as do closed 
this a pe _member’s role in 
newcomer t changed rapidly from that of 
soon after : old member because, relatively 
Perhaps =e entered the group, the older, 
More new oe — members, departed and 
eft, the a ers entered. When members 
tanks. ices seemed to be a closing of 
also seen pee, es a a of patients was 
remaining oe graduation” and left some 
tecognized. ients feeling deprived or un- 
H 

for S patients selected from the group 
r individual or for transfer to closed groups 
charged yy treatment? Most of those dis- 
Stopped co ere self-terminated; they just 
teady O ae to meetings. Three, as al- 
further e Eae were deemed not to need 
Stoup ni ag Readiness for transfer of 
termined er to further treatment was 
Sychother y his degree of acceptance of 
degree of apy as evidenced in his attendance, 
“Xpresseq Participation in discussion, and 
, Transf Opinions about treatment. 
indicated | to individual treatment seemed 
T lems = those patients who had acute 
Bb ines viously tied to recent events 1m 
NN t als, to other environmental pres- 
a ênts ai seemed indicated for those 
we Powerin se homosexual anxiety became 
in re g Tan in a group or for those who 
try. Stoup ssistic as to be unable to relate 
‘al, even after as much as 6 months 


tha, ot ma and for those who felt 
Wia indi more secure in a group setting 
Coul aml therapy, for in a group 
Very į are the therapeutic work. 
Ctio, ‘MPortant element in the constant 
Will and transferring of patients Was 
ini 0E thie therapist to be con- 
ose a8” his best patients, retaining 
Propa, Whom he had not yet been able 


tren, Ope 
rly 
tment evaluate or prepare for further 


in 
K3 


A REPLICATION 


effe, 
oce g two important factors on 
€ still remained untested. These 


were (a) the use of only one therapist for 
the experimental group and (b) a greater 
proportion of patients in the experimental 
group going on to closed group therapy 
while the control group had a greater pro- 
portion in individual therapy. We have rep- 
licated the study with another therapist and 
have investigated the average length of 
stay in closed group therapy in our clinic as 
opposed to individual therapy. Although 
Nash, Frank, Gliedman, Imber, and Stone 
(1957), report a much higher dropout rate 
in group therapy than individual therapy, 
we checked for this at our clinic and found 
no difference. During an g-month period, we 
sampled 36 patients who began individual 
therapy and compared their attendance over 
a 12-month period with 10 patients assigned 
to group therapy. There was no difference in 
dropout rate. 

We repeated the experiment of the open- 
ended group for an g-month period, using 
another therapist. During this period, of 31 
patients who were referred to the open- 
ended group, 22 attended for at least one 
meeting. In a manner similar to the earlier 
study, we selected 25 control patients who 
appeared for at least one therapy session. 
The experimental patients stayed an average 
of 7.7 meetings in the open-ended group and 
then went on to 10.2 meetings in further 
therapy. Thus, they were in treatment 17.9 
sessions. This is counted up to a cutoff date 
13 months after the start of the open-ended 
group. In comparison, the 25 control patients 
stayed for a total of 15.8 sessions. This 
minimal difference is deceptive. The first 
batch of seven patients in the open-ended 
group, the therapist transferred out very 
quickly (they averaged only 3.5 meetings 
in his group). These patients did not stay 
on in treatment very long. Their total treat- 
ment sessions Were only 7.5. Realizing this, 
the therapist kept his later patients in open- 
ended group longer, and the next consecutive 
15 patients averaged almost 10 meetings m 
his group and then stayed on for a total of 
almost 23 sessions by the cutoff date. Thus, 


if we compare the average of 23 to the 15.8 
we have a 


i ] group 

sessions of the contro 7 ; 

similar favorable result, as was the case with 
We found that patients 


the original study. 
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should be kept 10 sessions rather than just a 
few for best effect. 


REFERENCES 


Bitterman, C. M. Serving applicants when there 
is a waiting list. Soc. Casewk., 1958, 39, 356-360. 

Grvott, H. G. Group screening of parents in a 
child guidance setting. Int. J. group Psychother., 
1956, 6, 405-409. 

Grvorr, H. G, & Brex, L.. A study in non- 
attendance of initial interviews in a community 
clinic. Int. J. group Psychother., 1959, 9, 314-321. 

Horxrss, A. S., Krrecsrierp, M., & Sanns, R. An 
interview group therapy program for the waiting- 
list problem. Soc. Wk., 1958, 3, 29-34. 

Kurranp, S. H. Length of treatment in a mental 
hygiene clinic. Psychiat. Quart. Suppl., 1956, 30, 
83-90. 


Nasu, E. H., Frank, J. D, Grrepman, L. Hs 
IMBER, S. D, & Strone, A. R. Some factors Te- 
lated to patients remaining in group psycho- 
therapy. Int. J. group Psychother, 1957, 7; 26%- 
274. 

Peck, H. B. An application of group therapy t 
the intake process. Amer. J. Orthopsychiat., 1953, 
23, 338-349. 

Stove, A. R. Partorr, M. B., & Frank, J. D. i 
The use of “diagnostic” groups in a gt0UP 
therapy program. Int. J. group Psychother. 1954 
4, 274-284. 

Tanororr, L. H, Brown, W. H, Korates, I N» | 
Reser, D. E, Taratapce, M., Goates, B- Di 
& Srem, E. A note on intake diagnosis ? 
groups. Int. J. group Psychother., 1956, 6, 193-19% 


(Received June 18, 1962) 


Journal o, is 
1963, Val. SP Rug Bevchology 


NOTES AND COMMENTS 


COST EFFICIENCY AND TEST PREDICTION 


DAVID RIMM 


Stanford University 


In evaluating the efficiency of a 
the cost of a false positive error, 


be taken into consideration whenev! 
efficiency index based solely upon the expected 
tions might well lead to an incorrect 


rejection of a given test. A relatively 
account error costs, as well a 
and the population base rate. 
hypothetical examples, showing 
formation regarding error costs is in 


Th an 3 
cle, fear ormnative and thought-provoking arti- 
fectiy nes and Rosen (1955) considered the ef- 
technique of the more traditional assessment 
With a mee standardized test) in comparison 
Tent of ection procedure involving the assign- 


al : 
One o y subjects to the same class or category. 
Wite frequ main points of their paper was that 
M ently even the better selection instru- 


ents 
“sig ic inferior, in terms of total correct de- 
She, yp x the procedure of simply passing every- 
Decify id of the present paper is to 
tee ion te at the value or efficiency of a given 
tiv Wo cee taking into account the cost of 
4 eS may be le types of errors. Thus, false posi- 
vi dollars e considerably more costly, in terms 
Ce versa or man-hours, than false negatives, OY 


Dan 
et up stration, suppose a psychiatric test 
Psychiat order to screen out all potential 
tam, Sy tic “washouts” in a flight training 
at ie 0 to ed that, on the average, it costs 
of Belo ce ma through flight training. If, 
à neur of such training a man presents signs 
ately Psychiatric breakdown, he will im- 
e discharged. The price for such a 
RN reakda (a false negative) is $10,000 (less 
the taining) rey occurs prior to the conclusion 
Drog Ost a. the other hand, suppose that 
al pitty giacruitment and induction is ap- 
to the Stage eoo. An individual who is rejected 
ie Ypes ut who is actually quite resistant 
TE (a f if f stress one experiences in flight 
Bing g SE eis positive), represents the second 
8$ ion error, the price of this mistake 
è ratio of the cost of these two 
etter i is 1:10. In other words, it 1S 
© reject a fit individual than to 


err 


s the discriminatory p! 

The formula is t 
how a decision may 
troduced. 
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given screening instrument it is essential that 
relative to the cost of a false negative error, 
er this information is available. An 


proportion of correct classifica- 


decision regarding the acceptance or 
simple formula is presented, taking into 


ower of the instrument, 
hen applied to certain 
be reversed when in- 


accept or pass a man who is, in reality, unfit for 
such training. 
eehl and Rosen, the following 


Consistent with M 
notation will be used throughout this presenta- 


tion: 
ple called “sick” 


py is the proportion of sick peo 
posi- 


by the test (i.e., the proportion of true 
tives). 

po is the proportion of healthy people incor- 
rectly labeled “sick” by the test (i.e. the 
proportion of false positives). 

P is the actual proportion of sick people in a 
given population (i.e. the base rate). 


For N individuals, NP of them will actually be 
sick, and NP(— pı) will incorrectly be labeled 
“well” by the test. Thus there will be a total of 
NP(1— Pi) false negatives. For the same N 
individuals, N (1 — P) will actually be healthy 
and N(1—P)b2 will incorrectly be labeled 
“sick.” There will thus be a total of N(1 —P)p2 
false positives. 

Let “A” be the cost of each false negative, and 
«B” equal the cost of each false positive error. 
The total cost, for both types of errors for any 
sample of size N is then given by: 


= ANP(1 — fi) + BN(1 — P) px 


Total Cost 

If, instead of using a selection test, all indi- 
viduals are passed or called “healthy,” there will 
be a total of NP selection errors (all false nega- 
the cost for these errors will be 


tives), and 
given by: 
Base Cost = APN. 
We can now define “Cost Efficiency” as the 


proportion decrease in cost when a selection test 
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is introduced. Thus: 


Base Cost — Total Cost 
Base Cost 


Cost Efficiency = 


B(l — P)pe 
Cost Efficiency = pı — —s 
AP 
If the ratio of the cost of a false positive to the 
cost of a false negative is set equal to R, then: 


7 R(1 — P)p: 
Cost Efficiency = p, — —— 

A few examples should aid in the interpretation 
of the Cost Efficiency. In the hypothetical flight 
training example “R” was equal to Yo or «1. 
Suppose that on the average 25% of the men 
“wash out” during the critical last week of 
training. Suppose that a screening procedure is 
now introduced which enables one to identify 
correctly 60% of the potential washouts. But at 
the same time this procedure incorrectly labels 
20% of the potentially qualified men as unfit for 
training. From the above formula, the Cost Ef- 
ficiency for this selection test would be equal to: 


_ AG = .25).2 _ 


6 = SA, 


25 


This means that for every dollar that would have 
been spent paying for errors resulting from the 
procedure of accepting each applicant, .54 dollars 
will be saved as a result of using the screening 
test. To put it another way, for each dollar spent 
as a result of errors associated with passing every- 
one, only 1 — 54 = 46 dollars will be spent if 
the screening test is used instead. 
There may be situations where the cost of 
training is low in comparison with the cost of 
recruitment. This might be the case when a com- 
pany is confronted with the task of hiring semi- 
skilled construction workers for an extremely un- 
desirable job overseas. Here individuals might be 
selected on the basis of whether or not they would 
remain on the job for a certain period of time, 
since every applicant is assumed to possess 
enough skill to handle the job. Suppose that on 
the average 50% of the men quit the job before 
they have completed 2 years work. Now suppose 
that a selection procedure is introduced which 
is capable of correctly identifying 70% of the 
“quitters” but mislabels 25% of the “nonquit- 
ters.” Finally, suppose that in the long run, 
rejecting a nonquitter will be four times as 
expensive as accepting a quitter. 


The Cost Efficency for this selection test would 
be equal to: 


a 40 — .5).25 
T — — M 


5 


= — .30. 


F ` inter- 
A negative Cost Efficiency has the same int 


pretation as a positive index. In this m 
for each dollar spent due to errors resulting a 
hiring every man, 1 — (—.30) or 1.30 gollan i 
be spent if the screening procedure is oe 
instead. In this example it would be much a 
economical simply to accept every applican a 

If “R” is set equal to unity, then the are 
parison which the Cost Efficiency measure Pan 
would be reduced, in effect, to the sort of Fs 
parison Meehl and Rosen considered in w 
paper. In our flight training example an R 0.00: 
would have resulted in a Cost Efficiency of tial 
Since a zero Cost Efficiency implies no dime d 
advantage, the test would have been el a 
because administering any instrument costs A 
certain amount of money. In the second examp t 
an “R” of unity would have resulted in & ly 
Efficiency of .45, so that the test would proba! d 
have been accepted as part of the stan Jes, 
induction procedure. Note that in both ezamp 
ignoring relative error costs leads to a wat 
which is opposite to that which would have 
made had cost been considered. 

As has been indicated, whenever the 
Efficiency for a given instrument is po e 
instrument has the advantage over base rate ph 
diction. The maximum value of the Cost or" 
ficiency is unity, possible only when the preted 
tion of true positives is equal to 1 and vo 
either the base rate is equal to 1 or there es es 
false positives. The minimum value for the x at’ 
is negative infinity, with the index tending texnce 
this value as the base rate approaches zero: © cy 
the slope of the curve relating Cost Effie! 
and base rate is always positive (see Fig. 4 


cost 


COST EFFICIENCY 


4 5 6 
BASE RATE 


Fic. 1. Cost Efficency as a function of nte! 
rate, P. (The slope is given by Rpo/P?; the eve 
cept equals Rp»/p: Rpe. This particular 
for the flight training example.) 
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increase j y 
e A in Pie rate will necessarily mean an 
in Cost Effici : 3 
equal, t Efficiency, all other things being 
DISCUSSION 


cta ueh this development we have made 
is that nh icit assumptions. One such assumption 
is negligible. ty of administering a screening test 
ably be om n most instances this would prob- 
individuals oe Another assumption is that all 
n many ¢ who pass the test will be accepted. 
that only ases a selection ratio is involved such 
available x certain fixed number of openings is 
Vary with ae the Cost Efficiency would tend to 
Would have as number of applicants, this index 
Consideration. be adjusted so as to take this into 

ion. This might be accomplished by 


XPressin 
tutti g both a G 
Utting etéa, bı and pə as functions of the 


Whe 
Cost a these assumptions can be met, the 
Course p ERCY can be calculated, providing of 


e t] 
hat knowledge of the base rate and error 


Jour, 
Iggy" of 3 
3, Vol, Senvutting Psychology 


ON THE DETECTION OF WILLFUL FALSIFICATIO. 


costs is available. Unfortunately, as Meehl and 
Rosen have pointed out, this sort of information 
is frequently nonexistent. In many situations, the 
cost of errors could never be measured in dollars 
and cents (e.g, an unfit pilot dying in a plane 
crash, or a mental patient being refused therapy, 
though in fact he is quite ill and experiencing 
considerable anguish). However, some sort of 
relative valuation of the types of errors will 
almost always be possible. If this judgment can 
somehow be translated into a number (R). no 
matter how cold-blooded this procedure may 
seem, it would most certainly be better than 
assuming that all types of mistakes are equally 
bad. 
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the quite evident that the patterning of 
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i Profile can be significantly altered by 
Studja?" ace of the subject since it is a 
falar’ ih a Ellis (1953) cites 22 of 25 
Dg Able, ee demonstrate the device to be 
Mycbiatrie ayson and Olinger (1957) found that 
t I as Patients asked to respond to the 
Pto tig, ould a “normal” individual were able 
ed tg pase improved profiles when com- 
a E pra MMPI results obtained from 
ig et upon admission to the hospital. 
Bostic ie! usefulness of the device as 4 
ning or trument, particularly when used for 
4 impor in situations where social desirability 
‘ee! ce element, seems to be dependent 
atte bilit, n the honesty of the subject or on 
Mts”, Of the interpreter to detect willful 
ishonest reporting. 
ome, tion of the MMPI includes four 


er] Š 
a 2 Director, Bureau of Testing and Re- 
auw University. 


“validity scales” in order to provide the inter- 
preter with some basis for the detection of 
fraudulent profiles. There has been some support 
for the usefulness of these validity scales. Cofer, 
Chance, and Judson (1949) found that college 
students attempting to malinger will have sig- 
nificantly high F scores while those attempting 
to fake normal can be detected by an additive 
combination of the Z and K scores. Calvin and 
McConnell (1953) refer to eight studies in which 
faking on the MMPI can be detected at a sta- 
tistically significant level using the validity scales. 
Recently, however, other studies have suggested 


high F score is y 
havioral di 


suggesting thi 1 
sig S d in the light of the psychological 


interprete 
er of the individual. The use of a linear 
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combination of scores to detect malingering on 
the MMPI was first suggested by Gough (1947, 
1950) who reports that the formula F minus K 
satisfactorily identifies a substantial portion of 
malingered profiles when a cutoff score of +9 
is used. Hunt (1948) using Gough’s “F — K Dis- 
simulation Index” reports similar success using a 
cutoff score of +10. Both Gough and Hunt at- 
tempted to find a cutoff score for the F—K 
index which would satisfactorily detect “faking 
good” profiles but report little success as con- 
siderable overlap exists between F — K score dis- 
tributions derived from faking good profiles 
and “honestly reported” profiles, Gough (1954) 
has also found that there are 74 items in the 
MMPI to which persons attempting to malinger 
respond significantly different than do diagnosed 
psychoneurotics. Gough refers to this grouping 
of items as a “Dissimulation Scale” (Ds). 

The purpose of the present study is to investi- 
gate methods, including the F—K index, Ds 
scale, L+ K index, and a variety of unexplored 
linear combinations of scores, that might be 
useful in distinguishing between malingered, 
faking good, and honestly reported Profiles ob- 
tained in a screening situation, 


PROCEDURE 


Fifty college students, including 25 males and 25 
females ranging in age from 20 to 22 years, were used 


TABLE 1 


Comparisons OF MMPI Mean STANDARD Scores, Ds Scores, AND SPECIAL LINEAR 
COMBINATIONS or Raw Scores FOR THE HONESTLY REPORTED AND 
WILLFULLY ALTERED Recorps or Each Group 


as subjects. They were divided into two groups a 
the basis of similarity of academic sader 
average. One group contained 13 males an ales 
females, while the second was comprised of 12 ei | 
and 13 females. In the first part of the ive 
the subjects in one group, Group G, were aske 
respond to the MMPI in such a manner as to arra 
normal or socially desirable as would an atre 
job or school applicant. The subjects in the seca 
group, Group M, were asked to respond 4 ant 
MMPI in a manner as to appear sufficiently de 
to be exempt from some social responsibility ‘ia 
as military service but not so deviant that E jon 
tionalization would be required. In the second po dt 
of the investigation all 50 subjects were aske 
respond to the MMPI again but this time in 4 di 
pletely honest manner as if they were interene 
gaining information concerning themselves. WI a 
names were requested in the first administrate 
the MMPI when the subjects were attemptini i 
falsify profiles, complete anonymity was assured, 
the second administration and only data Gaa. 
sex and group classification were recorded Í 
effort to insure the honesty of reporting. 


RESULTS iG. 
i A su | 
All profiles were scored for all nine diag? 


scales and the four validity scales plus the F H- l 
index. All records from Group M were also soi . 
on the Ds scale and the Z- K index was ie 
puted for all records from Group G. cle 


Group G (N = 25) 


Group M (N = 25) 


Honest Faked Honest Faked 

$ = I 
Scale £ SD x SD t ï SD Xx SD f 

= 1.9% 

A 43.32 5.77 50.64 8.16  3.66* 43.72 415 4716 10.14 0, 
F 51.68 7.26 47.72 5.08  2.24** 55.28 6.82 100.80 17.65 17, 
K 58.20 7.42 62.92 633  242** 57.44 7.67 43.64 504 56, 
Hs 48.31 674 4844 714 007 47.50 7.62 70.44 1886 70, 
D 48.77 9.90 44.72 512 1.84 53.15 6.75 81.96 19.40 42, 
Hy 55.38 7.42 56.20 412 049 53.19 6.27 67.04 15.26 9%, 
På 54.96 7.42 51.04 484  223** 57.50 7.60 85.32 13.12 60, | 
Pa 53.58 8.05 53.32 6.61 0.13 51.38 9.23 79.28 18.79 10 
Pi 53.65 8.60 50.80 4.74 148 54.96 742 85.32 13.12 1084 
Sc 55.65 7.49 54.20 4.47 084 57.04 8.64 100.00 19.47 44; 
Ma 55.77 8.37 59.36 13.55 113 59.77 12.19 75.44 14.52 br 
Ds — 10.62 5.57 40.27 11.76 jz 
P-K “1808, 3.68 MA aw B40 H ez s a 
L+K 18.80 4.89 23.40 4.96 3.31* = a 
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tio 
scales and onestly-reported records for all 
the more special score combinations. Some of 
Pes emp statistical data is shown in 
Noted i. examination of Table 1 it will be 
01 exists Ms statistically significant difference at 
except the i, all of the regular MMPI scales, 
Teported wee when the Group M honestly- 
Indicating th malingered records are compared 
Was succes a the attempt at willful alteration 
considerably ul. It should also be noted that 
etween RA fewer significant differences exist 
tecords ae honestly-reported and faking-good 
01 exist roup G. In this instance a difference 
at 05 She for the L scale and differences 
Ng considerati the F, K, and Pd scales suggest- 
taken fro erable similarity between the records 
of willful, Group G. ¢ tests comparing each set 
teported Altered records with all 50 honestly- 
erences eg cords yield statistically significant dif- 
sentially the same as those found when 


fle Yifu 


jottrast Of the 25 scores falling below 20. In 
0 64 » th 


hone cme of less than 12, the range 
m Stlyrepon eg 30, While none of the 50 
x tee ee records yielded an F score of 
lege “tion of re Me range being 0 to 11. The 
die Uecess ful king-good records was considerably 
01 geNce of y Although a statistically significant 
aig the z 05 for the F and K scales and 0 
kogt betiveg 21e and F — K and Z + K indexes 
SN records. the honestly-reported and faking- 
the * in the of Group G, considerable overlap 
appli fulnes ranges of scores which eliminates 
Gro tion, o these items in terms of practical 
Whi), PG viel he honestly-reported records of 
~a q the fees an F—K range of +5 to —22 
x a ~23 48-800d records have a range 0 
“ale shior The Group G distributions for the 
itor! range of 9 to 23 for honestly- 

Tape and faces, aNd 9 to 24 for faking-good 
Gye8ctivay, 10r the F scale, O to 14 and 0 to 12, 
& are Tie L scale distributions for 
to 6 for honestly-reported records 
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and O to 9 for faking-good records with con- 
siderable overlap. Likewise, the distributions of 
the L+ K scores show too much overlap to be 
practically useful in detecting faking-good records 
with the honestly-reported distribution ranging 
from 12 to 29 and a faking-good range of 11 
to 30. Not shown in Table 1 are a variety of 
linear combinations which were evaluated with 
regard to the detection of faking-good records. 
These include 2L + K, F — 2L, K + D, K + Pa, 
and 2L-+Pd. None of these combinations 
produced satisfactory frequency distributions 
whereby more than 65% of the faking-good 
records could be accurately discriminated from 
the honestly reported records although all re- 
vealed statistical significance at least at .05. 


DISCUSSION 


The results of this investigation lend support 
to the usefulness of Gough’s Ds scale and 
F— K Ž dissimulation index as methods of de- 
tecting malingering. This data also supports 
Cofer’s (Cofer et al., 1949) findings that the F 
scale taken alone can be useful for the detection 
of malingered records. In contrast, this data does 
not support Cofer’s findings that a L + K index 
will satisfactorily detect faking-good records nor 
the more general proposition that the K scale 
taken alone is practical for this purpose. On the 
contrary, the results suggest that deliberate at- 
tempts at faking “normal” profiles can be suc- 
cessful and are reasonably undetectable, at least 
through the use of any single scale or linear com- 
bination of raw scores. The question of whether 
the honestly-reported profiles truly represent sin- 
cere reporting must, of course, be considered. All 
evidence seems to indicate that they do. For in- 
stance, the honestly-reported records of Group G 
are highly similar to those given by Group M 
even though the previous records of these groups 

ifferent. In addition, significant 


were completely d 
changes did take place in the scores of the L, F, 


and K scales of the honestly-reported records 
when compared with the faking-good profiles. Fi- 
nally, great pains were taken to insure anonymity 
hen the honestly-reported rec- 


to the subjects W 
ords were requested, and the intent of the study 
was discussed with the subjects prior to the first 


MMPI administration so that there appears to be 
no realistic reason to shed doubt on the validity of 
their responses. Assuming that the data collected 
are valid, it can only be concluded that the 
usefulness of the MMPI as a screening device 
seems seriously limited by the fact that willful 
manipulation of the record can be easily ac- 
complished. When such manipulation is in the 
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direction of creating a “socially desirable” profile 
there appears to be no way of detecting this 
manipulation with reasonable confidence. Of 
course. this study has not considered the element 
of total configurational signs as evidence of 
willful alteration. However, the number of actual 
linear combinations attempted was such as to 
include many of the diagnostic scales although 
to no practical avail. Thus it would seem reason- 
able to suggest that conclusions concerning willful 
alternation in the direction of creating socially 
desirable profiles based on configural signs be 
made with definite caution. It also seems worth- 
while to note that the K scores of the honestly- 
reported records used in this study tend to be 
relatively high, which in turn obviously suppress 
the interpretive value of the K scale as a general 
measure of defensiveness. Since the subjects in 
this investigation are college students oriented 
toward intellectual sophistication it might be 
hypothesized that the high mean K scores repre- 
sent a set toward social desirability which might 
not evidence itself so intensely in a population 
manifesting a greater range of intelligence. 
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accept Brief Reports of research studies in 
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out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 
page and prepared according to the specification 
given below, 


2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 


Documentation Institute to insure indefinite avail- 
ability, 


3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 


charge to all who Tequest it as long as the supply 
lasts. 


4. Agrees not to submit the full report to another 
journal of general circulation, 
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Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits, 

To insure that the Brief Report will be no 
longer than one printed page, its typescript, 
including all matter except the title and the 


author’s lines, must not exceed 85 lines a 
eraging 42 characters and spaces in leg a 
Set the typewriter margins for short line i 
42 characters, which are 3.5 inches Jong 
elite typing, and 4.2 inches long in eee | 

The manuscript of the Brief Report af 
be double spaced throughout. Except E b 
short lines, it follows the standard sty sion 
the 1957 revision of the APA Publica” 
Manual. Headings, tables, and references ip 
avoided or, if essential, must be counte ac 
the 85 lines. Each Brief Report must be 
companied by a footnote in the style be ol 
which is typed on a separate sheet an 
counted in the 85-line quota: 
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of Congress, 


fi 

Extended report. Because the extend? , js 
port is intended for photoduplication, 3 sty!” 
not copy to be sent to a printer, its a of 
should differ in several ways from t 4 oft 
other manuscripts: (a) The extended mo! 
should be typed with single spaciDS gy 
economy in duplication, (b) Tables 2” tes! 
ures should be placed adjacent to the b 
which refers to them, A caption sho" gul 
typed below each figure, (c) Footnotes bi 
be typed at the bottom of the page 0” nect 
reference is made to them. In other E eo 
the full report is prepared in the style 
fied by the Publication Manual. 


Ip 
| Journal of C. 3 
1963, Vol. 27 Noi E pp eey 


| 


NEED FOR APPROVAL AND THE PREMATURE 
TERMINATION OF PSYCHOTHERAPY ` 


BONNIE R. STRICKLAND 


Emory University 


This study tested the hypotheses that approval-depende 


tend prematurely to terminate psychotherapy, 
d than patients 


as more defensive and less improve! 


85 psychiatric outpatients completed the Mar 
for approval) and rate 


sirability scale (the measure of need 


attitude towards patient, patient’s a 


Approval-motivated females ra 
Position and social class failed to 
these findings were discussed. 


Series of recommendations on the 
of psychoanalysis, Freud (1949) 
that the premature termination of 
Wsuitabiine a be attributed to the initial 
l, i inad ty of the patient for psychoanalysis 
Psychosis equate intelligence, low social class, 
Nifestat; or to an early and dramatic 

of ne of resistance. He advocated a 
analyst tial analysis in order to permit 
therap to assess the patient’s motivation 
en ad i whether or not he met the 

one criteria for treatment. : 

i as rary research on early termina- 
Mpirica] In the main, taken a straight 
onsi parte roach to the problem. A not 
as e body of studies in the last 
i established, with certain incon- 
and contradictions, two major 
of abrupt termination. The first 
Hia variables roughly corresponds 
u on of unsuitability: educa- 
class an, Miller, & Smelser, 1958), 

(Auld & Myers, 1954; Hollings- 
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st 
Mie 
q : 
of this study were presented at the 
Association meetings, 


ment in psychotherapy. Therapists rat 
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and satisfaction with therapy. Additiona 
class, and ordinal position. Results confirmed t 
for approval group terminated significantly 


motivated patients were generally given more negative rati i 
ted themselves as more improved. Ordinal 


predict stay in therapy. 
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DOUGLAS P. CROWNE °? 
Okio State University 


nt individuals (a) 
and (b) are rated by therapists 
less approval-dependent. 
lowe and Crowne Social De- 
d their improve- 
30 of these patients on defensiveness, 


titude towards therapist, improvement, 


1] measures included diagnosis, social 
he hypotheses. The high need 
(p < .005) earlier. Approval- 

ings by therapists. 


The implications of 


8; Rosenthal & Frank, 
1958), and intelligence (Affleck & Mednick, 
1959; Auld & Eron, 1953). The second set 
of predictors is probably best conceptualized 
under the heading of defensiveness. Indices 
such as the Rorschach, MMPI, semiprojec- 
tive sentence completions, and therapists’ 
ratings have shown differences between 
patients who prematurely terminate and 


those who continue in psychotherapy. Un- 
(Affleck & 


productive Rorschach records 
Mednick, 1959; Gallagher, 1953; Gibby, 
Stotsky, Hiler, & Miller, 1954; Taulbee, 
1958), more limited acknowledgment of 
symptoms and of personal dissatisfaction on 
the MMPI (Taulbee, 1958), sentence com- 
pletions indicative of evasion and unwilling- 
ness to reveal oneself (Hiler, 1959), and a 
more limited prognosis as rated by therapists 
(Garfield & Affleck, 1961) appear to be 
characteristic of the abrupt terminator. These 
findings are further consonant with the es- 
sential meaning of Freud’s early clinical 
observations. 
The present study was undertaken to test 
a theoretically derived prediction concerning 
the phenomenon of defensive early termina- 
tion of psychotherapy. The major hypothesis 
was that patients characterized by a high 
need for approval are more likely to terminate 
ly. As a consequence of 


sychotherapy ear 
theit need to maintain and defend a vulner- 


head & Redlich, 195 
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able self-image, these individuals are more 
likely to avoid the threats associated with 
personal disclosure (and, possibly, antici- 
pated social censure for seeking this form of 
help with personal problems) by breaking 
off therapeutic contact. Previous research on 
the need for approval has found approval- 
motivated individuals to be more compliant, 
persuasible, eager to please, and conforming 
in a variety of situations: opinion con- 
formity (Marlowe & Crowne, 1961), social 
conformity (Crowne & Liverant, 1963; 
Strickland & Crowne, 1962), verbal con- 
ditioning (Crowne & Strickland, 1961; 
Marlowe, 1962), perceptual defense (Barthel 
& Crowne, 1962), and attitude change 
(Salman & Crowne, 1962). The present 
hypothesis is based on an extension of the 
meanings of the construct of need for ap- 
proval from an earlier conceptualization 
emphasizing compliance and influencibility 
(cf. Crowne & Strickland, 1961). Recent 
work by Conn (1962) and Crowne and 
Liverant (1963) suggests that individuals 
with a high need for approval are more de- 
fensive and are concerned with avoiding 
threats to self-esteem. This proposition re- 
ceives further support from the initial finding 
of greater test taking defensiveness by per- 
sons with high scores on the Marlowe-Crowne 
Social Desirability (M-C SD) scale, the index 
of the need for approval. 


METHOD 


The data for the present study were collected 
as part of a large-scale research project at Columbus 
Psychiatric Clinic assessing both patient and 
therapist variables. The therapists were not ap- 
prised of the nature of the research, and none of 
them had any knowledge of the patients’ standing 
on the independent variables, 


Subjects 


Eighty-five patients, in treatment during 1960 
and 1961, served as subjects in this study. Through- 
out the interval spanned by the research, almost 
every patient seeking help was admitted to psy- 
chotherapy which, with only rare exceptions, con- 
sisted of weekly therapeutic hours. A majority of 
the patients were given neurotic or character dis- 
order diagnoses, although a considerable number of 
them were classified as psychotic or prepsychotic. 
These diagnoses were made in staff meetings by 
resident psychiatrists on the basis of intake inter- 
views and, occasionally, psychological test data, 
The age range of the patient sample was from 
20 to 54, with a mean of 33.7 years. The thera- 


pists included psychiatrists and psychiatric residents 
sychologists, and social workers. 

othe final sample was comprised of two gro 
of patients. In Group I were 23 patients, 13 ao 
and 10 females, who terminated psychothet 
during the months of October and November a 
Group II consisted of 62 patients, 26 mals oe 
36 females. This was a replication sample term! 
ing sometime during May, Junc, or July 1961. 


Measures 


jor 
The index of the need for approval, the or 
independent variable, was the M-C SD eb 
(Crowne & Marlowe, 1960). The M-C SD sc 
a 33-item, true-false questionnaire assessing an 
degree to which individuals avoid maen $ 
depict themselves in improbably favorable _ 0 
An illustrative item is, “I’m always willing 
admit it when I make a mistake.” ein 
The patients rated their own improvement, 
therapy on a nine-point scale ranging from 
proved (1) to unimproved (9). Within the uit 
pass of the larger Project, therapists were red gel 
to rate each of their patients on a slightly e ale: 
version of the Seeman (1954) Case Rating the 
Of interest in this study were five items On ah 
Case Rating scale: Number 5, the therapists “ng 


cow 


A urih? 
mate of the patient’s attitude towards him Sas 
therapy; Number 6, the therapist’s attitude ont! 


3 p r50 
his patient; Number 7, the degree of per ont 


integration versus defensiveness of the Lage: 
Number 9, the therapist’s estimate of the pa an 
satisfaction with the outcome of therapy) euti 
Number 10, the therapist’s rating of en se 
outcome. In contrast to the original, whic’ gist! 
nine-point scales, the range of values for the ™ 
version was from 1 to 16, . the pe 

The closing summaries of treatment in yan” 
tients’ folders were rated on a five-point scale oh 
ing from optimal improvement and mutua af poe 
ment on termination (1) to termination Y (3)' 
the therapist’s agreement and no improvem¢? of 
Additional variables included education oF sbi) 
cupational level, which were combined in = gs) ef 
similar to the procedure followed in the Holli”: io" 
Index of Social Position (Hollingshead & yas iN 
1958), and ordinal position, Birth order W496?) 
cluded in an attempt to replicate Schachter $ Py 
findings that first- as opposed to later-b?™ sb 
tients continue longer in psychotherapy. Th 
of hours of therapy at termination constit¥ 
major dependent variable, 


Procedure w 


sen te 
The M-C SD scale was mailed to the ae a 
Group I within a 3-month interval from at 
of termination. In Group II, 33 of the a reg 
were given the SD scale at the Clinic. T the owh 
ing 29 patients in this group completed, Gy 
ventory at the conclusion of therapy 25 eats 
I. As shown in Table 1, the M-C scale ™ rou oh 
Standard deviations of the patients in a 
and II who completed the measure after ii 
therapy are closely comparable to the 
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Stan i igs r 
at te Cn of the Group II patients tested 
the ad of return on questionnaires mailed to 
10%; in fa a Groups I and II was approximately 
naires ae ah most of the uncompleted question- 
f Re neo due to incorrect addresses. 
the termin: i ney U test revealed no difference in 
Patients Wis number of therapy hours of the 
did not U > returned the SD scale and those who 
Unlikely th tt 1366.5, z =40, ns). Thus, it appears 
dependent a bias affecting either the independent or 
the rl nS was introduced into the study 
Neither d erapy administration of the SD scale. 
biased pores, the sample appear to have been 
Observeq 1 selective returns nor were differences 
m etapy SD scale scores between patients tested 
termin, ti at the Clinic and patients tested after 


Ratings 

in 

Mailed ~ a our improvement in therapy were 
tion, Sixty. e patients within 3 months of termina- 
Own imp “three of the 85 patients completed their 

e ovement scale, 

at vary tists completed the Case Rating 
i ent kon oe depending on the date each 
a ay T treatment, between November 1960 
Tam made Le ‘The therapists’ ratings were unl- 
s the time While the patients were in treatment. 
Wd have A ratings were made, the therapists 
tan’ ulim ad no reliable knowledge of the pa- 
tin, ate dates of termination. Thus, these 


scale 


Bs 
lepap. Wel paj 
met ete not contaminated by the possibility of 
tints ĉttitudes towards early terminating 
RESULTS 


elow, were found, and the analyses 
Tdingly carried out without regard 
h ` ae age. 
WPothesig alish the test of the central 
fy: ied scores on the M-C scale 
87 fo omized at the overall mean 
Val ` Yield the high and low need for 
groups. The distribution of the 
ours of therapy of these groups 


acco; 
ex 


TABLE 1 


S. 
a SCORES or PATIENTS TESTED AT THE 
AT THE ConcLUSION OF THERAPY 


Group N M SD 


I 

T (Posey x 

n At cling! 23 13.35 
SP Rost) 34 11.53 


Post . 
(Qa mination) 28 


g, 
VG me eee ss 
Bug ll ve = — - = 
May STS Ua) = 1.02; £ (Ia versus Ii) = 0 


nation) 


7; 


TABLE 2 


DISTRIBUTION OF THE NUMBER OF Hours OF Psycuo- 
THERAPY FOR HIGH AND Low NEED FOR 
APPROVAL PATIENTS 


High Low 
need need 
for ap- for ap- 
Hours of proval proval 
therapy (N = 43) (N = 42) 
71+ (to 284) 5 11 
61-70 1 1 
51-60 1 5 
41-50 0 5 
31-40 7 2 
21-30 10 9 
11-20 11 5 
1-10 8 + 


is shown in Table 2. By a Mann-Whitney U 
test, high and low need for approval patients 
differ significantly in the number of hours 
of therapy (z= 2.84, P< .005?). Looking 
at the distribution of therapy hours in Table 


2. it is clear that above the median hours of 


therapy. highs are sharply underrepresented, 
below the median 


while the reverse is true 

hours of therapy. Patients low in the need 
for approval tend to be more equally split 
above and below the median. At the extremes 
of the distribution, however, low need for 
approval patients, in contrast to the highs, 
are overrepresented in the highest number 
of therapy hours and underrepresented among 
the very early terminators. It should be 
noted that differences of approximately the 
agnitude were found in the two 
samples (Groups I and II); thus, at least 
within the same clinic, some evidence for 
the replicability of this finding is afforded. 
For the 30 patients in the second group 
whom the therapists completed the Seeman 
scale, correlations were computed between 
these ratings and the M-C scale, the number 
of hours of therapy, and the index of social 
class. These correlations, and the intercor- 
relations of the five Case Rating scale items, 
are presented in Table 3. Approval-dependent 
patients tended to receive more negative 
ratings on each of the five scales. The most 
striking of these correlations is that between 
the need for approval scale and therapists 


ratings of personal integration versus defen- 
gs 


same mMm 


on 


3 Two-tailed test. 
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TABLE 3 
INTERCORRELATIONS OF Hours OF THERAPY, NEED FoR APPROVAL, AND THERAPISTS’ RATINGS 


Variable 


Variable 1 2 3 4 5 6 7 

1. Need approval 68a 
2. Hours of therapy —.28, 
3. Patient’s liking/respect as 

for therapist —.18 23 
4, Therapist’s liking/respect n 

for pale i. —.35 -40* 66** 
5. Personal integration versus . 3 

defensiveness of patient —.67** 57% 27* 33** 

. Patient’s satisfaction with 

ý Tany —.32 2 .60** 46** 554 
7. Amount of improvement 

in therapy —.34 45*| 46** 50** 163" Agre 

N = 83 07 
8. Social class |-.07 .19] |.27 00 .28 ao" ‘ | 
—N = 30— l N = 30 
a Test-retest reliability, 5-6 month interval during which patients were in therapy (N = 24), 
N =85. 
*p < 03. 
"p< 01. x 
of th? 

siveness or disorganization (r = —.67). The pists knew them less well, As a test w 


remainder of the need for approval-therapists’ 
ratings correlations are consistent but fail 
to reach a conventional level of significance. 
Each of the Seeman scales correlated 
moderately and positively with the number 
of hours the patient remained in therapy. 
The index of social class was found to be 
unrelated to the M-C scale (r = —.07) and 
negligibly correlated with length of stay in 
therapy (7 =.19). Moderate positive cor- 
relations were obtained between social class 
and therapists’ ratings of the patients’ liking 
and respect for the therapist (r= 27), 
satisfaction with therapy (r= 36), and the 
defensiveness scale (7 = 28); Only the cor- 
relation with satisfaction with therapy at- 
tains significance ($ < .05). 
Since the patients had been in therapy 
for varying lengths of time when the thera- 
pists’ ratings were made (that is, it was not 
possible to obtain the therapists’ ratings after 
a standard interval in therapy for all pa- 
tients), it is conceivable that the approval- 
motivated group was rated lower simply as 
a function of their tendency to remain less 
long in therapy. Thus, the negative ratings 
may simply reflect the fact that their thera- 


possibility, partial correlations were ther” 
puted between the M-C scale and the the 
pists’ ratings with the number of hour ed 
patient had been in therapy at the an or 
rating partialed out. Table 4 shows ne nip? 
relations. It is clear that the relatio”? pe 
between 
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‘The negative sign of the last two cales- 


is due to the reversed scoring of the § 
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TABLE 4 


P 
‘ieee CORRELATIONS BETWEEN NEED FOR 
PPROVAL AND THERAPISTS’ RATINGS 


Partial cor- 
relations with 
7 ka : need for 
Therapists’ ratings approval 
i ue 
4 Patient's liki ce = 
i ationt’s liking ‘respect 
2 for therapist ata 
Theat iii 
herapist's liking ‘respect 
for patient aas 


Pers i 

roe integration versis 

k lefensiveness of patient 
Atlan’ iiaii 
tient's satisfaction with 
therapy 


A ʻi 
mount of improvement 
in therapy 


Note ~ 
aa gf 
< OL. 30, 


A R 
i At termination, 63 patients completed 
Of their own improvement on @ scale 
8 from 1 (improved) to 9 (unim- 
ship bet: For males, there was no relation- 
and t ven ratings of own improvement 
 COrrelati 1-C scale (r = .01). A significant 
%39) OP Was obtained for females (r= 
| fen ing E ith approval-dependent females 
Ratings ő rate themselves as more improved. 
er a own improvement correlated —.16 
l “ber o and —.26 for females with the 
of hours of therapy. 


o 
reed the stability of the index of 
terapy approval during the course of 


Siti and as a further check on the 
 Yatigny) Y Of administering the independent 
te ition termination to a sizeable 

Admini of the sample, the M-C scale was 

Stered to 24 patients who had been 
Se miis earlier and then remained 
ete In this interval, patients com- 
he « letest €n 20 and 25 hours of therapy. 
lg titial ppoorelation of .68 was obtained. 
` at 4 ean of the test-retest group was 
this Was re second testing, the obtained 
am ‘42 (t = 0.57, ns). Moreover, In 

Ple, the mean M-C scores are 
ae mean of the nae 
Ve .92 versus 12.25, t= 0.57; 
tagte i 12.25, ¢ = 1.42 ns). It remains 
i hate at the correlation between the 
: opts need for approval and number 
toup qy therapy among the 34 patients 
~.34. * Who took the scale at the Clinic 

artialing out the number of 


of 
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hours of therapy at the time the scale was 
given, the correlation was —.29. This, of 
course, is almost identical to the —.28 cor- 
relation reported in Table 3. 

In an attempt to replicate the finding 
reported by Schachter (1959) that first-born 
and only-child patients remain longer in 
therapy than later borns, these two groups 
of patients were compared in length of stay 
in therapy. For 49 patients on whom ordinal 
position was recorded, a Mann-Whitney U 
test revealed no difference in hours of 


therapy (z = 0.22, ns). 
Discussion 


High and low need for approval patients 
differ strikingly in the length of time they 
remain in psychotherapy. Approval-dependent 
patients terminate much earlier than those 
less approval-motivated. To accept the hy- 
pothesis that the earlier termination of high 
need for approval patients represents a 
means of avoiding anticipated threats to self- 
esteem, it is first necessary to exclude the 
possibility that they are simply less dis- 
turbed and less in need of help than those 
less dependent on approval. As established 
by therapists’ ratings, approval-dependent 
patients appear to be more defensive or 
disorganized. Moreover, they tend to be 
judged by their therapists as less personally 
liked, less satisfied with the progress of 
therapy, and less improved in treatment. 
Finally, no systematic differences were found 
between high and low need for approval 
groups in type or severity of diagnosis. It 
would appear that the approval-motivated 
group did not terminate earlier as a result 
of progress and improvement, and their early 
termination thus takes on the character of 
resistance and defensiveness. : 

These findings are consonant with recent 
studies on the approval motive (Conn, 1962; 
Crowne & Liverant, 1963) and support 
the conceptualization of the approval-de- 
ent person as one who is concerned with 
protecting and maintaining a vulnerable self- 
image. Thus, it seems likely that, faced with 
the prospect of self-revelation and the threats 
this poses to his defensive image of himself, 
the approval-dependent patient convinces 

ychotherapy 1S really not 


imself that PSY 
peor while or that, in a very few hours, he 


pend 
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has attained sufficient improvement. For the 
approval-dependent patient, the dilemma is 
one of giving up a defensive self-conception 
or defying the therapist. The outcome of such 
an avoidance-avoidance conflict is leaving 
the field. Additionally, it is possible that these 
patients fail to obtain highly valued af- 
fectional and dependency gratifications from 
their therapists which might alter the balance 
of the conflict. A corollary, but not incom- 
patible interpretation, is that the approval- 
motivated individual is apprehensive about 
possible social criticism for seeking therapy 
and the (to him) implicit admission that he is 
“crazy.” It is worth observing that the 
defensive test-taking behavior of high need 
for approval persons is consistent with this 
general interpretation of our major finding. 

In this study, both time pressures and 
another large scale research project being 
conducted at the same time necessitated ad- 
ministering the SD scale to a large number 
of patients after the conclusion of psy- 
chotherapy. This raises a crucial methodolog- 
ical issue: whether, perhaps, scores on the 
scale were affected by participation in psy- 
chotherapy, thus rendering the findings more 
parsimoniously interpretable in terms of the 
influence of therapy on the questionnaire, 
As detailed above, an attempt was made 
to check on the possible effect of therapy 
on the scale by conducting a long term, test- 
retest analysis of the index of need for 
approval while patients were in treatment, 
No evidence for systematic changes was 
found. Also, the relationship with number of 
hours of therapy was maintained in the group 
given the M-C scale at the Clinic. 

In fact, the test-retest data strongly sug- 
gest that the approval motive is very re- 
sistant to change. If any procedures could 
systematically alter dependence on the ap- 
proval of others, it would certainly be ex- 
pected that psychotherapy would be high 
on the list. Whether premature termination 
can be lessened among approval-dependent 
individuals is yet an open question, although 
it may be that, in the early hours of psy- 
chotherapy, before an enduring and trusting 
relationship with the therapist has been 
established, minimizing the: demani for 
frank personal revelation and providing re- 


assurance and support would tend to reduce 
this form of resistance. nly 

Ratings of own improvement showed 0 js 
a very slight relationship with hours E 
therapy: patients rating themselves 25 a y: 
improved tended to remain longer in theriak 
While no relationship was found betwee? 


\ + provë 
index of need for approval and own inated 
ment ratings for males, approval-mo more 


females tended to rate themselves ba n 
improved despite the earlier terminat te 
the high need for approval group. > sive 
result lends further support to the defe 
ness hypothesis. 

The finding that therapists ar 
make meaningful predictions and Lr he 
izations of their patients independent en 
length of time the patient has pe ends 


therapy above a certain minimum, jot | 


at 
to support Meehl’s (1960) demons?” spe 
that therapists arrive quite early at 
interpretations of their patients. 

n m 

The discrepancies that were found Þe atin 
the thereapists? ratings on the Cas? sult 
scale and the ratings of the closing oh 


therapists? 
Many therapists did not make closing “yW 


tmation conveyed. 
clear that terminal statements of tre? 
are subject to vagaries that do not APP rutt 
affect in nearly the same degree st" P 
ratings of patients, t w 

In addition to their implications 1% e 
vulnerable self-esteem and defensive”? oe 
tension of the approval-motive CO?” alt 
these results also have some suggestive pe, 
regarding the problem of a model f% A 
chotherapy. The verbal conditioni4, one 
adigm has received considerable attent H 
the recent literature as an analogue cp’, 
Process of psychotherapy (cf. Kr 
1958). According to the verbal condit’ j 
model, psychotherapy can þe concept 
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as a 
N in which the therapist subtly 
havior: i anges in the patient’s verbal 
verbal and changes then generalize to 
Teal life out ehavioral changes in the patient’s 
at what side of therapy. If we can assume 
comparable i measured by the M-C scale is 
then the g in patients and college students, 
libsrestier ne amenability to verbal re- 
vals iy of approval-motivated individ- 
(Crowne ry verbal conditioning situation 
ei Strickland, 1961; Marlowe, 1962) 
early ang tendency to terminate therapy 
Problem we less improvement pose a real 
Merits op T the model. Whatever the other 
analogue Cemerits of the verbal conditioning 
tect of} it certainly fails to account for the 
in 4 ndividual differences and the mean- 
e indivia Situation as it is perceived by 
Dtovalg. ual. It is one thing for the ap- 
Need ie endent person to be more influ- 
Meng y the subtle reinforcers of an experi- 
vident] 3 Produce more plural nouns; it is 
SUtrendes ne another thing to ask him to 
siri is defensive conception of himself. 
Best mary, the results of this study sug- 
lh eritici defensiveness and avoidance of 
aa ru cism constitute a major determinant 
Contras Pt termination of psychotherapy. In 
id or the Previous studies, both social class 
“ngt o inal position were unrelated to 

Psychotherapeutic stay. 
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INTELLIGENCE TEST PERFORMANCES OF BRAIN 


DAMAGED SUBJECTS 


WITH LATERALIZED 


MOTOR DEFICITS 


HOMER B. C. REED, Jr, anp R. M. REITAN? 


Indiana University Medical Center 


The Wechsler-Bellevue Intelligence Test was 
brain damaged Ss having lateralized motor de 
no lateralized motor deficits, and control Ss 


results indicated that brain damaged 


demonstrate different kinds of intellectual 
with left motor deficits and, additionally, 
a function of the lateralization of cerebral 


the motor handicap per se. 
discussed. 


Since 1950 a variety of investigative efforts 
by psychologists and neurologists have estab- 
lished that certain basically different intel- 
lectual abilities are impaired in individuals 
with brain damage confined principally to the 
left cerebral hemisphere as compared with 
individuals in whom brain damage is confined 
principally to the right cerebral hemisphere. 
In cases of hemiplegia or hemiparesis, where 
there is adequate independent evidence of 
cerebral damage, one might therefore expect 
to find differential impairment of intellectual 
functions attributable to organic involvement 
principally of either the left or-right cerebral 
hemisphere. Additionally, such cases might 
demonstrate deficiencies on measures of intel- 
ligence attributable primarily to the motor 
involvement. Such deficiencies would presum- 
ably be greatest on tasks involving complex 
psychomotor functions and should parallel in 
magnitude the degree of severity of the 
motor handicap. 

The present study has been concerned 
with comparing the intelligence test perform- 
ances of groups of brain damaged subjects 
with lateralized motor deficits, a group of 
brain damaged subjects without lateralized 
motor deficits and a group of non-brain- 
damaged control subjects. The intelligence 
test employed was the oe r 
(Form 1). This test consists of X v 

rch Grant B-2416 from the 
ee Neurological Diseases and 


i ublic Health Service. ; 
a on are indebted to Melinda Mayhall 


for assistance with the statistical analysis. 


10 


The significance of the findings was briefly 


administered to groups of 
ficits, brain damaged Ss with 
having no brain damage. The i 
ith right-sided motor deficits 
impairment than brain damaged Ss 
that the intellectual impairment is 
dysfunction rather than reflecting 


Ss w 


re 

east! 
o mM der 
if 
ef 


tests, 6 of which were designed t 
verbal abilities, with the remaining 
signed to measure nonverbal abilities: 
ferential impairment on this test of 1 gro" 
formances of the various experimenta th 


would be an objective demonstration otf 
brain damaged subjects with left-side rel 
deficits may be expected to show $ pra" 
kinds of intellectual deficits ma o 
damaged subjects with right-side te a 
deficits. Such a finding would consti igh i 


ditional evidence that the bt Ee m 
cerebral hemispheres subserve difi a desi 
tellectual functions. Furthermore, 2 eva 
of the present study permits objecit rmi 
tion of the influence of centrally de res 
lateralized motor deficits on meas!” ot 


: a ho! 
intelligence involving complex psy¢ 
functions. 


METHOD př 
in bye 
The following groups of subjects, used at He 
study, were selected from individuals test? na wi 
Pita cd Laboratory of the Indian d 
versity edical Cent i the 
ba nter during ip! 
ap 
Group I-18 brain damaged subjects W' pt 
hemiplegia ith sie 
Group II—29 brain damaged subjects wit ft 
hemiparesis yn 
Group T1—20 brain damaged subjects W” ft 
hemiplegia it! a 
Group IV—20 brain damaged subjects W” 
hemiparesis aun 
Group V—29 brain j i 
\ damaged subjects y 
aoe motor deficits > d ad” 
o = i ji o 
pate 20 control subjects without 


2 


l 

i 

e 

thee, Wer ; 

= avian Significant intergroup differences ON 
of hed as i The brain damaged groups were 
Do; tsion, Bi osely as possible with respect to type 
‘ible to a indicated in Table 2 it was not 
tom diagnosi tain an exact match on this variable. 
apte sie of brain damage was based on a 
ty tionay urological examination and on such 
n Procedures as were deemed appropriate 


TEST PERFORMANCES OF 


TABLE 1 


ACE AND EDUCATION VARIABLES 
For Each GROUP 


Group Mean Mean 

age Range education Range 

i g 00 21-61 10.94 6-16 
m a 80 26-63 950 7-16 
m 2225 29-55 995 2-19 
y 43:20 20-65 990 3-16 
v 460 17-64 995 3-16 
44.70 27-62 1015 3-8 


TO} 
s with right hemi- 
ith right hemipare- 
plegia; 
Group 
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i up I, Brai 7 

TOW , Brain damaged patient 

Tee ir prain damaged patients wi n 

Brain grain damaged patients with left hemi 

lamaged aged patients with left hemiparesis; Gro 

i Conco eens, with no Iateralized motor deficits; 
s Without cerebral damage- 


EAA] 
o 


'i 
Ri 
e OUS w 


Va s possible on 
te tables as p 


ere matched as closely 
of age and educational levels and 


Subject (i.e, EEG, angiography; 
4 neurosurgical findings, etc). ta 
amip © fo solegia or hemiparesis 
Nin, ae each subject, the notes of the ex- 
aig, ul ieee sists or neurosurgeons made during 
S course of hospitalization, plus ad- 
atory test findings obtained in the 
Hialstead’s Finger 
a and Halstead’s (1947) Tactual Per- 
the p It should be noted that although 

dged q rain damaged subjects in Group 
© have lateralized motor deficits, the 


t co 
o. Lop, see 


seine 
fon lati ology Laboratory on 


Diag, TABLE 2 
NO 
Stic DISTRIBUTION OF PATIENTS 


Di. Group 
Vy, “Rnosig eon aT 
te Or t un i NY 
te 
De bral 
kene, Va 2 a 
N Scu! A 7 4 5 5 
linge tative lar disease oF í 9 10 
Infa atini or demye- $ 
act s disease 
tiles OF inflam, 2 4 ait 1 
ati 
tma ory 
yee CFE. o 
T e TE 
ia Blas o Gro; 
Qi Qi Croup 4? L Brain damaged paticnts with, i 
Gyoup uD Ir gbrain ip Carved patients with Ee neniipare: 
egy ys Br. rain damaged patients with left hemiplegia; 
Felts Ma Brain danena ed patients, Wino left hemi aresis; 
ic] T may tients h no i y 
ludes OUD VI, Controls: at rout cerebral USteralizes/ motor 
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criteria did not preclude the presence of motor 
deficits approximately equilateral in nature. 

The control group included subjects hospitalized 
for conditions not relating to brain dysfunction and 
nonhospitalized subjects. 

After the Wechsler-Bellevue 
ministered to each subject, all scores for each 
subtest and for the summary measures were pooled 
and ranked within each distribution from poorest 
to best performance. The ranks were then con- 
verted to normalized T score distributions, giving 
each distribution a mean of 50 with a standard 
deviation of 10. Differences between the groups of 
subjects were evaluated by computing the ap- 


propriate ¢ ratios. 


Test had been ad- 


RESULTS 


Table 3 presents the performances of all 
groups of subjects on the Wechsler variables. 
The right hemiplegia group performed at a 
slightly higher level than the right hemipa- 
retic group 0n 10 of the 11 Wechsler subtests. 
Comparison of the summary measures on 
these two groups, however, reveals no sig- 
nificant differences (see Table 4). Subjects 
with left hemiplegia were superior to sub- 
jects with left hemiparesis on 5 of the 11 
subtests and again the comparison of the 
summary measures reveals no significant 
differences. It is readily apparent, then, that 
the severity dimension did not reliably 
order the psychological test results. In the 
absence of any significant differences between 
the hemiplegic and the corresponding hemi- 
paretic groups it was decided to combine the 
four groups with lateralized motor deficits 
into a right motor deficit group and a left 
motor deficit group. Figure 1 presents the 

erformances of the combined groups on 
Jer subtests. As can be seen, the 


the Wechs 

normal controls were superior to all of the 
brain damaged groups on all subtests. The 
th left-sided motor deficits was 


group wi P i 

superior tO the group with right motor 

deficits 0n all of the verbal measures with 
all of the perform- 


being true on 
The brain damaged group 
having no Jateralized motor deficit performed 
at a level between the other brain damaged 
groups 0n the verbal tests but was gen- 
erally somewhat superior to them on the 
performance measures- This last finding sug- 
gests that g Jateralized motor 


subjects havin 
deficits may be 


the reverse 
ance measures. 


at a slight disadvantage on 


104 


Homer B. C. REED, JR., 


AND R. M. REITAN 


TABLE 3 
T Score MEANS AND STANDARD DEVIATIONS OF WECHSLER VARIABLES FOR ALL GROUPS 
Group 
Variable I I Ill Iv v vI 
Verbal WS 7 
x 48.33 43.65 51.30 30.5 715 58.95 
5 $ i p 50.50 47.15 z 
SD 7.57 10.20 9.56 8.96 0.03 7.99 
Performance WS 
x 49.78 49.05 44.30 ài z g 59.80 
. A 46.70 50.45 
SD 6.47 8.32 9.08 10.65 10.17 7.64 
Information 
x 49.50 45.15 5 5 57.35 
= ` S15 52.65 49.20 46.15 s 
SD 9.71 11.27 8.95 8.09 $33 9:39 
Comprehension i 
x 47.61 44.60 si 65 e 56.25 
5 e 1.65 49.91. 49.25 
SD 9.36 10.79 11.14 7 33 9.54 an 
Digit span a 
SD ABOO 44655070 515s 4840 is 
. : 7.56 10.72 7.97 8.89 p: 
Arithmetic 
x 49.61 44.153 57,39 
; i A 49.65 5 7.85 Si 
SD 7.76 10.00 9.63 et R 0.78 
Similarities oe A 
x 48.61 475 ë 57.45 
: : 7 51.90 9.03 7.90 ge 
SD 8.54 9.67 8.83 er fio tele 
Vocabulary e N 
x 48.72 46.00 5 54.15 
> > ` 51.20 55 ae 
SD 10.35 10.99 9.91 Be ers 10.00 
Picture arrangement x ói 
x 51.78 5 Š 56.45 
SD zw e ASIO 4685 49.50 56:27 
Picture completion i 9.90 11.05 
x 51.94 47.35 80 
SD 8.12 Tas “eo E Ey 
t 5 i 
Block design 8.29 12.24 
x 49.89 48.90 59,00 
SD 5.73 sas 4999 atao s255 S95 
Object assembly s 9.22 9.94 
x 50.00 50.30 5 3 
SD 9.03 4 43.50 48.40 51.80 57:9; 
8.14 10.24 10.47 a 
Digit symbol i 7 
X 48.94 47.20 15 
SD 7.33 7.82 sao 48.70 49.75 5073 
i 10.40 9.54 i “ati 
Note.—Group I, Brain damaged patients with right hemiplegia; G ain te 
ain d d patients with left I legia; G + Group II, Brai S -o nemi, P' 
Samaged patients wth no Tateralized motor defekts: Group VE, Contan aattaied paucamaged patients with right Bip V 
out cerebral damage. 0 
blems, but th ` jo 
complex psychomotor pro aera iS The con oat fu erl os 
handicap does not seem to be a significant the Bra rel were significantly suP we, 
me amaged subjects on bot wi 
one. istical i „casures. The left motor deficit groUF ot? 
Table 4 presents statistical comparisons on Significant] ; sek isht 1 
measures of Verbal fici Y Superior to the right o, i 
the Wechsler summary a: eficit grou s scot: io 
: d Performance Weighted i P on Verbal Weighted yb 
Weighted Score and Her : Spite of the Consistency, h with yp! 
Score for the various groups of subjects. the right y, however, 


Motor deficit group exce 
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TABLE 4 
= t Ratios ON WECHSLER WEIGHTED SCORES 
VWS PWS VWS — PWS 
Group t p t $ t p 
B versus II 1.59 20 .30 80 
£ II versus IV .27 80 .77 50 
IE versus III + IV 241 02 1.97 10 4.64 .001 
V versus I + II 51 .70 45 -70 10 1.00 
V versus III + IV 1.50 .20 1.82 10 3.96 .001 
V versus I + II + II + IV .55 -60 1.33 20 
VI versus I + II 5.37 .001 5.03 001 1.13 30 
VI versus III + IV 3.35 005 5.69 001 2.93 01 
I versus V 4.39 001 3.29 .005 1.03 40 
N 
Group thi potoup I, Brain damaged patients with right hemiplegia; Group II, Brain damaged patients with right hemiparesis; 
maged patio? damaged patients with leit hemiplegia; Group IV, Brain damaged patients with left hemiparesis; Group V, Brain 
ients with no lateralized motor deficits; Group VI, Controls without cerebral damage. 
left 
= Jateralized motor deficits did not differ sig- 


Su “le deficit group on the performance 
Btoups + the difference between these two 
diq ee the Performance Weighted Score 

reach statistical significance. The 


Stoy 
s , ; 
PS of brain damaged subjects with 


=—:— Right hemiparesis - 
Right hemiplegia 


-e—e—e- Brain damaged with no 
jateralized motor deficits 


he group of brain damaged 
teralized motor deficits on 


nificantly from t 
subjects without la 
either summary measure. 

Evaluation of 


jntraindividual difference 


----= Left hemiparesis - 
Left hemiplegia 


— Normal Controls 


Fic. 1, Comparative Wechsler Bellevue i 
el H vue performances for brain 
without lateralized motor deficits and for a m e 


damaged groups with and 
without brain damage. 
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scores between the Verbal and Performance 
Weighted Scores reveals a clear and signifi- 
cant difference between the right and the 
left motor deficit groups. The group of sub- 
jects with left motor deficits also differed 
significantly from the brain damaged and 
normal control groups on this measure, but 
the right motor deficit group did not. 


CONCLUSIONS AND DISCUSSION 


The principal conclusions to be drawn 
from this study may be stated as follows: 
brain damaged subjects with right motor 
deficits may be expected to demonstrate a 
different kind of intellectual impairment than 
brain damaged subjects with left motor 
deficits. The differential impairment of these 
groups of subjects is attributable to lateraliza- 
tion of cerebral dysfunction and seems to be 
unrelated to lateralization of the motor deficit 
per se. These findings support the research of 
Briggs (1960), who found that subjects per- 
formed nearly as well with either hand as 
with both hands on the performance sub- 
tests of the Wechsler Adult Intelligence Scale. 
The conclusions of the present study are 
also in close agreement with the researches 
of Reitan (1955), Kløve (1959), and Doeh- 
ring, Reitan and Kløve (1961), who have 
demonstrated that impairment of verbal 
abilities as measured by the Wechsler-Belle- 
vue Scale is associated primarily with lesions 
of the left cerebral hemisphere, whereas 
abilities requiring the appreciation of spatial 
and temporal relationships are impaired 
primarily by lesions of the right cerebral 
hemisphere. Each of the above studies em- 
ployed different lateralizing criteria but each 
used subjects with current neurological com- 
plaints, as has the present investigation. 
Thus the conclusion relating different abilities 
to the two cerebral hemispheres is seen to 
have a high degree of generality for this 
population of patients. Failure to find dif- 
ferential intellectual impairment in groups 
with longstanding damage having lateralizing 
neurological signs has been reported by 
Fitzhugh, Fitzhugh, and Reitan (in press) 
and Kløve and Fitzhugh (in press), suggest- 
ing that chronicity of damage may be an 
important independent variable. Bruell and 
his associates (Bruell, Peszczynski, & Albee, 
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1956; Bruell, Peszczynski, & Volk, 1 
found no differences in ability to pe 
verticality among groups of brain damani 
subjects with differently lateralized ne 
deficits. This research, also, suggests the Pa 
sible importance of chronicity of ce 
dysfunction as a determinant of intellec 
impairment. d 

The presence of conflicting findings a 3 
cerning the existence of differential e 
lectual impairment in brain damaged subida 
having lateralizing neurological signs rion 
phasizes the complexity of the a 
problem in this area of research. The vail | 
with which neurological signs indicate a 
organic condition of the brain is depe” an 
upon a wide variety of covarying pe reds 
until this variance is systematically exp10"% g 
conflict rather than agreement among es 
results of various investigations may Þe 
pected to occur. 
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Wiggins (1959) found that Edwards’ SD scale was 
ing MMPI records of Ss instructed to answer in 
from records of other Ss not so instructe 


Wiggins’ study on both procedural and interpretative grounds and reported 
hat suggest SD can perform this discrimina- 


findings of a redesigned replication t 


of control scales in both studies, 
test-retest “improvement” effect, 


n . 
pend experiment, Wiggins (1959) 
€r thin evidence which suggested, among 
of 3 that the social desirability scale 
i dis in -dwards (1957) was not effective 
Mistr Sushin MMPI records of subjects 
to answer in terms of social desir- 
Struct a an records of subjects not so in- 
More recently, Walker (1962) has 
a € inter iggins’ study on both procedural 
Mgs Pretative grounds and reported find- 
e te redesigned replication that reflect 
Crimi: orably on Edwards’ SD scale as 4 
ig nation index. 
Constitu (1959) employed two randomly 
90 Pee groups: (a) a control group of 
Astr Mas students receiving the standard 
‘Ns printed on the front of the MMPI 
it collese. (b) an experimental group of 
Pa i Be students instructed to answer 
oo Co € direction that people in general 
age aha to be more desirable. With 
1959) © the experimental group, Wiggins 
*ssumed that: 


abi? relati 
in? respon uniform group of high social desir- 
the a to the ots may be created by special instruc- 
Serye Ount op Piects. Such a group would differ in 
US a this tendency and could, in principle, 
fton @l a ounitetion group for distinguishing an 
“eas 2 Small of social desirability response style 
inapi Socia] amount. Scales which purport to 
iter ly aq desirability response style should pre- 
f, an tin, leve a certain amount of success in 
20). 8 such a group from a “normal” group 


tion. It was argued that given (a) the actual development 


(b) the success of other desirability scales in both studies, 
and (d) Walker’s failure to control for the 


Walker’s conclusions cannot stand, and his 


not effective in distinguish- 
terms of social desirability 
d. Walker (1962) has criticized 


of the SD scale, 
(c) the similarity 


defense of the SD scale must be judged unsuccessful. 


Walker (1962) objected to Wiggins’ pro- 
cedure on three grounds: (4) Edwards’ SD 
scale was developed to measure social desir- 
ability response tendencies under standard 
test conditions and hence should not be re- 
quired to discriminate between standard and 
role playing situations; (b) social desirability 
was probably operative under Wiggins’ con- 
trol conditions and this, together with the 
“obvious” nature of SD items, would con- 
tribute to a high mean scale score that would 
be restricted in the amount it could shift 
under role playing instructions; and (c) 
Wiggins’ instructions or his subjects’ ability 
to grasp them may have been less than 
optimal from the standpoint of maximizing 
the amount of social desirability responding 
in the experimental group. 

With the above objections in mind, Walker 
conducted an experiment in which the same 
120 subjects were given standard instructions 
followed, 1 week later, by experimental 
instructions designed to “closely approximate 
those used in the original development of 
Edwards’ SD scale” (Walker, 1962, p. 163). 
The marked improvement in the discrimina- 
tive efficiency of the SD and K scales under 
Walker’s revised procedures led him to con- 
clude that his three criticisms of Wiggins’ 
study (as it related to Edwards’ SD scale) 
were appropriate. 

The present paper will endeavor to demon- 
strate that given (a) the way in which 
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TABLE 1 


COMPARISON OF THREE SocIaL DESIRABILITY ROLE PLAYING INSTRUCTIONS 


Wiggins (1959) 


Walker (1962) 


Edwards (1958)° 


This inventory consists 
of numbered statements. 
Read each statement and 
decide whether People in 
General would consider a 
true or a false answer to 
be more desirable. You 
are not asked whether the 
statement is true or false 
as applied to you. Rather 
you are asked to decide 
which answer you think 
People in General would 
consider to be more de- 
sirable 

You are to mark your 
answers on the answer 
sheet you have. Look at 
the example on the an- 
swer sheet shown at the 
right. If you decide that 
people in general con- 
sider true to be the most 
desirable answer, blacken 
between the lines in the 
column headed T. (See 
A at right.) If you decide 
that people in general 
consider false to be the 
most desirable answer, 
blacken between the lines 
in the column headed F, 
(See B at right.) 

Remember do not give 
your own opinion of your- 
self but blacken the an- 
swer that you feel people 
in general would consider 
to be most desirable. 

In marking your an- 
swers on the answer 
sheet, be sure that the 
number of the statement 
agrees with the number on 
the answer sheet. Make 
marks heavy and black. 
Erase completely any an- 
swer you wish to change. 
Do not make any marks 
on this booklet. 

Be sure to answer every 

statement. 


Last time when we 
gave you the MMPI, we 
asked you to respond to 
the items honestly and 
to tell us whether each 
item was True or False 
as applied to you. To- 
day, we want to see how 
well you can do in an- 
swering the same items 
So as to create the most 
favorable impression you 
can. In other words, we 
want you to answer each 
item in terms of whether 
you consider the True or 
the False response to be 
more desirable. If you 
think the True response 
is more desirable than 
the False then answer the 
item True. If you think 
the False response is more 
desirable than the True, 
then answer the item 
False. For example, if 
you were trying to create 
a favorable impression, 
you would probably an- 
swer True to this item: 

Tam regarded as an 
honest person. 
and you would probably 
answer False to this item: 

I frequently hurt the 
feelings of others, 

Remember, answer each 
item in such a way as to 
create the most favorable 
or desirable impression 
Possible, 


Please fill this out for 
Dr. Edwards. He wishes 
to obtain social desirabil- 
ity ratings for these 
items. Mark an item T 
(true) if marking it true 
is the most socially de- 
sirable response to the 
item. Mark an item F 
(false) if marking it false 
gives the most socially 
desirable response. Give 
the most socially desir- 
able picture you possibly 
can from the way you 
mark the items. Please 
mark every item. . - 
Nosignatures are needed. 


* A, L. Edwards, personal communication, February 20, 1958, 


A REPLY TO WALKER 


eed fact developed the SD scale, (b) 
scales ieee power of other desirability 
e) the oth the Wiggins and Walker studies, 
both st riage of control scale scores in 
Cree n ies, and (4) Walker’s failure to 
meng? . the spurious test-retest “improve- 
a enomenon on the MMPI, Walker’s 

ns cannot stand, and his defense 


of the Ed 
“dward? SD iuda 
unsuccessful. scale must be judged 


Ror 

E 

PLAYING INSTRUCTIONS AND SOCIAL 
DESIRABILITY SCALES 


In 


Walk criticizing the original experiment, 


er j A ° 
impeaches the instructions used as 


Wel 

t a as the ability of the subjects to grasp 
Wiggins y) there is reason to believe that 
Were ery experimental instructions 
Mount of that they did not maximize the 
Way” of responding in the socially desirable 
demon? 162); and “Wiggins’ failure to 


and oe substantial differences in the K 
failure ais may have been due to the 
perimental subjects to understand fully the 
com al instructions” (p. 163). i 
(Tab e r arison of the two sets of instructions 
ate ked suggests that Walker’s objections 
of g on the undistinguished performance 
on any and K in Wiggins’ study rather than 
uent] notable omission in instructions subse- 
that Sta remedied by Walker. The implication 
Partici anford University undergraduates (who 
astue ated in the original study) are less 
ple hiss grasping the requirements of a 
Wester ying task than their counterparts at 
ma ‘i Washington College of Education 
Possibi; Somewhat reluctantly conceded as @ 
admin. tY. However, one cannot help but 
Students. € manner in which the Stanford 
M the s pplied the instructions to items 
d and Cof pools,’ even though they 
ot 08 items from Edwards’ SD. mi 
i sults with Cofer’s scale, Cof, repli- 
(arte on € of a study performed a decade 
ag ler ae more heterogeneous population 
S addit ice, & Judson, 1949) can be cited 
Aus Onal evidence of the representative- 
Ati i 
in tha : Social desirability scale developed from 
Cop Deroup, minated experimentals from controls 
hon a simia Wieeins (1959) original study- 
Si (Cofer a ile developed from role playiné 
t, Chance, & Judson, 1949). 
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ness of the Stanford students. Not least, the 
splendid success of Wiggins’ Sd in Walker’s 
own data likewise attests to this scale’s indif- 
ference to changes of collegiate environment. 


CONSTRUCTION oF Epwarps’ SD SCALE 


Walker’s “primary criticism” of Wiggins’ 
(1959) experiment alleges a failure to recog- 
nize Edwards’ (1961) position that the SD 
scale: “provides a measure of the tendency of 
subjects to give socially desirable responses in 
self-description under the standard instruc- 
tions ordinarily used with personality inven- 
tories” (p. 353, italics his). Despite the fact 
that more than a dozen articles and a sub- 
stantial portion of a book have been devoted 
to various aspects of Edwards’ SD scale, the 
original instructions to the 10 judges? from 
whose responses the scale was developed have 
never been published. These instructions are 
presented in the third column of Table 1. 
Those familiar with the extensive literature 
concerning the SD scale, but unfamiliar with 
details of its construction, may be surprised 
to note that rather than being developed with 
reference to social desirability scale values of 
items, SD items were keyed on the basis of 
dichotomous responses of a small group of role 
playing subjects. Moreover, Edwards’ SD 
scale was developed by use of role playing 
instructions not only similar to Walker’s but 
very much like those employed in the develop- 
ment of Wiggins’ role playing scale, Sd 
(Table 1). It is not clear why one scale (Sd) 
is held to be appropriately evaluated by role 
playing experiments while the other (SD) 
is not. 

In constructing Sd, Wiggins (1959) em- 
ployed the empirical method of contrasted 
groups in which an item analysis selected 
those items that significantly differentiated 
responses of a role playing group from re- 
sponses of a standard group. Edwards SD 
scale was constructed solely from role playing 
several graduate students, one 


d several undergraduates. . 
8 The use of role playing designs for evaluating 
«faki ood” tendencies has a 

Eaki jy antedates Edwards’ (1957) recent reformu- 
ech “social desirability” (see, for 


3 f the issue of 
een the studies of Cofer, Chance, & Judson, 


1949; Gough, 1952; Mechl & Hathaway, 1946; 
Ruch, 1942). 
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responses and hence, the significance of a high 
SD score must be determined from further 
empirical investigation. Edwards (1957) as- 
sumed that subjects who score high on SD 
under standard instructions are behaving in a 
manner similar to that of his role playing 
judges. This is probably true, but if the 
behavior of such subjects cannot be distin- 
guished from that of the rest of the standard 
group or from any other reasonable compara- 
tive group, then the fact that their responses 
resemble those of role players is without 
discriminative significance (Wiggins, 1962). 
Available item frequency data suggest that, 
on the average, items in Edwards’ SD scale 
will be answered in the desirable direction by 
approximately 80% of any normative group. 
The argument that we will never know to 
what extent a “normative” group is respond- 
ing in terms of social desirability only high- 
lights the need for operational definitions that 
have independent empirical significance. 

Walker (1962) evinces partial recognition 
of the foregoing state of affairs in noting 
that: “... the items of the SD scale are 
fairly obvious in terms of social desirability- 
undesirability. It seems reasonable that such 
would tend to elevate the control mean of 
the SD scale” (p. 162). However, this fact 
must be attributed to the items in the SD 
scale and not (as Walker does) to Wiggins’ 
procedure, since the mean SD score in 
Wiggins’ control group (30.9) is virtually 
identical to the mean SD score in Walker’s 
control group (30.5). 


REPEATED MEASURES VERSUS INDEPENDENT 
Groups DESIGNS 


The most obvious procedural difference be- 
tween the experiments of Wiggins (1959) and 
Walker (1962) is in the latter’s use of a 
design in which the same subjects served as 
both role players and controls. The major 
rationale behind the use of independent con- 
trasted groups was given by Wiggins (1959) 
in the original article: “. . . the population 
about which we would generalize contains 
ubgroups of high dissimulators and relatively 
honest people—rather than people taking the 
MMPI under several conditions (p. 20) $ 

In addition to providing a more he istic 
basis for generalization, the contrasted groups 
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design avoids the necessity of employing 4 
test-retest control group for the “improve 
adjustment” effect that frequently occurs W! i 
repeated administrations of the same in 
tory (Windle, 1954, 1955). When dissimula 
tion of social desirability is called for fo aa 
second testing, improved adjustment ae 
operate in the direction of the hypothes!s 
an additional control group which simy 
takes the test twice under standard condit! 
would seem to be appropriate. thet ' 
Windle (1954) has reviewed the E E 
extensive evidence for an improved ha: 
ment effect in repeated administration J 
personality questionnaires. In a later SPÍ, 
specifically directed toward the , MI tory 
Windle (1955) administered the in é 
twice to standardly instructed undere ag 
ates, with a 1-week interval between a 
His conclusions are particularly germal 
Walker’s procedure: 


Confirmation is gained from this study fo jen ' 
earlier finding that when personality tests ari pje 
a second time after a short time interval, A 
tend to get better adjustment scores than A 
original testing. With regard to the M ii axie) 
retest effect seems greatest on the so-called an thi 
scales. The validity scales also tend to Te 
retest effect (p. 248). 


d 
we 
The Taylor MA and K scales both ie ip 
significant ($ < .01) improvement effe n of 
Windle’s study. Walker’s demonstrat! 67 
improvement in SD (which consists 0! ” pe 
MA items and 13% K items) and E ip 
K scale is therefore difficult to interp" oup 
the absence of an appropriate control 8 
By use of a repeated measures n ive 
Walker was able to reduce the false 28° tal 
identification rate of SD in his experi” jp 
group by some 30% over that fou go 
Wiggins’ experimental group. From 4 $ pat 
tical standpoint it is interesting to note ~ gp 
this improvement was accomplished YD 
overall increase of experimental groUP, “of 
Scores of only three points over 2 jer 
Wiggins’ experimental group. From & 5” to 
tific standpoint, it would be interesting pe 
know if this three-point increment mig?! yat 
observed in a control test-retest grouP at ' 
took the inventory twice under sta? ict 
instructions. Windle’s (1955) findings 508 
this might be the case. 
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MMPI DIFFERENCES ASSOCIATED WITH SEX, RACE, 
AND CLASS IN TWO ADOLESCENT SAMPLES? 


ROBERT L. McDONALD 
Medical College of South Carolina 


AND 


MALCOLM D. GYNTHER 


Washington University Medical School 


This study was designed to determine if Negroes and whites perform differently 
on the MMPI with socioeconomic status controlled. The sample of 360 was 
drawn from high school seniors attending segregated southern schools. Class 


categorization was based on parental occupation. 


Analyses of variance and 


supplementary “t” tests showed that Negro students obtained higher scores 
than whites on Scales L, F, 1, 2, 5, 8, and 9 and lower scores on Scale 3, 
and males obtained higher scores than females on Scales J, 2; 5, 7, and 9 
and lower scores on 0, but that no differences were attributable to the class 
factor. It was hypothesized that the Negro-white differences reflect distinctive 
cultural patterns, The development of separate MMPI norms for Negroes was 


recommended. 


An MMPI Handbook (Dahlstrom & Welsh, 
1960) indicates that only four studies com- 
paring the performance of Negroes and whites 
on this instrument were carried out during the 
period 1939-60. Three of these studies were 
of prisoners in state institutions (Caldwell, 
1954; Fry, 1949; Panton, 1959): the other 
was of patients in a Veterans Administration 
hospital (Hokanson & Calden, 1960). The 
Negro samples generally obtained higher 
scores on the clinical scales and were sig- 
nificantly higher on Scales 8 (Sc) and 9 
(Ma) in three of the four studies cited. It is 
difficult, however, to generalize from these 
findings as the samples were drawn from 
populations clearly not representative of 
Negroes-in-general, 

A review of the literature to date discloses 
that only two additional studies of this prob- 
lem have been done (Ball, 1960; Miller, 
Wertz, & Counts, 1961). The latter experi- 
ment used Veterans Administration mental 
hygiene clinic patients which, again, could 
not be considered as a representative sample 
of Negroes. Ball used high school students, 
but his sample was so small—14 Negro males 
and 17 Negro females—that his findings of a 
high incidence of neurotic tendencies among 
the Negro boys as well as withdrawal and 


1The authors express appreciation to Thomas A, 
Carrere, Eugene C. Clark, Miss Margaret Broadnax, 
Jacob O. Sines, and Joseph L. White for their 
invaluable assistance during various stages of the 
study. 


be 
introversion among the Negro girls must 
considered as highly tentative. 
The relationship between socioeco? 
status and MMPI performance has 


omic 


150? 
investigated by Gough (1948), Noo 
(1952), and Perlman (1950). The tly 


finding was that K scores are signi Catt 
positively correlated with various a 
indices. Also, lower-class subjects gen cales! 
obtained higher scores on the clinical agi 
although the results showed few specifici PS f 
ments. That is, Nelson (1952), ae ub” 
chiatric patients, found that lower-clas cale 
jects got significantly higher scores pr sig 
1, 3, and 6, whereas Perlman’s (195 ef? 
nificant differences with normal subject "tet 
on Scales 1, 7, and 8. Furthermore, oF pe 
investigator found no status effects W 5 god 
examined the MMPIs of working-cl Tyro 
middle-class psychiatric patients. Pjata É 
and Welsh (1960) interpret these cle 
indicating that middle-class and upP® ay? 
test subjects are more cautious about ut 
things that may adversely affect their T oa 
tions and that lower-class subjects 4PP* ga 
the MMPI with less desire to covet ssi 
erect a facade of adequacy. Their cone itt 
appear to be principally based on the Powe 
K-status relationship and imply that ow? 
class subjects would obtain significant 
K scores than upper-class subjects. 
Dahlstrom and Welsh (1960) est d 
Handbook relate the race and status stain 
data by stating that “the findings 0? s 
y stating tha e 
in all studies except that of Fry are 


jp p 
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MMPI DIFFERENCES 


m mua be expected from known effects 

4 Cloeconomic inequities.” The purpose of 
ie study is to test Dahlstrom and 

ald pothesis, using samples which are 
de ntative of the adolescent subgroup of 

e cial group and which are large enough 
yield stable results. 


METHOD 


dei pent utilized a 2xX2X3 factorial 
factors ens sex, and socioeconomic level were the 
Were ieee es Three hundred and sixty subjects 
of the 12 pa in the study, with 30 subjects in each 
Stoup ae This sample was drawn from a larger 
Tals) Nep. O ected of 354 (196 female and 158 
White high s and 263 (132 female and 131 male) 
Staduatin, ae seniors who comprised consecutive 
Sated pth asses (1961 and 1962) of urban segre- 

9 with ern high schools. Ages ranged from 16 
t race, no mean age differences between sex 
and LyscuPational classification scheme by Schneider 
hh ere (1953) was applied to subjects’ parents 
Social Gy to classify subjects with regard to the 
derive, lass factor. Four occupational classes are 
Ver “ow, terms of the degree of supervisory power 
vi tY co er” occupations and independence of super- 
Classes Stee from “higher” occupations. The four 


L In 
lirest, dependent occupations (including executives, 


r 
S, owners of business or farm, doctors, 
“"sincers) ankers, ministers, professors, consulting 
Ld 
Uperyigoendent occupations involving skill and 
“Uervisore or manipulation of others (including 
e Ymer foremen, technical engineers in industrial 
Nt taimers) sales workers, agents, public officials, 
he l 
Uperyja adent occupations involving skill but little 
q and or manipulation of others (including 
Ustriag ompa aled workers and workers in in- 
: oyment) 
li e 
ine upa aent occupations involving little skill and 
a Vision or manipulation of others (includ- 
à] Sa line workers, laborers, janitors, farm 
ù ad workers, miners, drivers) 


its 
able p the present study employed an “unclas- 
ems i ar Eory for those parents listed as deceased, 
n loved Other city with job unspecified, or as un- 
of Èe basi he groupings were made, when possible, 
ayses wh of paternal occupation. In that minority 
clas’, the ere the father’s occupation was unclassifi- 
th, Silica Maternal occupational status was used for 
tiv, Sthnejq, PUPOses. It should be pointed ont an 
tly er-Lysgaard classification scheme 15 rela- 
G The use of additional indices such as 
me ara) Score Card should lead to more 
TMp A = of the status factor. 
eci ere administered by school counselors 
al group testing periods. Each student 


Ms 
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was also instructed to list the occupational status 
and job title, if known, of both parents on a form 
provided for this purpose. Inspection of the MMPI 
answer sheets revealed no invalid records either 
through inability to understand the instructions or 
through omission of an excessive number of items. 
Furthermore, MMPIs were not discarded because of 
high L or F scores as several studies (e.g., Gynther, 
1961) have suggested that such configurations are 
not invalid, but meaningfully related to personality 
traits such as hostility. 

Two judges, the junior author and a second clinical 
psychologist, independently rated the maternal and 
paternal occupations in terms of the various clas- 
sifications. In case of disagreement between these 
raters, the final rating was made by the senior 
author. Inspection of the socioeconomic classifications 
revealed skewed distributions for both races with a 
rtionate number of the white sample falling 
the upper socioeconomic level while the 
d to the Negroes. To insure sufficient 
ch cell, it was necessary to combine 
Classes 1 and 2 which resulted in the use of only 
three socioeconomic categories in the analysis. 

The total group of 617 was reduced to the experi- 
mental sample of 360 by alphabetizing the protocols 
within each of the 12 subgroups and deleting 
subjects by randomization procedures until the sub- 
groups were all of a size equal to the smallest 


subgroup (i.e. 30). 


dispropo 
within 

reverse applie 
numbers in ea 


RESULTS 


Table 1 shows cell means and standard 
deviations for the 13 MMPI scales. Tests for 
homogeneity of variance (Dixon & Massey, 
1951) resulted in rejection of the hypothesis 
of homogeneity for Scales 1 (Hs) and 3 (Hy). 
Table 2 contains the F values obtained by 
analysis of variance for the remaining 13 
MMPI Scales.” 

Inspection of Table 1 reveals that the mean 
scale scores are all within normal limits with 
most subgroups peaking on Scales 4 (Pd) or 
9 (Ma) which would be typical for adolescent 
samples (Hathaway & Meehl, 1951). Table 2 
shows that despite the general similarity 
among groups indicated by Table 1 there are 
also marked differences. Males obtained sig- 
nificantly higher scores on Scales 2 (D), 5 
(Mf), 7 (Pt), and 9 (Ma) than females, 


2 Analysis variance tables have been deposited 
ie ge none Documentation Institute. Order 
Document No. 7396 from ADI Auxiliary Publica- 
tions Project, Photoduplication Service, Day of 
Congress; Washington 25, D. C. Remit in a er 
$2.50 for microfilm or $1.75 for photocopies, an 
make checks payable to Chief, Photoduplication 


Service, Library of Congress. 
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whereas females scored higher on 0 (Si). 
Negroes got significantly higher scores than 
whites on Scales L, F, 2, 5, 8, and 9. On 
Scale 1, males were significantly higher than 
females (¢ = 1.99, p < .05) and Negroes were 
significantly higher than whites (t= 3.27, 
p< 01). Analysis of Scale 3, on the other 
hand, disclosed that whites were higher than 
Negroes (¢ = 2.23, p < .05). No differences 
were found on any scale with regard to the 
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E some 

Table 2 also indicates that there inter- 

significant interactions. The sex by re that 
action on the L scale reflects the fa! 


; ores 
Negro males obtained much higher i “hile | 
than white males (¢ = 6.97, p < 0 her L 


Negro females obtained significantly ES oD) 
scores than white females (t = 3.22, P oreen 
but with considerably less difference on the | 
the means. The sex by class interaction uppe" 
K scale pertains to the finding tha per 


u 
class factor. class females got higher K scores than UP 
TABLE 1 
MMPI Scare Means AND STANDARD DEVIATIONS BY Sex, Race, anp SoctaL CLASS _ 
Males Females pee- 
White Negro White Negro 
MMPI m a a 
scales 1-2 3 4 1-2 3 4 1-2 3 4 i2 3 A 
n ee 
L 47.03 46.87 48.03 53.63 55.87 54.80 50.90 51.03 49.03 56.27 52.73 %19 
5.75 6.28 558 794 862 806 579 6.33 8.34 7.62 9.19 ; 
angy SRA 
F 54.17 55.87 54.33 5740 56.57 57.63 54.10 54.33 54.10 56.13 58.27 6.88 
5.09 7.89 5.61 8.68 887 7.77 667 676 625 7.69 88 A 
52. 
K 52.27 51.90 49.90 51.40 56.03 54.33 55.20 54.10 52.87 56.77 5120 7.84 
BOL ‘837 SAG 7163 786 aiz ag acs aor 792 8.2 i 
s 52 
Hs 51.37 51.23 51.50 55.10 57.10 57.03 52.00 51.23 51.63 53.60 5160 °9'57 
8.52 7.44 10.01 8.29 863 10.29 626 465 899 7.57 9.39 0 
s : a93 5 
D 54.90 53.20 51.23 5800 57.07 55.37 49123. S020 50.77 52.33 52.53 "9/58 
11.09 10.86 9.92 13.19 10.29 1242 1038 821 7.16 8.45 12.89 3 
é ee 52: 
Hy 55.27 56.57 56.53 55.13 56.77 54.60 5740 56.60 55.37 54.10 52.57 “932 
G80 O91 1029 859 BI Fat “Yor B2 €79 6.54 12.29 a 
58: 
Pd 60.97 59.53 59.77 5980 62.40 60.03 60.40 60.80 50.70 59.00 $9.90 7.46 
1085 106C EEE - 1073 821 sog Wsi OOL 035 8.93 5.23 f 
fg a r 53. 
Mf 54.60 56.43 53.20 57:27 55.03 55.23 46.37 46.00: 50,93 59.93 $1.80 638 
7.77 9.18 10.71 T.58 746 854 728 846 10765 728 9.71 Fy 
L 54 
Pa 54.00 54.80 52.33 53.00 52.90 53.30 55,30 57.63 55.50 50.63 57.43 "340 
8.03 10.67 789 944 9.60 741 i102 841 10.73 10.03 10.77 k 
É 54 
Pl 57.77 58.20 56.67 59.67 56.07 59.70 56.97 56.40 57.73 54.33 36.00 4°30 
9.91 1065 9.48 1186 878 735 853 827 949 712 875 n 
60. 
58.60 61.07 57.00 62.00 61.17 64.03 56.97 57.80 59.50 59.60 62.77 10.2 
2 11.19 12.09 10.06 10.32 10.78 7.03 942 1099 943 935 1220 7 
2. 
; 5 7 6708 
1.53 59.23 65.30 62.13 66.83 56.70 60.30 58.69 5940 62.3 3.0 
si a 197 11.81 11.65 9.74 824 1148 982 10.78 9.58 10.55 A 
i 57 52! 
49.57 49.90 49.27 49.10 4930 49.27 53.63 51,00 52.57 5771 
s Toi pe 774 6.38 5.29 7.62 638 864 10.02 850 gas © 
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TABLE 2 


FY. 
ALUES FROM ANALYSES OF VARIANCE oF MM 


PI Scares BY Race, SEX, AND SociaL CLASS 


Source L F e 5 m Mf Po PR Se Me Si 
Se; pe 
tee 230 Oty qse 980 15i 327197 325 417% M4 4.67 5.75* 
Clase ey apase 11.61%" 1.19 3.09% 0.72 10.94% 159 0.52 755** 10.95** 0.59 
ses ) 026 035 111 022 005 034 2.08 016 0.55 0.69 1.09 
Bx 552 013 340 019 207 434* 0.59 22 0.13 040 0.63 
ASC 096 oog 030 00L 0.13 0.56 0.72 0.20 011 140 065 
Noes oos oog zae L34 O47 306* 130 0.35 015 143 1410 

KC 216 105 127 005 14 10 06 136 207 042 0.70 

: 

b 
PER 


Class 

there oe (¢= 2.90, p< .01), whereas 
Ower-clas no difference between middle- and 
0.2 am s males and females (t= .82 and 
actions spectively). Two significant inter- 
Which Were found on Scale 5: sex by race 
Obtaining a consequence of Negro females 
(ts 94 higher scores than white females 
Males ‘Ob, P<.01), but Negro and white 
(ta p taining approximately equal scores 
te findin ; and sex by class which reflects 
ot iie ihat upper-class males’ scores did 
as inal significantly from those of lower- 
ales =a (t = 1.03), while lower-class 
eae to get higher scores than 
Pie aoe females (¢ = 1.80, 107 Ê > 05). 
“ale Pparently contrasting results for the 
3 artifact 2 oc interaction are actually 
thet associated with the reversed 
lo 


fen, 


n 
ie Mis of this scale. In other words, 
be Scores for upper-class males and the 
tig OMSide £ upper-class females both would 
On wip Cred as indicating more identifica- 
feminine interest patterns than is 


Se wi 
with the lower-class groups. 


DISCUSSION 


ba, Ere i 
vata i i remarkable degree of similarity 
wus Prisoner results and previous findings 
Nees have and psychiatric samples. All 
th Broes Iht „consistently demonstrated that 
ie c inical ain higher scores than whites on 
i ae scales of the MMPI. It is tempt- 
96g, COMclude, as others have (e-g., Ball, 
Nop meet Negro high school students are 
Not q aladjusted than white students. It is 
Sarj, 2%) however, that this conclusion neces- 
Ollows from the data. A basic limitation 


is that relations between scale scores or con- 
figurations and behavior have not been expli- 
cated with that high a degree of precision. 
Furthermore, alternative explanations are pos- 
sible. As noted earlier, Dahlstrom and Welsh 
(1960) have suggested that results of Negro- 
white comparisons are a function of “known 
effects of socioeconomic inequities.” No sup- 
port of this hypothesis, however, can be 
adduced from the results of the present 
experiment, as nO significant differences were 
attributable to the class factor. 

One might argue that the crudity of the 
classification scheme used in this study may 
have played some part in the lack of positive 
findings with respect to class or status. How- 
ever, it should be noted that there were really 
two class controls, parental occupation and 
test subjects’ educational level. These criteria 
are analogous to those employed by Nelson 
(1952) with adults, namely, occupation and 
education. One might also point to the gen- 
erally higher clinical scale scores for Negroes 
and ask à la Dahlstrom and Welsh (1960) 
whether this does not suggest that this group 
is actually of lower status than the white 
subjects, despite the attempted matching. A 
proponent of this viewpoint would, however, 
face serious difficulties in explaining why the 
two groups did not differ on K scores, 
especially as K has been shown to be 
significantly positively en po Eaa 

pothesis m4 

‘A more general HYPO, t racial MMPI dif- 


i namely, that ra 
the findings, It ay determined. More 


ferences are cl : 
specifically, one could state that the Negro 
high school seniors’ higher scores on L, F, and 
the various clinical scales are a consequence 
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Affect: 


1. Hostile client behavior will evoke hostile 
interviewer behavior. , i 

2. Friendly client behavior will 
friendly interviewer behavior. A , 

3. Hostile client behavior will evoke inter- 
viewer anxiety. 


evoke 


METHOD 
General Procedure 


Four client roles of the dominant-friendly client, 
the dominant-hostile client, the dependent-friendly 
client, and the dependent-hostile client were fabri- 
cated and student actors were trained to play these 
roles.* Each client-actor was presented in counter- 
balanced order to 34 interviewers-in-training for 
half-hour interviews. The interviewers did not know 
that they were seeing actors. They were told that 
their clients were on the waiting list of the Uni- 
versity Counseling Center and that the purpose of 
the experiment was to test the efficacy of a 30- 
minute interview while at the same time reducing 
the waiting list. The interviewers were instructed to 
adopt their typical interview styles. They were told 
to conduct what would amount to an initial inter- 
view in which their task would be either to effect 
disposition of the presenting problem during the 
interview or to make recommendations for subse- 
quent disposition, There was considerable evidence to 
indicate that the interviewers believed quite deeply 
that they were seeing real clients, Worried con- 
sultations with supervisors, deep concerns about the 
decisions on the disposition of the cases, and detailed 


referral instructions to the Counseling Center were 
the rule. 


In order to prevent the interviewers from talking 


about their experiences, three precau- 
tions were taken, First, they were told not to dis- 


cuss the study with anyone, especially their fellow 


were no indications that leakage had Occurred. 

The roles fabricated for the client-actors were 
designed to represent four Possible combinations of 
control (dominance-dependence) and affect (friendli- 
ness-hostility). Each actor was instructed to present 
a specific critical incident to the interviewer 20 
minutes after the interview had begun, Descriptions 
of the four roles follow: 


*The principal author is indebted to Peter D. 
Russell (1961) who first demonstrated to him the 
efficacy of using student actors as clients. The 
second author was stimulated to investigate this 
technique by Willis E. Dugan (1959) who uses 
actor-clients as a training method. 
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Dominant-friendly client. This client appears vee 
friendly, gets along with people, and likes Fain 
He feels comfortable manipulating people, ee irr 
people, and working with people in a Soe en to 
and leading way. He is a namedropper mee 
use people for his own purposes. In the in some 
situation he takes over. He shows acceptance e P 
extent but always keeps control. He prest ine 
problem in a casual, rather flippant manner z wants 
though it were not really a problem. He jus sation 
his own thinking clarified. He initiates One nts t0 
takes the lead often, talks about what he ie e 
talk about, and does not always respond recon- 
interviewer's directing. He comes in with a a k: 
ceived plan of what he wants the inresviemes AA 
He uses the interviewer as a tool. After 20 r into 
he attempts to pressure the se pie, his 
intervening for him by convincing others 
preconceived plans are appropriate. pts to 

Dominant-hostile client. This client T 
dominate, manage, and directly lead the in sertivê 
He acts confidently, tends to be boastful, ameni: 
aggressive, and rejects the interviewer’s sta els that 
He frequently engages in open sarcasm, He ye j5 
other people are clods to be manipulated, n js 
includes the interviewer, He views others ee J 
dain and talks of others as being without t come 
their own, following the crowd, He does 0 ress? 
voluntarily for help but is responding tO Ponte’ 
from others that he seek personal help. At 20 i) w 
after the beginning of the interview he ee nis 
get the interviewer to confirm his views that pe 
way of life and to make a direct statement 
is not in need of help. g pmissiv”! 

Dependent-friendly client. This client is su e as% 
docile, and conforming during the interview. sine bs 
the interviewer for direction and Jeans on a f 
wants the interviewer to think well of him expe 
times appears to be overconventional. oe t 
great things from the interviewer as he ter whee 
like to make decisions by himself. No mat! z jew? 
the interviewer does, he believes that the me pi” 
will help him. He acts nervous and usni ili 
self; he smiles throughout the interview and Sabie La 
and agreeable. He wants to talk things nie H 
benefit by other’s (the interviewer’s) expert abely 
wants his life so structured that he will kno” ent 
of time what to expect in all situations. A ae 
minutes after the interview has begun, on it, 
that time is running out and asks for, tten; 
viewer’s evaluation and help. He persists 7 ci J 
ing to force the interviewer to be speci 
wh i i is i lient wai 

nt-hostile client. This is a ¢ et BY put 
suspicious, distrustful, bitter, complaining, Yewel hiy 
and self-effacing. He submits to the intervi oste 
is resentful of him. He has considerable perk 
and exhibits it in a veiled way. He nee em $ ge 
needs their reinforcement, but drives te a iu 
and sees his lack of interpersonal success Mater pe 
tion of others. He is uncomfortable in ine by fe? 
situation. He gives information when pre a A 
interviewer but doesn’t take the lead hims' 


j 


es | 


STANDARDIZED CLIENT ROLES 


20 mii 
unutes, he presses the interviewer for advice 


but th A 
4 en rejects any co iv k 
interviewer. 3 nstructive remarks of the 


Training the Actors 


T 

eae Cooperation of the director of theater 

student acto tsity of North Dakota, a group of 

actors were eS Wes obtained for audition. These 

Neluding į Presented with specific role descriptions 
interview behavior, feelings to be expressed 


the i i 
histories ee presenting problems, personal 
tioning nd family background material. The audi- 


Specific heane required each actor to play a 

ing as nt role with one of the co-investigators 
auditioned Paene From the group of actors 
roles, » four were chosen to play the specific 


Six train; 
which training sessions were held with each actor in 
= ne was discussed. After each discussion, 
Rie Played the role with one of the 
ns, the S acting as interviewer. During these 
5 of int investigators tried to anticipate the 
an Drep erviewer behavior the actor might meet 
Sible, Ree him for as many contingencies as pos- 
Vestioat ing each role playing session, the second 
Mage eT acted as observer. When the actors were 
kiy 43 have met the specific role definitions set 
e investigators, they were pretested by 
ni Used em engage in interviews with interviewers 
sions in the study, Their behavior during these 
Siders s rated by six observers. An actor was 
; to be successfully prepared if five of the 


er : A 
“ia indicated that he had met the role 


e act 
CO-iny, 
Sessio; 


Obs 
finiti 


h 
"ervicwers 


hitty_ ‘ 
tele J ipur interviewers-in-training were used as 
D dents in © present study. All were graduate 
en kota, Tan, Psychology at the University of North 
oe levels ging in age from 21 to 50. Their experi- 
nt in aTe commensurate with what one might 
Seve frow aduate program in psychology, that is, 
a Vere a one-semester clinical practicum to 

*S of field experience. 


Ebe; 
ndeng Variables 


ery, 
.” ratings, Each interview was obese 
a eo! -way mirror by a judge trained to Ta e 
"Viewer, m and anxiety as demonstrated by the 
and nt instrument used to rate interviewer 
Ch trol was a modified form of the Inter- 
). The tee Lit (ICL) published by Leary 
CL was modified in two ways: (4) The 
than “xamp] all items were made interview specific. 
c, the item “Encourages others” was 


th 
LER o i 
to tem «o ead “Was encouraging to the client”; 


a ne «yy Pils people with kindness” was changed 
tach © intero $00 kind to the client for the good 
Pooja” View.” (6) Rather than simply checking 

ed hepa dicate the presence or absence of the 
: hanged 


avi à 
Vior, the judge’s response was ¢ 
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to a three-point rating scale to allow for increased 
discrimination. 

Interview anxiety was measured by a behavioral 
anxiety check list developed by the investigators. 
The instrument required the observer to make a 
judgment every 2 minutes about interviewer anxiety 
as manifested in the six following behavioral cate- 
gories: disturbances in mechanics of delivery, dis- 
turbances in content of speech, excessive silence, 
excessive smiling or laughing, extraneous movement, 
and involuntary reflex disturbances. 

The observers were four advanced graduate stu- 
dents in clinical and counseling psychology who did 
not participate in the study as interviewers. They 
were not aware of the hypotheses being tested but 
did know that the clients were in fact accomplices. 
Since it was not practical to have more than one 
observer per interview, each observer was randomly 
assigned to 34 interviews. In order to promote inter- 
observer reliability, the observers were trained by 
having them rate trial interviews simultaneously. 
Training was considered completed when an inter- 
observer mean correlation of .88 on the ICL was 
obtained for the same interview which occurred after 
rating eight trial interviews. Reliability was easier 
to achieve for the ICL than for the behavioral 
anxiety check list. On the latter instrument it was 
found that training produced intraobserver consist- 
ency but did not increase interobserver agreement. 
The ordering of anxiety scores produced by the 
observers on the last three trial interviews was 
identical despite the fact that there were some large 
disparities between the absolute number of anxiety 
items checked by each judge. Training was discon- 
tinued at this point and it was decided that the 
anxiety hypothesis would be tested by transforming 
observer ratings on this instrument into standard 
scores, so that each rating could be considered as 
a deviation from the mean of the observer who 


had produced it. 


RESULTS 


Actor-Client Consistency 

While the actors may have reached the 
criterion performance before the study was 
initiated, the question could be raised as to 
whether they were successful in maintaining 
their specified roles when exposed to a 


TABLE 1 


MEDIAN RATINGS OF ACTOR-CLIENT 
ROLE CONSISTENCY 


Domi- Friend- 
Client role nance liness 
Dominant-hostile 4.0 1.0 
Dominant-friendly 4.0 4.0 
Dependent-hostile 2.0 3.0 
1.0 4.0 


Dependent-friendly 
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TABLE 2 


MEANS OF OBSERVER RATINGS OF 
INTERVIEWER BEHAVIOR 


K. HELLER, R. A. Myers, ann L. V. KLINE 


TABLE 4 


GS 
ANALYSIS OF VARIANCE or ICL RATING 
OF INTERVIEWER FRIENDLINESS 


ICL M ICL M 
Domi- Friend- M 
Client role nance liness Anxiety 
Dominant-hostile 11.39 6.10 .09 
Dominant-friendly 12375 18.03 ~.01 
Dependent-hostile 14.81 16.64 03 
Dependent-friendly 15.59 24.79 06 


variety of interviewer behaviors. In order to 
answer this question, a set of master tapes 
was developed which consisted of 48 ran- 
domly selected 5-minute interview excerpts 
(12 excerpts per actor; 1 each day for the 
length of study) in random order. The master 
tapes were presented to 12 judges who were 
not otherwise connected with the study and 
who did not know that the clients were in fact 
experimental accomplices. The task of the 
judges was to rate the behavior of the client 
on two five-point scales representing control 
and affect. Table 1 Presents the median 
dominance and friendliness scores assigned 
to the actors. It can be seen that all actors 
were judged to have been exhibiting their 
specified role behavior except for the “de- 
pendent-hostile client.” In this case while 
the median judgments indicated submissive- 


ness, neither hostility nor friendliness was 
consistently evident, 


Observer Ratings of the Interviewers 


The means of the observer ratings of inter- 
viewer behavior are presented in Table 2, 


TABLE 3 
ANALYSIS OF VARIANCE oF ICL Ratincs 
OF INTERVIEWER DOMINANCE 


Source d MS F 
Client control 1 333.80 8.275* 
Client affect 1 38.97 827 
Subjects 33 97.01 
Control X affect 1 3.36 111 
Control X subjects 33 40.34 
Affect X subjects 33 47.14 
Affect X control X subjects 33 30,27 

Total 135 


*p < .005; one-tailed test. 


k 
Source df MS 
cal 4.90" 
Client control 1 2545.69 K on 
Client affect 1 3430.52 x 
Subjects 33 4740.36 of 
Control X affect 1 pao 
Control X subjects 33 170.80 
Affect X subjects 33 214.77 
Affect X control X subjects 33 127.77 
Total 135 i OOA 
** p < 001; two-tailed test, 
== p < .0005; one-tailed test. 


te 
For each dimension of the ICL a or 
treatment X treatment x subjects From 
(Lindquist, 1953) was performed. these? 
Table 3 it can be seen that the hypo inant 
concerning control were supported. og d 
client behavior was judged to have Penc? 
significantly more interviewer dep? ients 
than was the case for dependent € 
regardless of condition of affect. +p oryiewe 
The hypotheses concerning gee Te 4 
affect were also supported. From 3 we 
it can be seen that client friendlines ot? 
judged to have evoked significantly cast 
interviewer friendliness than was trict 
for client hostility. However, unpre" ser 
significant effects on this dimension 
also found for client control; that F Je 
dominance was judged to have evoke th 
friendliness on the part of the inte” pat 
An examination of the means reveals. yter 
most of the effect of client control 0” 


v 
di! 


TABLE 5 


NGS 
ANALYsIs oF VARIANCE OF RATING 
OF INTERVIEWER ANXIETY 


\ 


= = f 
Source df MS 
Client control 1 0002 66 
Client affect 1 08 
Subjects 33 1410 430 
Control X affect 1 14 ' 
Control X subjects 33 116 
Affect X subjects 33 m 


Affect X control X subjects 33 
Total 135 


STANDARDIZED CLIENT ROLES 


teed p was mediated by the dominant- 
Table -o For as was demonstrated in 
OE yes, : an dependent-hostile client was 
could E ostile and therefore evoked what 
Dons a considered an elevated friendliness 
sien his interviewers. With regard to 
client ee ta it was hypothesized that 
interviews ility would evoke significantly more 
this h er anxiety. Table 5 reveals that 
Ypothesis was not confirmed. 


DISCUSSION 


Boe ge, the results of this study sup- 
e ua that psychotherapy should 
action” ase reciprocally contingent inter- 
an inter (Jones & Thibaut, 1958). In such 
tesponda both participants are variable 
rom ha” each alert to the incoming cues 
a Partial other and each in turn acting as 
(Kanfer cause of the other’s behavior. Others 
that the, 1961; Krasner, 1958) have noted 
thei ated influence the productions of 
bs ients even though attempts at in- 
E ieee? not be consciously initiated. 
evo ent study suggests that clients may 


niig 


ais reciprocal behaviors from their ther- 
be PA though this influence may not 
Unlike Ceived. It is further suggested that, 
Yn rago wntertransference, which is idio- 
ture, 'C to the therapist's personality struc- 


e 
Side Oked behavior should also be con- 
Walities > a function of the real stimulus 


two i 
4 oug portant differences between the studies 
a = Noted which would lend greater 
t i. Russell’s results: Russell used a 
in eo of anxiety indices than were 
do tclient present study; and, Russell’s 
he ance Stimulus was unconfounded by 
oSA one dependence. All clients 17 the 
fend ly, including those designated as 
eve ibe toduced management problems 
Verne m; erviewer, Hence, the anxiety they 
nting di. Y have been uniformly high, pre- 
Mo cay mination by the observers. 
tions affect the nature of the gen- 
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eralizations that can be made from this study. 
The first of these is that implied by the reliance 
on observer ratings for the dependent variables. 
A question might be raised about the specific 
effect of the treatment conditions upon the 
observers. That is, the investigators recog- 
nized the possibility that the observers them- 
selves were being influenced by the stimulus 
characteristics of the actor-clients to such 
an extent that their own evoked feelings 
may have colored their perceptions of the 
interviewers’ behaviors. Had this been the 
case, the findings would continue to have 
some meaning in terms of the hypotheses. 
However, the W of the sample would have to 
be reduced from 34 (interviewers) to 4 
(observers). 

While the interviewers used in this study 
did vary in experience from those with one 
semester’s clinical practicum to those with 
several years of field experience, the question 
can be raised as to whether more experienced 
counselors and therapists would have re- 
sponded in a similar way. It is the subjective 
impression of the investigators that more ac- 
complished interviewers would experience the 
same personal reactions although they would 
be better able to control their actual inter- 
view behavior. In spite of this caution, how- 
the results should be useful in the 
on of therapists and counselors-in- 
training for the purposes of identification, 
understanding, and management of their own 
evoked behavior in the interview situation. 


ever, 
preparati 
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LANGUAGE PREDICTABILITY AS A FUNCTION OF 
PSYCHOTHERAPEUTIC INTERACTION’ 


STANLEY FELDSTEIN anp JOSEPH JAFFE 


Research Department, William Alanson White Institute, New York City 


An inverse relationship was hypothesized between number of psychotherapeutic 
interviews and therapists’ errors in predicting their patients’ speech, Recorded 
Speech samples were obtained from Ss before their 1st psychotherapeutic inter- 
views, and immediately after their 10th, 20th and 30th interviews. The samples 
were transcribed verbatim, mutilated according to the Cloze Procedure and sent 
to the Ss’ therapists and to 2 groups of therapists who served as controls. 


Instructions were to guess the deleted words. 


were scored for proportions of incorrect guesses. A trend 
although the therapists were found to be 


did not support the hypothesis, 


better predictors of their own patients’ spee 


T 
di erer dundancy of a language refers to the 
could Spi the amount of information 
of the lan conveyed by most efficient use 
actua] ee and that amount conveyed by 
Otmation thar concept, derived from in- 
eful in eory, has been found increasingly 
language tackling some of the problems of 
tion, Te Structure and linguistic communica- 
Process Possible usefulness in exploring the 
fact of psychotherapy is suggested by the 
Mte a the latter may be thought of as an 
(Jaire, I, similar communicative systems 
n ). 
1951) technique, proposed by Shannon (1948, 
"dunda, or obtaining a rough estimate of 
Correc Ney requires that a native speaker 
Ample Y guess a linguistic event—for ex- 
Prece lng letter or word—on the basis of a 
ort, th Sequence of such events. In other 
tee’ singl technique uses successive guesses 
igeique © speaker. A modification of this 
` 53 Called the Cloze Procedure (Taylor, 
"upg Ses a single guess from each of a 
tn ta Speakers, 
of let (igescussion of information theory, 
the gu 4) pointed out that an implication 
à essing procedure is that the more 
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The returned transcriptions 
analysis of the data 


ch than of other patients’ speech. 


an individual knows about the source and 
content of the text being guessed, the greater 
will be his success, and, hence, the greater 
will be the estimate of redundancy. On this 
basis, Miller predicted that the longer a 
therapist interacted with a patient, the 
greater would be the redundancy estimates 
from his attempts to guess the patient’s 
speech, The present experiment tested the 
following restatement of Miller’s hypothesis: 
There is an inverse relationship between 
r of psychotherapeutic interviews and 
of incorrect guesses in thera- 
s to predict their patients’ 


numbe 
the proportion 
pists’ attempt. 


speech. 
METHOD 


s were male and female patients from 
an outpatient psychotherapy clinic who were seen by 
the experimenter on four different occasions during 
the course of their treatment. The first occasion (0:1) 
occurred before their first psychotherapeutic inter- 
view, the second occasion (Oz) occurred between 
their tenth and eleventh interviews, the third (Os) 
between their twentieth and twenty-first interviews, 
and the fourth (O.) between their thirtieth and 
thirty-first interviews. The strongest test of the hy- 
pothesis would have been to have seen the same sub- 
jects on each of these four occasions. Unfortunately, 
this procedure was not feasible. Of the 22 subjects 
who participated in the fourth occasion, 4 were seen 
on all four occasions, 14 were seen on three occasions, 

nd 1 on one occasion. To allow 


3 on two occasions, a o 
for the use of a repeated measurement analysis, each 
of the 18 subjects who had missed one or more 


occasions was matched, on the basis of a vocabulary 
test score, Sex, and age, with another subject who 
had participated in the missed occasion or occasions. 
Thus, the data for each occasion were based on the 


The subject: 


123 


124 


participation of 2 subjects, of whom 7 were males 
5 were females. 

ge subjects were of native birth. The mean 
ages of the groups on the four occasions ranged from 
23.6 to 24.5, with standard deviations that ranged 
from 4.4 to 5.1. On a brief measure of verbal intel- 
ligence (Thorndike & Gallup, 1944) the mean scores 
of the four groups ranged from 15.1 to 15.4, with 
standard deviations from 2.4 to 2.7. All the means 
are at about the 85 percentile in terms of a 
“standard voting population” (Thorndike & Gallop, 
ae a subject was seen individually and allowed 
5 minutes to tell stories about four pictures.? The 
stories were electronically recorded and enough of 
the stories were transcribed to make one typed page. 
Verbatim transcriptions, without editing or punctua- 
tion, were used. The transcribed portion was then 
mutilated according to the Cloze Procedure (Taylor, 
1953), that is, every fifth word was deleted. The 
mutilated typescript was sent, with the subject’s 
name on it, to the subject’s therapist with instruc- 
tions to guess the deleted words and return immedi- 
ately. Copies of the same typescript, without the 
subject's name, were also sent with similar instruc- 
tions to two other therapists who acted as controls, 
The return dates of the typescripts provide the only 
check on the cooperation of the therapists, and 
seem to indicate that, for the first occasion, the 
therapists complied with the instructions. Twenty- 
one male and three female therapists took part in 
the experiment and were rotated so that they were 
in the experimental group for the prediction of their 
own patients’ speech and acted as controls for their 
prediction of other patients’ speech. 

Upon their return, the typescripts were scored in 
terms of correct and incorrect guesses. To be correct 
a guessed word had to be the same as the original 
word in the Passage. Words different from the cor- 
responding original words and uncompleted deletions 
were counted as incorrect guesses. In order to obtain 
a score independent of the size of the transcribed 
speech sample, the number of wrong guesses was 
divided by the total number of deletions in the 
sample, For each subject, then, there were three 
scores: the proportion of Wrong guesses made by 
his therapist and by each of two controls. 


RESULTS 


The hypothesis stated that, compared with 
those of the control groups, the proportion 
of incorrect guesses of the experimental 
group of therapists shows a consistent de- 
crease from the first to the fourth occasions. 
The hypothesis was tested by a trend analysis 
using a complex repeated measurement de- 
sign (Table 1). 

It was thought unnecessary to transform 


2 Cards 2, 8 GF, 12 BG, and 10 from the Thematic 
Apperception Test were used as the stimulus material, 
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TABLE 1 m 
— E PR 
ANALYSIS OF REPEATED MEASUREMENT OF oe 
PORTION OF INCORRECT GUESSES OF THE qa 
MENTAL (E) AND CONTROL (C) Groups OF 
PISTS OVER FOUR OCCASIONS 


Source df MS F 
Matches (M) 21 0288 | 
Occasions (0) 3 0228 7.20" 
Groups (G) 2 0821 12 
Linear O X G 2 0062 212 

Linear E X Ci, 1 0106 2:16 
Quadratic O X G 2 ~~ .0108 $36" 
Quadratic E X Ci, 1 0278 i 

OxM 63.0176 
GXM 42 Ol 14 
OXGXM 126.0050 
= rte 
Note.—The G X M mean square was used as the err x 


for evaluating the differences among Groups., Ths 4 
X M mean square was used to obtain the remaining sg god 
In addition to the quadratic component, there preted. 
significant cubic component which cannot be inl 
*p< .05, 
**p< 01, 


the proportions for the analysis because Di 
were no marked fluctuations in the of UY 
nators (total number of deletions) com 
original ratios. Analysis of the jinear ow 
ponent of the curve of the experimenta onii 
compared to that of the combined Pat! 
groups (Figure 1) yielded a nonsig?! conp” 
of 2.12. Comparison of the quadratic jeld 
nent of the two curves did, however, K cot 
significant difference (F= 5,56). Furth 
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mental and control groups across Son of © ye 
(The means of the summed proportions e 
Control Groups were used to obtain 

Control Groups 1 -+ 2.) 


LANGUAGE PREDICTABILITY 


Ms gat aan Ł tests, between the means of 
itd an af second, second and third, and 
Mental warty occasions for the experi- 
Macen up, indicated that its proportion of 
second o guesses decreased significantly on the 
e a cton anid increased significantly on 
ird occasion. 

Was co between groups F (7.20) 
new RA analyzed by means of Duncan’s 
level of oe range test, using a protection 
© rene (Edwards, 1960, pp. 138-140). 
tion of in s indicated that the mean propor- 
oup Resist yy guesses of the experimental 
tantly fiery the four occasions was signifi- 
Even ang than that of either control group 
eluded ne mean of the first occasion 1S 
id not | diff e means of the control groups 
us, alth er significantly from each other. 

it own ough the therapists’ predictions of 
patients’ speech did not improve 


AN any TABLE 2 
MS Incon, STANDARD DEVIATIONS OF THE PROPORTION 
AND jasi: Gursses or THE EXPERIMENTAL (E) 
AND FoR ROL (C) Groups on Eact Occasion (0) 
THE MALE AND FEMALE SUBJECTS ON 

Eacn OCCASION 


Group Subject 
sex Statistic Or O: Os Ou 


Mixe M 43 34 40 .39 

ii sp 10 .07 .09 ll 

Si Mixed M 42 42 42 43 
sp 09 .15 10 10 

St Mined Mo aS 42 49 45 
sp 09 410 13 13 

M M 46 35 43 31 

r ng sp 08 05 .09 -13 
Fem M AL 34 39 40 

ale sp 10 .08 .09 -10 

M M 41 45 48 42 

&i ii SD ‘10 .15 09 -12 
Fema M 40 .40 .39 44 

ale $p ios .15 09 -09 

Mal M AL .42 .54 -46 

Q i sp 06 .06 .06 -13 
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consistently from occasion to occasion, they 
were, in general, significantly better than their 
predictions of the speech of other patients. 

One question that might be raised is 
whether it was easier to predict the speech 
of same- rather than mixed-sex groups of 
subjects. To explore this possibility the scores 
for the male and female subjects within each 
occasion were investigated separately. Inspec- 
tion of the means and standard deviations in 
Table 2 suggested that the predictions for the 
separate-sex groups were not markedly dif- 
ferent from those of the mixed group. 


Discussion 


Although the results do not support the 
hypothesis, they are somewhat difficult to 
interpret. In general, the therapists were 
better able to predict the speech of their 
own patients than that of other patients. If, 
as Miller suggested, accuracy of prediction is 
considered a measure of the predictor’s 
knowledge of the source, then it might be 
said that this finding had a high antecedent 
probability. On the other hand, Miller also 
that increased accuracy of predic- 
tion would imply, with reference to the 
psychotherapeutic situation, “better under- 
standing” (Miller, 1954, p. 137) on the part 
of the predictor. It may, however, be more 
fruitful to conceptualize the present finding 
in a different way. The fact that the experi- 
mental passages were transcriptions of spon- 
taneous, unpunctuated speech suggests that 
through interaction with their patients the 
therapists learned the vocabulary and sequen- 
tial dependencies of their patients’ speech 
well enough to be able to simulate them. 
Whether such simulation is a measure of 
understanding remains to be tested, as does 
the provocative question of whether quite 
similar speech patterns imply cognitive 
imilarity. 
eo finding of the study was that the 
increase in accuracy of prediction occurred 
at some point within the first 10 interviews. 
One general question raised by this finding 
concerns the duration of interaction required 
in order that the participants predict each 
other’s speech with better than chance accu- 
racy: Again, does the increase accuracy 


suggested 
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occur gradually or suddenly? The answers 
to such questions are relevant to both 
theories of cognition and of psychotherapeutic 
interaction. : 

The fact that the predictions of the experi- 
mental group were poorer on the third oc- 
casion than on the second occasion cannot be 
adequately interpreted. It may be that by the 
twentieth interview some kind of change had 
occurred within the therapeutic relationship 
which made the patients’ speech less predict- 
able. Perhaps a more likely conjecture is that 
the loss of accuracy reflected decreased 
motivation on the part of the therapists. 
Some support for the conjecture seems to be 
provided by the concomitant decrease in 
accuracy of one of the control groups, and 
by the fact that the therapists were not as 
prompt in returning the deleted word sheets 
(as they had been earlier in the experiment). 

In brief, then, the results indicate that the 
relationship between number of psychothera- 
peutic interviews and therapists’ ability to 
predict their patients’ speech is a complex one, 
The results also indicate, however, that thera- 
pists do become better predictors of the speech 
of their own patients that than of other pa- 
tients. The knowledge is simple, but is im- 
portant for at least two reasons. On the one 
hand, it has been said that what patients 
learn in psychotherapy is to talk their thera- 
pists’ “language.” The implication is that the 
patients of a group of therapists of the same 
training milieu learn to speak the “language” 
of that group. Were such a conjecture true, 
it might have been expected that the control 
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therapists of the present experiment would 
have become as able to predict the ial 
of the patient sample after the latter ha a 
10 sessions as did the experimental oe 
The results do not support the conjecture F 

On the other hand, the results suggest i f 
the Cloze Procedure is a rather simple, ° al 


tive way of measuring change in ae su | 
peutic relationship. The correlates ° ald be | 
change have yet to be explored. It wo ther 
useful, for example, to relate change ve pe 
pists’ ability to predict their patients ontinu® 
to the length of time their patients © m bol! 
treatment, or to reports of progress fr0 
therapists and patients. 
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cana ACCURACY OF ACTUARIAL PREDICTION 
XPERIENCED CLINICIANS, AND GRADUATE STU- 
DENTS IN A CLINICAL JUDGMENT TASK* 


LAWRENCE C. GREBSTEIN * 


3 groups of judges, varying in amounts of clinical experience, 


Veterans Administration Hospital, Lexington, Kentucky 


predicted 


Wechsler-Belleveue IQ scores from Rorschach psychograms in competition 


with a multiple regression equa 


tion. The results indicated that: (a) There 


Was no significant difference between the degree of accuracy of the judgments 


of the equation and of the clinici 
regression equation were able to ju 
chance. (c) Clinical predictive accu! 
relations between predicted and actua! 
did not increase significantly wi 


Despi 
deng Pite the availability of extensive evi- 
listiga] © Question of “clinical versus sta- 
the peta is still an unsettled one 
Mthough ‘hie of many clinical psychologists. 
Ptessiye hey eehl (1954) has offered an im- 
so acta of data for the superiority of 
eat Perso approach to clinical judgment, 
thy clusions ns have resisted accepting his 
ait the slog example, Holt (1958) feels 
p Prong vidence in favor of the actuarial 
a atal and be a function of the experi- 
aga a in which the clinician is at 
ual Pine rather than a function of the 
ie. Suggests thay of the statistical method. 
[Í Made oF that an interesting study could 

1 Simultaneous attempts to predict 


E 
a 
B 


shis 
the M: in poet is based upon the author's master’s 
Asters ie fulfillment of the requirements for 
d in Sres; University of Kentucky. It was 
ological vAumarized form at the 1961 American 
Mpy T is i Ssociation Convention in New York. 
o kein a debted to the University of Kentucky 
ta, inva ae Director and to John Donahoe, 
to ka assistance in the analysis of the 
the se Griffith, Chief Research Psy- 
Ong D Ken eterans Administration Hospital, 
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d fie Partici ospital, Lexington, for their co- 
of Presen Vailabne tion in the study, and for making 
enp tly $ 

Y van ang aduate student at the University 
“tug Tans A a Clinical Psychology Trainee at 
M dministration Hospital, Lexington, 


e N S a 

ES ZE ZEEE. 
A o 
eZ 


ans. (b) The judges and the multiple 
dge IQ scores significantly better than 
racy, as indicated by high positive cor- 
1 IQ scores and low average error scores, 
th amount of clinical experience. 


the same criteria from the same data by 
clinicians and statisticians who have gone 
through the same preliminary steps. In the 
same vein, Meehl (1956) states, in reference 
to a review of the studies in this area: 


I have reservations about some of these studies; 
I do not believe that they are optimally designed to 
exhibit the clinician at his best; but I submit that 
it is high time that those who are so sure that the 
“right kind of study” will exhibit the clinician’s 
prowess, should do this right kind of study and back 
up their claim with evidence. 


One of the purposes of this study is to 
carry out Holt’s suggestion and accept 
Meehl’s challenge. Accordingly, the present 
study is concerned with the ability of cli- 
nicians to judge Wechsler-Bellevue IQ scores 
from Rorschach psychograms in competition 
with the ability of a linear multiple regression 
equation to perform the same task. Although 
this is not a function in which clinical psy- 
chologists are specifically trained, it is one 
which they frequently do. Also, since the 
experimental design includes judges with dif- 
fering amounts of clinical experience, it per- 
mits an evaluation of judgment in relation 
It is an attempt to clarify the 
many previous studies in this area in two 
ways. First, the task is equally amenable to 
either statistical or clinical analysis, but favors 
neither approach; and second, the results can 
be evaluated in terms of a more objective 


‘on of validity. 


to experience. 


criteri 
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HYPOTHESES 


The following hypotheses are being tested 
in the present study: 


1. There will be no significant difference 
between the degree of accuracy of the judg- 
ments of the linear multiple regression equa- 
tion and the sophisticated group of clinical 
psychologists. Pa 

2. Both the judgments of the sophisticated 
group of clinicians and the linear multiple 
regression equation will be significantly better 
than chance. 

3. Clinical predictive accuracy, as indicated 
by high positive correlations between pre- 
dicted and actual IQ scores, will increase with 
amount of clinical experience. 


METHOD 
Judges 


Three groups of judges, five in each group, were 
used in this study. The naive group consisted of 
graduate students who had completed an intro- 
ductory course in Rorschach administration and 
scoring. They had not been trained in advanced 
Rorschach interpretation and had no formal clinical 
experience with the Rorschach. The semisophisticated 
group of judges was composed of graduate students 
who had completed both an introductory and inter- 


were trained in the 
Rorschach Test. 


Psychograms 


Approximately 100 IQ scores of patients tested in 
1954 and 1955 were taken from the files of the 
Lexington Veterans Administration Hospital, and the 
distribution and range of these scores were deter- 
mined. From this original population, 30 IQ scores, 
ranging from 57 to 142, with the same approximate 
distribution as the original sample, were selected 
for the judges’ task. The mean of this sample was 
91.56 and the standard deviation was 20.34, Each 
record was screened to insure that the following 
criteria were met: 

1. The Wechsler-Bellevue Intelligence Scale and 
Rorschach must have been administered to the 
patient within 1 week of each other. : 

2. In no case were any of the judges taking part 
in the study to have administered or scored any 
of the tests or collaborated in the final report. In 
addition, the scoring was checked for each of the 
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rams 
records and any irregularities in the ea 
were corrected. In a few cases, some wa ont 
originally selected had to be replaced be satis 
or more of the above criteria had not peni orden 
The psychograms were typed, put in rani ca jode 
coded by number, and presented to cai ach 10 
The judges were instructed to SaNa wanted 
Each judge was allowed as much time as necesi ‘, 
The judges received the psychograms $ Se at $ 
so that none of them were doing the gf com 
same time, and each judge was asked no 
anyone else in making his judgments. 


pst! 


Multiple Regression Equation 


scot 
Three hundred and nineteen additional 1 i7 
were chosen from the same population adit, 
files as that used for the judges’ task; the fie uae 
scores were then put in the form of @ Q p% 
distribution with class intervals of 10 tion t0 4s 
From these, 100 were selected in propo” 131: lie 
original distribution. The range, 47 t0 iyt 
approximately the same as for the sample eh psy 
judges. As in the previous sample, i regress 
gram was checked for errors. A multi e at 
equation was derived, based on 10 Romet ng) 
from the Beck scoring system. These W tent Qe 
W%, M, F+%, Z, A%, P, range of con this © 
and range of determinants (RD). BION use 
tion, the most significant variables pon dete 
determine the best linear equation; a ™M the Red 
description of this procedure appears oe nd! 
section. Finally, scattergrams were ples 
of the 10 Rorschach factors, to determ cu vi 
relationship to intelligence was linear oat w 
This was done to justify the use of a ae one 
regression equation as the appropria 
Prediction. 


RESULTS ke 


. pultit! 

Comparison of Judgments with i 
gression Equation 

i ë use y 


The best linear equation to b pinet y 
compete with the judges was deter fro g 
successively dropping out factors, u of 
10-variable equation until the resi bs ij 
of squares for regression N (grouP ant fy 
ables dropped) represented a signifi This 
when compared with regression /V- que’ 
lows the procedure suggested by ó 
(1952), who states: 


sable ©° 
To test whether any particular variabl en ves 40 
significantly in the prediction of a depe” analy 


d 


P y 
., s 4 
it is necessary first to carry out ine a i% y 
variance when the variable is include is isio if 
repeat the analysis when the variable ue g’ 


In this manner, the sum of square gimat? 
inclusion of the variable may be € 75)" 
tested, using the variance-ratio test (P- 
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z TABLE 1 
REDICTIVE TRACY 
TIVE ACCURACY OF THE JUDGES AND THE MULTIPLE REGRESSION EQUATION 


: Multiple 
. Semi- regression 
Naive sophisticated Sophisticated equation 
Correlation 5 
3 .50* 65** .68** 56** 
Average error 17.77 13.92 14.49 13.53 
4? $05, = 
LESIN 
Thi 
in S Proce ; 
tion» tha dure also allows for the “educa- judge (Table 3) represents the mean devia- 
Mgful com equation, so that a more mean- tion from the 30 actual IQ scores of his 
Perfo, parison can be made between its 30 predictions of IQ. The group mean average 
errors were computed in similar fashion to 


r ‘ 
thalysis ne, with that of the judges. The 
Mt log ated that there was no signifi- 
educeg ti hen the 10-factor equation was 
ps dimin 5 factors or when the W% factor 
pa Ver e from the 5-factor equation.* 
range E was a significant loss when 
tig r Mali ne was dropped from the 
in Was cho ion. Thus, the following equa- 
wel cant geo to compete with the judges. 
ated S (.05 and .01 levels) were asso- 


Wit 
h all of the factors in the equation. 


IQ = 
rena + 3.73 M + .20 
Th % + .25Z + 1.20 RC 


e 
aS Preqi A 
am relatio S made by this equation had a 
The! 13.53. with actual IQ and an average 
atea © Broun 
ied by cor mean correlations were calcu- 
Dreg Can pied oe the actual IQ scores with 
Rap tions icted IQ (based on five judges’ 
Se. It ig ri each group) for each psycho- 
è Compute ertant to note the method for 
ate as since a careful inspection 
tiong? high ght suggest that the group means 
Of th When compared with the correla- 


hoe the: a 
f he avera individual judges in each group. 
tapo 8e error score for each individual 


ci signi , containing an analysis of 
mat pets erificance of merece (Figure A) and 
st, ep, "ae associated with factors in the 
Aylty siteg gression equation (Figure B) have 
z D the American Documentation 
Race Taye blie at No. 5 from ADI 
by, ttt fk ibray. rations Project, Photoduplication 
ce Congress; Washington 25, D. (> 
a $1.25 for microfilm or $1.25 for 
make checks payable to: Chief, 
ervice, Library of Congress. 


the group mean correlations. That is, the 
difference was found between actual IQ and 
the mean predicted IQ for all five judges 
within each group for each psychogram. 

Table 1 summarizes a comparison of the 
performance of the equation with the judges. 

The results summarized in Table 1 indicate 
that Hypotheses 1 and 2 were supported. 
Both the sophisticated group of judges and 
the multiple regression equation were able to 
predict Wechsler-Bellevue IQ scores signifi- 
cantly better than chance, and there was no 
significant difference between the judgments 
of the sophisticated group of judges and the 
multiple regression equation. 


Effect of Experience on Judgments 
Table 2 shows the individual and mean 
correlations of predicted with actual IQ for 


the three groups of judges. 


TABLE 2 


epicrep IQ wiru AcrvaL IQ 


CORRELATION OF PRE 
FOR INDIVIDUAL JUDGES AND GROUPS 


eee — = E 
Semi- 
sophisti- Sophisti- 
Naive cated cated 
.50* 65" .68** 
Me 1 —.02 Si .56** 
or 2 34 ae sive 
wk Pe has 57 
ee A Bi a E hi .58** 
tae 5 56** 64** 68** 
g! 
aipee ponnn 
+p <.05. 
a <.01. 
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TABLE 3 


AVERAGE ERROR Scores For INDIVIDUAL 
JupcEs AND GROUPS 


Semi- 
sophisti- Sophisti- 
Naive cated cated 
Mean 17.77 13.92 14.49 
Judge 1 22.17 14.93 14.56 
Judge 2 16.87 14.70 16.40 
Judge 3 14.90 13.36 12.97 
Judge 4 16.67 12.10 14.86 
Judge 5 18.87 14.50 13.63 


Hypothesis 3 was tested by an analysis of 
variance (Table 4) based on predicted IQ 
scores for the three groups of judges. The 
analysis indicated that there was no significant 
difference among judges between groups. 
Therefore, it was concluded that the ability 
to judge IQ from Rorschach psychograms 
does not improve with additional clinical 
experience. The only significant source of 
variance was between Rorschach psycho- 
grams. 

Fisher’s z transformation test (Edwards, 
1954) for the significance of the difference 
among correlations indicated that the mean 
correlation for the three groups of judges did 
not differ significantly from one another. Since 
the analysis of the results indicated that there 
were no statistically significant differences 
between the groups, it was necessary to 
reject the hypothesis that accuracy of clinical 


TABLE 4 


ANALYSIS OF VARIANCE FOR DIFFERENT 
LEVELS oF JUDGES 


Source of variation df MS F 
Between judges (J) 14 
Groups (G) 2 901.04 1.55 
Between judges 
same groups 12 580.27 
Within judges 435 
hachs (R) 29 2,409.31  23.27** 
Tow ' 58 88.06 .85 
Pooled J X R 348 103.54 
Total 449 
*k p <.01 
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judgment increases with amount of clinical 
experience. 

Cues used by judges. Additional analyses 
of the data indicated that the interjudge 
reliability was much higher for the more 
experienced judges (semisophisticated and 
sophisticated groups) than it was for the 
naive judges. That is, experienced judges 
tended to resemble each other in their pet 
formance, making similar errors. The rang? 
of interjudge correlations for the three groups 
was: Naive, —.07 to +.72; Semisophisticate® 
+.55 to +.85; and Sophisticated, +.54 © 
+.80. Individual multiple regression on 
tions derived for each judge showed that he 
more experienced judges frequently used ) 
same four variables, (M, Z, F+%, and R a 
as the competitive equation. The multiple w 
gression coefficients associated with all A 
judges ranged from +.84 to +.98, indicati? 
that the judgments of the individual judg” 
could be predicted with a high degree 
accuracy. W 

An inspection of the individual predict!© E 
of all the judges and the multiple regressi? 
equation suggested that predictive accura' 
was consistently lowest for the higher f 
scores. The average error computed bewsa 
the five highest IQ scores and their on 
sponding predictions by all the judges 4 
the equation was 23.48, as compared Wie 
their mean average error of 14.92 for st 
entire task sample (including the five nighe 
IQ scores). It appears that the relations ie 
between IQ scores and Rorschach variab 
breaks down considerably within the UPP 
range of IQ scores, and this tends to Jow" 
the overall correlations of the judg? 
process for both the clinical judges and 
multiple regression equation. 


e 


DISCUSSION 
ust 


Any investigation of clinical judgment Oe 


face certain unavoidable difficulties. Fon 5 
ample, we might ask about this study’, | 
predicting IQ scores from Rorschach P 
grams a true indication of clinical judg™ 

Tt is very possible that this task, while al 
lowing for a more rigorous experim ited 
design and statistical analysis, is s0 atio” 
that the results do not yield any infor™ 


Í 
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about the complex process of clinical judg- 
ment. It is difficult to determine if a cli- 
Nician is doing the same thing when he makes 
a judgment on the basis of assorted informa- 
tion as he is when he is predicting one test 
Score from another test, as the judges did in 
this study, ; 

The question might be raised as to whether 
many of the judges were taking the task seri- 
ously enough and whether, with more time 
fhe involvement, they might have “beaten” 
s F equation. The following statement was in- 

erted in the instructions, in order to motivate 
b “Since you will be competing with 
te er clinical judges and also with a multiple 
8ression equation, try to be as accurate as 
aca” While no objective measure of the 
gl Motivation is available, most of them 
on € very eager to know how well they did 
Ds the task, Most of the judges kept the 
to chograms several days, but it is difficult 
estimate how much of this time was actu- 
ae doing the task. Several of the judges 
Brar rted that they referred to the psycho- 
in th S on different occasions, “mulling it over 
Actua minds. They reported, after this delay, 
ually taking from 30 to 60 minutes writing 
own their estimates. Not only did the judges 
tho W that the psychograms to be rated were 
Rock, of patients, but many of the Lt 
fan at the hospital from which the psyc 10> 
Ma S were chosen. Although every effort was 
Riven to limit extraneous information i 
Ma; n to the judges, it is possible that they 
esti ave a tendency to lower their total 1Q 
with wes on the basis of their familiarity 
Confront hospital population. This hpo = 
delibe ed as a possible source 0 min 
tang rately using a wide but representa", 
€ of IQ scores. In order to determine 


h . 4? 
bi cher or not the judges’ predictions were 


a; ) $ 
bu by their familiarity with the hospital 
twa ation, correlations were computed be- 
a h group 


the mean predictions for eac 
Th all the actual 10’s except the fiye highest. 
expense telations increased for the two more 
gro enced groups, but not for the nain 
from when the highest IQ’s were remove 
ing, the sample. However, since the largest 
“qu ase occurred in the multiple regression 
Teg, On, it cannot be concluded that bias was 
“Ponsible for the judges’ predictions. 
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It might be argued that depriving the 
judges of the original protocols represents a 
departure from the manner in which a cli- 
nician who does not depend upon actuarial 
methods arrives at his judgment of intel- 
ligence. This procedure was deliberately used, 
and the rationale behind it is based on some 
findings reported by Todd (1954). On a 
similar task, involving the estimating of IQ 
from the Rorschach test, Todd found that, 
while the IQ ratings using total protocols were 
significantly more highly correlated with IQ 
than psychogram ratings alone for the group, 
only one of the 10 judges significantly in- 
creased his ratings. He concluded that the 
addition of verbal cues may increase some 
clinicians’ ability to judge intelligence from 
the Rorschach, but there is no evidence that 
this is generally the case. He also found that 
39 sophomore college students were able to 
rate 78 Rorschach verbal protocols (without 
the scoring records) according to the degree 
of verbal complexity believed to be reflected 
in the protocol. The correlation of the median 
ratings with IQ was +.58, as compared with 
+.47 for the clinicians’ ratings based on 
formal factors alone. This suggested that 
verbal factors were at least as valuable as 
formal factors as cues to intelligence. On the 
basis of these findings, the present investi- 
gator felt that allowing the judges to use the 
complete protocol would probably not increase 
their accuracy significantly and it might tend 
to confound the results. That is, one would 
not be able to say that ability to do the task 
was dependent upon the amount of experience 
or training in clinical psychology. In addition, 
there was no good way to include the qualita- 
tive differences of the verbal protocols into 

equation. 
Do (1955) has indicated that judgmental 
ability is positively correlated with intel- 
ligence. Therefore, the objection might be 
raised that, in the present study, the success 
of the judges in predicting IQ scores is as 
much a function of their intelligence as it is 
of their relative amounts of experience. In 
order to see if this was the case, a rank order 
correlation coefficient was computed between 
] aptitude score (consisting 


he mean genera [ 
of quantitative and verbal percentiles) of the 
Graduate Record Examination and accuracy 
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of prediction for the 10 graduate students of 
the naive and semisophisticated groups of 
judges. The rank order correlation coefficient 
(Rho) obtained was —.28. Thus, it seems 
that, for this study, the judges’ ability to 
predict IQ’s was not a function of their 
intelligence alone. 

Finally, Hammond (1955), in his discus- 
sion of the clinical method, has included data 
from the study by Todd (1954), which is 
comparable to some of results obtained in 
the present study. For example, Todd found 
that the median correlation coefficient be- 
tween Rorschach-estimated IQ and Wechsler- 
Bellevue IQ for 10 experienced clinicians 
was +.47. Although Hammond states that 
this correlation is “better than chance and 
fairly impressive,” it is considerably lower 
than the +.68 correlation coefficient ob- 
tained by the the experienced clinicians in 
the present study. In terms of the relationship 
of various Rorschach factors to IQ, Todd 
reports the following correlations: +.31 with 
R, +.33 with M, —.30 with F%, and +.28 
with F+%. The present study indicated that 
M and F+% correlated positively and signifi- 
cantly with IQ, suggesting some stability of 
the relationship between these Rorschach 
factors and Wechsler-Bellevue IQ. As in the 
present study, Todd predicted the responses 
of the clinical judges by developing separate 
multiple regression equations (based on the 
four factors best for each judge) for each of 
the 10 clinicians. The median was +.85 and 
the individual correlations ranged from +.74 
to +.92. In the present study, the corre- 
sponding range of correlation coefficients was 
from +.84 to +.98 for all judges, regardless 
of their experience. These data support 
Hammond’s (1955) contention that “. . . the 
multiple correlation model which predicts 
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that the clinician combines the data from the 
Rorschach in a linear, additive fashion is 4 
good one...” (261). 


FINAL REMARKS 

s discussion, 
st be inter- 
same time, 
simistically 
been 


It is obvious, from the previou 
that the results of this study mu 
preted with reservations. At the 
they must not be viewed either pes 
or capriciously. These findings have 
obtained. 

1. Experienced clinical psycho 
make accurate estimates of intelli 
Rorschach summaries, but there is roo 
improvement. 

2. A linear multiple regression equati 
do the same task with approximately 
but no greater success. 

3. It appears that clinical 
beyond an introductory course in R 
does not significantly increase jud 
predictive ability. 


logists ca? 
from 


gence ri 
m for 


on can 
equa 
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PURDUE PEGBOARD AS A 
AND LATERALITY 
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A portion of the Purdue Pegboard Te: 


given to 80 patients on @ neurology servic 
to indicate the presence and laterality 


optimum cutoff scores it was possible 
or diffuseness of brain 


support the use of sensorimotor ra 


damage in 70% of the cases. 


PREDICTOR OF THE PRESENCE 


CEREBRAL LESIONS* 


Jr., ERIC LEVITA, anv NORMAN FARBER 


Yeshiva University 


st taking 3 minutes to administer was 


e of a general hospital. With 


Brain damage without 


determined in 90% of cases with 7.59 


e test was then cross-validated 
tive efficiency. The data 


ther than perceptual cognitive tests in 


are less dependent upon educational 


screening for brain damage because they S 
lizing significance of sensorimotor 


background and because of 
dysfunction. 


oe pts of psychologists to diagnose the 
een pine laterality of cerebral lesions have 
can be ized for failure to use tests which 
and Ser gi reliably, to control for age 
And cros ation, and to use adequate validating 
lates Porras procedures (Meyer, 1961; 
isanos ory ‘The usefulness of psycho- 
Sidereq j of brain damage must also be con- 
Phy terms of cost in time and money, 
Without with which diagnosis can be made 
base ý such psychological study, and the 
in e incidence of neurological disorder 
iate platio sampled. 

and Co, and Vaughan (1962) and Vaughan 
ogniti? (1962) in a survey of perceptual 
brain de and sensorimotor performance in 
for as amaged patients found deficits in per- 
Co Dare on all tests in brain damaged as 
Cognitive to control groups. The perceptual 
Wi Au tests, however, correlated highly 
` Neons] cation. In a group of patients hetero- 
ton, the distributed with regard to educa- 
Woulg he predictive efficiency of such tests 
Set up F lowered unless separate norms were 
“Wueation groups differing in amount of 
eq cate PProach which overcomes effects of 
Patina a level involves intraindividual com- 
techni S on a large variety of tests. Such 
ig reveal specific or general tests 
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defects correlated with brain damage (Reitan, 
1962). The results are obtained at the cost 
of the time involved in a complete evaluation 
and thus such batteries are not feasible as 
screening procedures. 

An alternative approach to the psycho- 
logical assessment of brain damage lies in the 
use of sensorimotor tests (Semmes, Weinstein, 
Ghent, & Teuber, 1960; Teuber & Weinstein, 
1954; Yates, 1954). These tests offer meas- 
ures of function essentially independent of 
educational achievement. We found the 
Purdue Pegboard (Science Research Associ- 
ates, 1948) to be a test which is correlated 
with other indicators of sensorimotor per- 
formance (Vaughan & Costa, 1962) and 
independent of educational level in normals. 
We decided, therefore, to assess its use as a 
screening device in detecting the presence and 
laterality of cerebral lesions. 

The addition of a short screening test of 
high accuracy to the routine battery of psy- 
chological tests given in many clinical set- 
tings would be of great value. The presence 
of positive findings would suggest medical 
evaluation or re-evaluation, perhaps even in 
instances where brain damage was not sus- 

ected. Additional psychological techniques 
also relevant to brain damage but not often 
used routinely might then also be used. The 
small cost of administering a short sensori- 
motor test routinely would be amply repaid 
by the detection of even a few unsuspected 


cases of brain damage. 
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TABLE 1 


CORRELATION OF AGE AND EDUCATION WITH INTELLECTUAL AND SENSORIMOTOR TESTS IN A 
Group oF 26 Noy-Braty-Dawacep Patients 


Education with 


age held 
Tests Age Education constant 
Age — —.56 oe 
Mill Hill vocabulary —43 69 -60 
WAIS block design —48 ‘60 56 
Pressure threshold (right hand) 74 —.o1 — 40 
Two-point discrimination (right hand) 22 = 30 =a 
Finger oscillation (right hand) = 57 39 14 
Purdue Pegboard (right hand) —.39 -30 Al 


PROCEDURE AND SUBJECTS 


The peg placement part of the Purdue Pegboard 
test was administered for the left hand, right hand, 
both hands simultaneously, exactly as prescribed in 
the examiner’s manual (Science Research Associates, 
1948). The actual testing time was 90 seconds, 30 
seconds for each condition. With required demonstra- 
tions and instructions the entire testing procedure 
required approximately 3 minutes. The number 
of pegs placed correctly was recorded for the left 
and right hands, and the number of Pairs placed 
correctly was recorded for the simultaneous condition, 

Two samples of patients were tested. The vali- 
dating sample consisted of 80 patients referred to 
the neuropsychology laboratory for psychodiagnostic 
evaluation. They were independently diagnosed by 
clinical neurological examination, electroencephalogra- 
phy, and neuroradiographic procedures. All patients 
considered testable were capable of reaching the test- 
ing room on foot or in a wheel chair. Psychological 
test performance or 54 of these patients has previ- 
ously been reported (Costa & Vaughan, 1962; 


n mple 
Vaughan & Costa, 1962). The cross-validation 55 ‘ 
consisted of 65 consecutive testable admiss deter 
the neurology service, Their diagnoses wert dating 
mined by the same procedure as in the hibite 
sample. The brain damaged patients ‘aie 
wide variety of acute and chronic disorders of 
monly seen on an adult neurology service. 
neoplasm, traumatic injury, degenerative, 
and infectious disease were all represented. 
trol groups contained patients admitted riphe! 
neurology service with lesions either in the peporacie 
nervous system or below the level of ma g non” 
spine. Some patients were diagnosed as a: tes 
neurologic lesions. Ten patients could not jon and 
gorized on the basis of neurological evaluate ays 
were excluded from consideration, Table amples 
the sex, age, and education data of the $ 
studied, 


RESULTS AND DiscussiON 


ent. 
_On the basis of inspection of the fred" je 
distributions of the scores of patients ? 


TABLE 2 
Sex, AGE, AND EDUCATION oF VALIDATION AND Cross-Vatwation SAMPLES Ee 
wr Age Education 
EE ee ee 
Group N M F M SD M SD 
Validation sample 
Control 26 10 16 51.92 16.73 8.53 2.90 
Left lesion 20 i 9 57.50 13.80 805 4.44 
Right lesion 20 10 10 57.75. 851 7.65 3.07 
Bilateral lesions 1 8 6 47.14 12.93 8.17 3.68 
Cross-validation sample 
Control F PS 48.33 14.20 9.60 3.84 
Left lesion 0 3 47.10 15.35 9.10 2.84 
Right lesion w ee 55.10 14.20 6.10 2.88 
Bilateral lesions 0 Ar. 3 51.60 15.79 7.80 3.38 


PURDUE PEGBOARD AS A PREDICTOR OF CEREBRAL LESIONS 


eels sample alone the following cutoff 
rules? were derived: 


Subject below age 60 


Sie brain damaged if one or more of the 
S10 Un phir: left < 11, right < 13, simultaneous 
non-brain d right, right > left + 3. If none apply call 
2. Call | amaged. 
it right Kion (a) left if left > right and (0) right 
do n eft +3, If called brain damaged but a and 
ot apply, call brain damaged bilateral. 


Subject age 60 and above 


tlan brain damaged if one or more of the 
8, eS eee left <10, right < 10, simultaneous 
On-braj right, right > left +3. If none apply call 

2. Can eased. 
if menal lesion (a) left if left > right and (b) risht 
and b left +3. If called brain damaged but a 
do not apply, call brain damaged bilateral. 


Th 
ela rules were constructed to provide 
“imal predictive accuracy with no special 


Consi ; f ; 
ideration given to minimizing false posi- 


2 
poate Proportion of left-handed patients was too 
© permit the development of separate norms. 


135 


tive or false negative results. Obviously other 
cutoff rules could be determined to minimize 
either kind of error at the cost of increasing 


the other. 


Predictive Efficiency 


The cutoff rules were then applied to the 
independent cross-validation sample of con- 
secutive admissions. 

Table 3 shows that the predictive accuracy 
of the test with regard to the presence of 
brain lesions is approximately 90% in both 
the validation and the cross-validation sam- 
ples. In all 135 cases there were only six 
false positives and eight false negatives. We 
assume that the cross-validation sample of 
consecutive admissions provides an estimate 
of the base rate incidence of cases falling into 
each diganostic category. Seventy-three per- 
cent of all cases are brain damaged so a 
predictive accuracy of 73% can be attained 
by calling every case brain damaged. The 
accuracy of 89% obtained with the Purdue 


TABLE 3 


Accuracy OF PREDIC 


TION OF PRESENC! 
WITH THE PURDUE PEGBOARD 


E OR ABSENCE OF BRAIN LESIONS 


Prediction 
i Percent 
Sample a N Correct Incorrect accuracy 
Validation Control 26 n 2 92 
Left lesion 20 19 i a 
Right lesion a 13 al F 
Bilateral lesion 
a enia 64) (48) (6) (89) 
Total 80 "= : a 
(a 
*08s-validation Control 13 1 0 itt 
Left lesion 10 9 1 5 
Right lesion 10 19 1 95 
Bilateral lesion 20 (38) (2) (95 
(All brain damaged) (40) ) 
5 Total 55° 5 ; y 
“dation cross- Control 41 A : S 
alidation combined Left lesion 30 28 3 93 
Right lesion 30 32 2 95 
Bilateral lesion 34 86) (8) (92) 
(All brain damaged) (94) ( 
Total 135° 121 = a 


a 
Ten nondiagnosed cases omitted. 
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TABLE 4 
ACCURACY OF PREDICTION OF LATERALITY OF BRAIN LESIONS WITH PURDUE PEGBOARD 
i Prediction Accuracy 
Neurological Non- Lef i i 
: urologic: . No Left Right Bilateral cor- Percent 
ample diagnosis N lesion lesion lesion lesion. Correct ines accuracy 

Validation Control 26 24 1 — 1 92 
Left lesion 20 4 11 1 4 A : 55 
Right lesion 20 1 — 13 6 y : 65 
Bilateral lesion 14 1 2 3 H E 6 57 

: 2 3 6 
(All brain damaged) (54) (6) (13) (17) as) (32) (22) (59) 
Total 80 30 14 17 19 56 24 10 
Cross-valid- Control 15 1 1 1 2 73 
ation Left lesion 10 8 = 2 4 k 80 
Right lesion 10 1 = 6 3 : á 60 
Bilateral lesion 20 1 4 7 8 È é 40 
(Al brain damaged) (40) w a» a3 (13) G eis) (655) 
Total 55 13 13 14 15 33 22 60 
Validation Control 41 35 2 1 3 85 
cross-vali- Left lesion 30 4 19 1 6 i ; 63 
dation Right lesi 30 2 — 19 9 n ji 63 
combined Bilateral lesion 34 2 6 10 16 16 k 48 
(All brain damaged) (94) (8) 5) GO (31) 64) G0) (57) 
aail 135 43 27 31 34 89 46 66 

er 


Pegboard is significantly higher? (p < .05) 
than that obtained calling all cases brain 
damaged. 

The predictive accuracy with regard to 
lateralization is shown in Table 4. Seventy 
percent accuracy in the validation sample 
shrinks to 60% accuracy in the cross-valida- 
tion sample. If cases were assigned to cate- 
gories of lateralization on the basis of their 
representation in the cross-vali- 


dation sample of consecutive admissions an 
accuracy of lateralization of only 27% would 
result. The accuracy of lateralization using 
the Purdue Pegboard is significantly higher 

p < .01, see footnote 3) than 27%. 

Thus both with regard to the presence and 
laterality of lesions the Purdue Pegboard 
offers a significant improvement over predic- 


tion from base rate. 


proportional 


Errors in Prediction 

e false positive cases with 
regard to the presence of cerebral lesions 
suggests the test is weakest in ee 
patients with conversion symptoms an eo 
peripheral nerve or cord disease affecting the 


he significance of d 


Analysis of th 


34 tests for t ifferences between 
Ł tests 
percentages. 


upper extremities from brain damaged P% 
tients. No generalization seemed to apply : 
the false negative cases. Many of the bi- 
lateral lesion patients who were misdiagnos® 
as having lateralized lesions did, in fact, g! 
independent clinical evidence of greate" I 
pairment in the hemisphere predicted io 
with 

t 


nevertheless were considered as errors 
regard to lateralization. The test does n 
discriminate about 25% of patients wit! 
lateralized lesions from patients with pilateral 
lesions. No explanation of this fact was 4 i 
parent from a consideration of the case 0 
volved. It is to be noted that only 1 ° g 
patients with lateralized lesions was actually 
mislateralized. 


Implications 


Even though the Pegboard test 
demonstrated to be useful as a sC" 
device, its predictive usefulness in other a 
tings remains to be determined. situation 
where the base rate incidence of diagr?” it 
groups is different, such as in outpatie” 
clinics, state hospitals, or other institution’ 
would change predictive efficiency and mig? 
require new cutoff scores- 

No consideration was 


fulness of the test in diagnosing 


has bee” 
eening 


given here to the uS% 
«qiffcult 


D 


PURDUE PEGBOARD AS A PREDICTOR OF CEREBRAL LESIONS 


i ae to “easy” cases. While some of 
Isio ma tested were hemiparetics whose 
Sher Ae uld be lateralized almost at sight, 
dbyius mey lateralized patients showed no 
No a Sensorimotor disturbance. 
T a made that the Purdue Pegboard 
amaged ecessful in differentiating non-brain- 
with ra patients from patients 
ise of ral lesions. This, except for a few 
did oa one hysteria where the test 
question a ng not a relevant diagnostic 
Validated. n the setting where the test was 
ates Purdue Pegboard test is not a substi- 
Various f large battery of tests assessing 
oes A i of dysfunction. It, in itself, 
A large . er promise for correctly making 
made iro variety of inferences that can be 
a mae such batteries (Reitan, 1962). 
test Shove experience, however, the Pegboard 
Strongly S promise as a screening device. It 
Which A Suggests the superiority of tests 
Verbal ot sensorimotor and minimize 
indicators intellectual operations, as single 
tests hen of brain damage. Such sensorimotor 
ive ina € two obvious advantages: their rela- 
eir ap hae of educational level and 
ility to elicit bimanual samples of per- 


137 


formance which may indicate laterality of 


lesion. 
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EMPATHY, NEED TO CHANGE, AND IMPROVEMENT 
WITH PSYCHOTHERAPY * 


ROSALIND DYMOND CARTWRIGHT 
University of Illinois College of Medicine, Chicago 


AND 


BARBARA LERNER 
Billings Hospital, University of Chicago 


To study the influence of: the patient’s need to change, the therapist’s 
experience level, the empathic understanding of the therapist, the sex of the 
patient and therapist, and the amount of psychological distance between 
them, on psychotherapy improvement, 28 patients in client-centered counseling 
were tested before and after therapy with scales built from the Kelly Role 
Construct Repertory Test. Patients’ intitial need to change was found to 
be directly related to improvement. Therapists’ final level of understanding 
was also directly related. Patient’s need and therapist’s empathy considered 
jointly produced a prediction model for therapy length and success, 2 success 
groups were found: same-sex patients of experienced therapists whose dis- 
tance from him the therapist initially reduced, and opposite-sex patients of in- 
experienced therapists whose distance from him the therapist initially increased. 


There is a growing body of evidence indi- 
cating that improvement in psychoneurotic 
patients takes place concurrent with psycho- 
therapy, but as yet little consensus concern- 
ing the factors responsible for such improve- 
ment. To the majority of writers in this field, 
improvement is a function of specific patient 
variables, therapist variables, and the inter- 
actions among them. However, different 
writers select different factors as accounting 
for the major portion of the variance in thera- 
peutic outcome. Since the research evidence is 
most often unreplicated, or even contradic- 
tory, and since there are still large unexplored 
areas, one can pick and choose among the 
scraps and fashion a patchwork quilt after 
one’s own heart or theoretic commitment. 

Although the various schools of psycho- 
therapy have different views as to what consti- 
tutes the necessary and sufficient conditions 
of the therapeutic process, the studies which 
compare the respective percentages of suc- 
cessfully treated cases have shown rather 
similar results for all theoretic approaches. 
This frequent finding suggests that some 
common elements exist among the various 
approaches to psychotherapy Pasa ae hee 
might very well be more significantly relate 

1 The authors are pied degra ie i A 
considerable aap oor am the critical reading of 
TE ait = Ernest Haggard and Fred L, 
Strodtbeck. 


to improvement than the elements On which 
the schools differ. 
The evidence for two such comm dies 
ments has been accruing slowly. The stu b) 
of Heine (1950) and Fiedler (1950a, 1950) 
strongly suggest that one important theraps 
variable, independent of technique different 4 
is the therapist’s ability to empathic’ i A 
understand his patient. A patient ber | 
which is suggested by the work of Butler 56); 
Haigh (1954), Rosenthal and Frank ( ie A 
Cartwright and Cartwright (1958), im- 
Kirtner and Cartwright (1958), as one 
portant to any form of verbal therapy: 18 E 
patient’s initial recognized need to chant 
These two promising variables, the ther apt 


on ele- 


e 
empathy and the patient’s need to horia 
were selected to form the basic ae t 

eel 


of this study. However, since we now 1, £ o 
therapy depends not only on the qualities -e 
the two participants but also on how te 
affect their relationship, this study attemp of 
to tease out some of the interaction effec 
these two major variables. ith 
Part I of the study was undertaken y 

three primary hypotheses: 


e 

1. The degree of need to change O” ms 
part of the patient is directly related t° 2 

provement with psychotherapy. Pe 

2. The empathic understanding of the Ro 

tient by the therapist is directly relate 

the degree of improvement in the pê 


with psychotherapy. 


tient 
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IMPROVEMENT IN 


3. i 
a The two variables considered jointly 
eich give a better prediction of outcome than 
er taken singly, 


fe cane the data relative to these 
to the dat which had been formulated prior 
concernin a seollection, other questions arose 
ables mith the interaction of additional vari- 
With the a selected for this study and 
art II ee variable, improvement. 
è patient „this paper considers the sex of 
Pist, the « in relation to the sex of the thera- 
dist puts Misha sob distance” the thera- 
e aaki himself and his patient, and 
additional ence level of the therapist. These 
clarify so variables were explored to help 
Underlyin me of the interpersonal processes 
e majo bi results obtained from testing 
Volves th hypotheses. This study, then, in- 
of their e consideration of five variables, some 
e raeno; and their relation to 
Variables ent with psychotherapy. These five 
threefolq can be organized according to a 
epend scheme: (a) patient characteristics 
ecogni ent of the particular therapist 
atact zed need to change), (b) therapist 
lent eristics independent of particular pa- 
(c) {therapist's level of experience), and 
ticular aracteristics dependent upon the par- 
empath Patient-therapist pair (the therapist’s 
in his Y or ability to understand the patient 
nd a terms, whether or not the patient 
thera rapist are of the same sex, and the 
Pist’s distancing of the patient). 


METHOD 


Th ; 

Sought qubjects were 28 patients who voluntarily 
Universi meat at the Counseling Center of the 
Tale, ves of Chicago.? Fourteen of the subjects were 
With a emale. They ranged in age from 19 to 43 
nee ites age of 27.7. Although these patients 
f tapisty ormally diagnosed, according to their 
À OM te, ahaa on “severity of illness” they ranged 
tered T mild” to “near psychotic.” Sixteen client- 
ari f erapists were involved, and the case length 

rom 6 to 116 interviews with a mean of 40. 


2 
dg No selectivity of patients was exercised. Each 
Stugy PPlicant was requested to participate in the 
6 until the quota of 30 subjects was reached. 
tin? Patients had not completed treatment at the 


© data analysis was begun. 


PsYCHOTHERAPY 139 


Instruments 


The two instruments in this study are referred to 
as Scale A and Scale B. Scale A provided the data 
for the improvement criterion and Scale B provided 
the measure of the patient's need to change, the 
measure of the therapist’s empathic understanding 
of his patient, and the therapist's distancing of the 
patient. 

Scale A was administered to the therapists on two 
occasions, after the second interview, referred to as 
the pretherapy rating, and after the last interview, 
for a posttherapy rating. The Improvement score 
was the sum of four components: three change 
scores between the pre- and posttherapy ratings of: 
(a) the patient’s integration; (b) his kind of organ- 
ization (defensive versus open); and (c) his present 
life adjustment. The fourth component of the Im- 
provement score was the therapists’ final rating of 
the outcome of therapy. All ratings were made on 
nine-point scales. 

Scale B differed in content for each subject. Each 
subject in a sense supplied his own items. This was 
done in an effort to insure that the scales would be 
highly personally relevant to each patient. The items 
were the personal constructs obtained by first 
administering Kelly’s (1955) Role Construct Reper- 
tory Test to each subject. From these personal con- 
structs which each patient introduced as constituting 
the important similarities and differences among the 
real people in his life, the first 10 discrete ones were 
selected. Scale B was then made up of the 10 items 
supplied by each patient arranged as five-point rating 
scales. On the first testing occasion following inter- 
view Number 2, the subject made various ratings 
of these items: (a) to describe himself as he is at 
present, and (b) to describe himself as he wants to 
be when therapy has been completed. The sum of 
the squared discrepancies between these two sets of 
ratings was used as the measure of his felt need to 
change on the 10 items of particular significance to 
him. The patient repeated this task at posttherapy 
to again describe himself as he was then. 

Scale B was also the basis for the measurement 
of the therapist’s empathic understanding of his 
patient. ‘After the second interview the therapist was 
given a Scale B form containing the 10 items 
chosen by his patient. He then attempted to describe 
“the patient as he sces himself.” The empathy 
measure was the squared discrepancy between the 
patient's self-description and the therapist’s at- 
tempt to predict the patient’s self-description. The 
therapist repeated this task at posttherapy time. This 
procedure was used to measure the therapist's 
empathic understanding after increased contact with 


the patient. 
RESULTS 


Part I 


Hypothesis 1: The degr 
on the part of the patien 


ce of need to change 
t is directly related 
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TABLE 1 


Mean Scores oN PATIENTS’ NEED TO CHANGE AND THERAPISTS’ EMPATHY 
FOR THE IMPROVED AND UNIMPROVED PATIENTS 


Variables Improved? Unimproved> t af 
Need to change 47.26 20.00 sire 26 
Pre-T empathy (E;) 15.93 16.69 =- 

Post-T empathy (E2) 9.40 14.46 2.24* 26 


to improvement. Table 1 shows that Hypothe- 
sis 1 is strongly supported. 

The mean need to change score was much 
higher for the improved than for the un- 
improved group. 

Hypothesis 2: The empathic understanding 
of the patient by the therapist is directly 
related to the degree of improvement in the 
patient with psychotherapy. Table 1 shows 
that there was no significant difference 
between the improved and unimproved cases 
on their therapists’ ability to understand their 
pretherapy self-image. However, at the close 
of therapy, the therapists understood the self- 
image of the improved patients significantly 
better than they did those who were un- 
improved. Also there was a significant gain 
in the therapists’ empathy score between the 
first (E,) and second (E2) occasions—but 
only for the improved cases. This result 
showed that there was a significant relation 
between improvement and increased under- 
standing. For those who were rated unim- 
proved through therapy, the therapists made 
no significant gain in their understanding 
of them. 

Perhaps the explanation of the relation 
between high posttherapy empathy and im- 
provement is not the obvious one, that the 
therapist coming to understand his patient’s 
own view of himself contributes to more and 
better therapeutic work being done. Perhaps, 
instead of the therapist coming to understand 
the patient, the patient is adopting his thera- 
pist’s view of him. This alternate view might 
well account for the results. Such a change 
would make it easier for the therapist to 
predict how the patient “sees himself” and 


measure, the lower the score the greater the empathy. 


also possibly be valued as a growth in insigh 
making it more likely that the case be <> 
“improved.” We have, then, to test: do ay 
proved cases change so that their postther 
self-description resembles the descrip en 
therapist made of them at pretherapy Jf- 
more closely than their pretherapy 
description did. The chi square test her! a 
significant in the opposite direction. At tions 
therapy, 11 of the improved cases desct!P ists’ 
of themselves were less like the eg ey 
pretherapy descriptions of them than more 
had originally been, and only 4 ia mi 
like them. For the unimproved group t were 
verse relationship was found, 9 cases con 
more like their therapists’ pretheraPy jike 
ceptions of them and only 4 were re cient 
them. Improvement, then, goes with 4 p con” 
change away from the therapist’s early erapy 
ception of him. Is the therapist’s postne pis 
conception of the patient then less lÍ the 
pretherapy self-description and more Ji give” 
patient’s self-conception? The answe! Re for 
by a highly significant chi square, 15 es of 
patients rated improved, The therap cep” 
improved cases have changed their C° nem 
tions of the patients in ways that bring 
closer to the patients’ own view. 
Perhaps the improved cases stay® t tim? 
and so these therapists had more contan ding 
within which to improve their underst? inte 
Actually, the improved cases had few®’ roved 
views (M = 37.33) than the unimP iffet 
cases (M = 43.69, a nonsignificant 
ence). Thus neither a change to ann of 
therapists earlier views nor the oe ound 
contact alone accounts for the relatio y ant 


3 : ra 
here between improvement in the p. 


e was 
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increase in the level of understanding of the 
patient by the close of therapy. 
7 H ypothesis 3: The two variables considered 
jointly will give a better prediction of out- 
come than either taken singly. By taking the 
two variables together, improvement and 
length of treatment, some interesting differ- 
ences were revealed. Dividing the cases at the 
median by length into Long (25 or more 
interviews) and Short (less than 25), and into 
Improved (Improvement score 10 or greater) 
and Unimproved (Improvement score 9 or 
less) four subgroups were formed. Table 2 
gives the means for these groups on the two 
Measures, 

Schematically, then, the relations between 
the variables were as follows: 


5 


Length im- Necd Empathy 
provement to change (E) 
Short improved High High 
Long improved High Low 
Short unimproved Low Low 
Low High 


Long unimproved 


Tn terms of the joint consideration of the 
he therapist’s 


Patient’s need to change and tł : 
“Mpathic understanding, several generaliza- 
‘ons can now be offered. 
. Therapy is short when either of two condi- 
tions obtains. (a) Both the patient’s need to 
“tange and the therapist’s understanding of 
© patient are high. Although the mean 
number of interviews is small, these cases 
Te therapy with high improvement scores. 
The patient’s need to change and the 
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therapists understanding of him are both 
low. These cases leave therapy in an equally 
short time but as unimproved. 

Therapy is long, on the other hand, under 
two other conditions. (2) The patient’s need 
to change is high but the therapist initially 
misperceives him. The high degree of patient 
motivation to change seems to be sufficient to 
keep him in contact long enough for the 
therapist to correct his misperception and 
come to see the patient in his own terms 
(E; = 20.00, E2 = 8.57). These are long term 
cases but do eventually leave improved. 
(b) The therapist understands the patient’s 
self-conception but the patient doesn’t really 
want to change. The high understanding 
probably has sufficient reward value to keep 
the patient in contact but without the internal 
pressure to change or recognition that change 
is possible, he eventually leaves therapy 


unimproved. 


Part II 

Having reached this stage in the analysis, 
the study might well have been concluded. 
However, further questions kept occurring 
for which some clarification might be reached 
by searching the available data in new ways. 

The finding that therapists varied in their 
level of understanding of their patients’ self- 
conceptions after an exposure of only two 
interviews was not surprising. Nor was it 
surprising to find that some therapists, whose 
first attempts at understanding their patients 


were pretty wide of the mark, increased their 


TABLE 2 
IMPROVEMENT SUBGROUPS 
-pApIsTS’ EMPATHY FOR Four 
Patients’ NEED TO CHANGE, THERAP = 
| i Therapists’ Empathy 
bagi Therapists’ 
Patients n E: change 
Need to change Pi Brh 
pa k Rank Va 
Grou M Rank M Rank 4 Bee 
ip A P 
10.12 2 2.2 
Short 12.37 1 
I 5 1 
N= proved 56.62 , am , ea 
ae Improved 36.57 2 20.00 P 
< 3 18.16 4 1. 
Short Unimproved 17.66 4 19.83 
es: 00 2 11.28 3 2.7 
ong Unimproved 22.00 3 cs 
= 7 
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TABLE 3 
Sex oF THERAPIST-PATIENT PAIRS AND EMPATHY, ĪMPROVEMENT, AND DISTANCING 
Variables Same Sex* Opposite Sex? t af 
E: 20.21 12.35 2.278** 26 
Ee 12.00 11.50 — 
Eiz 8.21 85 1.277" 26 
Improvement 10.00 9.35 — 
Distancing + 5.21 — 6.42 2.211** 26 
aN =14. 
bN =14. 
+p < .10. 
p < 05. 


understanding to a high level by the time 
therapy had drawn to a close. However, since 
there was a strong relationship between the 
therapist’s ability to understand his patient’s 
self-conception at the close of therapy and 
the outcome of the case, there was interest 
in discovering what characteristics of the 
relationship made the initial understanding 
either easy or difficult, and what character- 
istics were related to an improvement in 
understanding through time. The first of these 
characteristics to be investigated was the sex 
of the therapy pair. 


Sex 


Therapists obtained significantly higher 
empathy scores on the first occasion with 
patients of the opposite sex than with pa- 
tients of the same sex. By the time therapy 
had been completed this difference no longer 
held. It seemed that the therapists had more 
initial difficulty understanding patients of like 
sex than of the opposite sex but that this 
handicap was overcome with time. (Thera- 
pists’ empathy for clients of like sex did 
increase significantly t= 2.37, p< .05, df 
13.) Perhaps therapists at the beginning of 
their contacts with patients of the same-sex 
err in understanding by assuming that they 
are more like themselves than is warranted. 
This assumption of similarity would be less 
likely to occur with patients of the opposite 
sex, leaving the therapist freer from a projec- 
tive set and more open to discovering how 
it is that the patient views himself. 


Distance 


In order to test the suggested explanation of the 
effect of the sex pairing on empathy, a distance 
measure was devised. First the actual similarity in 


the ratings that the therapist and patient each ma 
to describe themselves was calculated. The squa o 
discrepancy between these was called a measure 
their “Real Similarity.” Next the similarity © 
therapist’s description of his patient and of 
was calculated. The squared discrepancy b ist's 
these two was called a measure of the thera ph e 
“Assumed Similarity.” The difference between 3 
Real and Assumed similarity scores was used eo 
measure of the distance the therapist placed pem 
himself and the patient. If he “assumed” Bion 
similarity than was “real” he was erring pre 
logically by bringing the patient closer to eee 
than was represented by the reality distance betwe 
their two self-ratings. These errors were scored 

a positive sign and are referred to as errors 1? ist 
direction of reducing the distance. If the therr i 
assumed less similarity than was real be © 

in the direction of putting the patient farthe 
him than corresponded to the real difference 1” tive 
ratings. These errors were scored with a neg? the 
sign and referred to as errors of increasing 
distance. 


etween 


Therapists on the first testing occasio® 
reduced the distance with same-sex pati 
and increased the distance with opposites 
patients. This tendency would seem to help ial 
account for the finding of poorer er 
empathy with same-sex than with oppo at 
sex patients. It appears from Table 3, ts 
therapists potentially can understand patie? 7 
of either sex equally well (Eg 12.00 and at 
11.50) but that the initial assumption t 
the self-images of same-sex patients We, 
more like their own than in fact they “pe 
temporarily delays the full operation © also 
empathic capacity. Although therapists “ 7. 
err initially with patients of the opposite een 
by overemphasizing the differences bima 
them, this negative distancing does not ore: 
to interfere markedly with the empathy ctl 
That is to say, therapists may et 
perceive the patient as very different 
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jp iasel and still perceive the way he sees 
imself reasonably accurately. 


Di 
tstance, Sex, and Improvement 


a NOW what is the effect of this initial dis- 
a ee of same- and opposite-sex patients 
sho eir eventual therapeutic gain? Table 4 
of a that the therapist’s initial distancing 
4 oth same- and opposite-sex patients who 
pee judged improved at termination was 
Ent more extreme than his distancing 
th those who were judged unimproved. Al- 
a gh both improved sex groups were subject 
large distancing errors, these were in op- 
haite directions. The same-sex patients who 
poved in therapy were initially seen by the 
r erapist as more like himself than their own 
atings would suggest. This reduction of dis- 
‘i seems to imply an immediate emotional 
ee of these people. In contrast, the 
ed patients who were subsequently 
at €d as unimproved were held off emotionally 
the begining of therapy and seen by the 
rapist as more different from him than 
eir own ratings placed them as being. For 
re, ©pposite-sex patient the relationships were 
ersed, The opposite-sex patients who im- 
Proved in therapy were early seen by their 
ae rapist as being very different from him— 
t Ore different than their self-ratings showe 
Hen to be. This viewing the opposite-sex pa- 
aie as very different seemed to imply a 
to Ssification of, and insistence upon, the = 
ath distinction. The cross-sex patients who 
tiled to improve were not seen by the thera- 
oa as occupying a pattern distinct from hi 
Wn. They were seen as being about as muc 


like the therapist as their self-ratings showed 
them to be. 

The really big difference was between same- 
and opposite-sex patients who improved. 
Same-sex patients who improved were early 
accepted by the therapist as very like him and 
opposite-sex patients who improved were 
early seen as very different. 


Experience of the Therapist 


The effect of the experience level of the 
therapist on distancing, empathy, and im- 
provement was investigated next. Eight of 
the therapists were classified as more experi- 
enced, on the grounds of having handled more 
than five research cases prior to the present 
one, and eight were classified as inexperienced, 
having treated fewer than five. For five of the 
therapists the present case was the first one. 
The experienced therapists made distancing 
errors in both directions with about equal 
frequency. They were as likely to err in the 
direction of seeing more similarity than 
actually present as they were to see less. The 
inexperienced therapists, though, had a sig- 
nificant bias in the direction of the negative 
errors. Inexperienced therapists tend to see 
patients as less like them than the patients’ 
own views show them to be. Perhaps what 
one learns as a result of experience in doing 
therapy is that nothing human is really 
foreign to us. Or perhaps it is that the new 
therapist is more open to threat and more 
easily made anxious by seeing patients as 
similar to himself and so does more defensive 


distancing. N $ 
Now to relate these various findings to 


TABLE 4 


ncInNG OF SAME 


. AND OpPpost 


TE-SEX PATIENTS ON THE 


ciara JupPROVED AND UNIMPROVED CASES 
Distancing 
5 A 3 7 3 
Sex pairi Improved Unimproved if 
ae pairing = 4 = r : 
Same nex aa +14.00 -6.50 
sex N=10 N a eh d fe 
Opposi —11.85 =y 
pposite sex T me 
; 4.649 am 
a 14 
p .001 ns 
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improvement with psychotherapy, it appears 
that improvement was rated high for two 
groups of patients: same-sex patients treated 
by experienced therapists, and opposite-sex 
patients treated by inexperienced therapists. 
In both groups the therapists’ posttherapy 
level of empathy (E2) was high and the 
original distancing of the patient by the thera- 
pist extreme. The experienced therapist 
achieved high empathy and improvement with 
same-sex patients with whom the psycho- 
logical distance was immediately reduced and 
the inexperienced therapist achieved high 
empathy and improvement with opposite-sex 
patients with whom distance was immediately 
increased. 
DISCUSSION 


The study shows that psychoneurotic pa- 
tients have the best chance for a successful 
treatment experience with nondirective ther- 
apy, if they come to it with a high need to 
change, and meet a therapist who can accu- 
rately understand the way they see them- 
selves. Some patients meet with therapists who 
have difficulty perceiving them in their own 
terms. If they have sufficiently high motiva- 
tion to change they continue in therapy, and 
the therapist does come to understand more 
accurately how it is they see themselves. 

For the therapist to improve his under- 
standing in this way he must have more 
access to material relevant to the task. Pre- 
sumably he misperceives originally because 
of barriers to this material, either in himself 
which distort his perceptions, or because of 
barriers in the patient which prevent him 
from revealing himself accurately. Barriers of 
both kinds are, in all likelihood, related to 
defenses against the potential threat involved 
in conscious recognition. If threat is reduced 
in the patient he would be more able to 
communicate personally relevant material to 
the therapist. This in turn would enable the 
therapist to base his view of his patient’s self- 
image on a deeper understanding of him. If 
threat is reduced in the therapist he would be 
freer to perceive the patient’s communications 
without distortions. , 

It would seem that the inexperienced thera- 

pist is the more open to threat in this situa- 
tion than is the experienced therapist, and, 
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in truth, it is he who tends to increase the 
distance with his patients. This in effect denies 
some of the similarity between himself and 
his patients which can be interpreted as 2 
defensive maneuver to reduce his threat. This 
distancing might well be experienced by his 
patients as a message to the effect that “you 
are really very different from me.” Such i 
message would probably raise the threat ge 
of the same-sex patients but reduce it oF 
opposite-sex patients. The experienced sie 
pist who has less personal threat nae 
distance between himself and his eee 
patients and conveys by this, “you are really 
much more like me than you think you ona 
This message from a prestigeful person of a 
same sex probably reduces the patient’s oe 
level. This tentative explanation would = 
count for the high improvement and final le n 
of empathic understanding of the same-5 = 
patients treated by experienced therap's 
and of the opposite-sex patients treate 
inexperienced therapists. 
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108 college women wrote at 2 sittings, 
g reliability data, 
ng stories from memory the other. Memory 


instructions both times, providin 
instructions one time, drawii 
sources were varied in 

were used in separate tesi 
no difference diagnostically 
from memory. When corre 
set and session number were 
or zero under all conditions, 
than Dependency rs. The da! 
for individual diagnosis. 


the Se (1961) has recently stated that 
Use rm general assumption underlying the 
is pre Projective testing is: “If an individual 
io a stimulus situation per- 
respons variable responses, the particular 
Cterj ses that he emits will reflect his char- 
een response patterns and tendencies 
that ease (p. 146). Although he recognizes 
Projects. are many sources of variance 1n 
implie lve test responses, his assumption 
reliable that projective testing can lead to 
only i and valid diagnosis of individuals 
Ons i the extent that intraindividual ie 
nditi consistency occurs, under specifie 
ions of data collection, as demonstrated 
lo Uitably designed research. Empirical 
hie a of Lindzey’s general assumption 
stidis woe a very wide variety of actua 
ee of which the present is one example. 
Cotister here concerned with intraindividual 
Depe ency in Aggression scores and in 
‘hes oe scores from TAT protocols, 
Vari the test is administered with specifie 
ations in instructions. The particular 


has been improved 


Co 


1 
T 
thro he final draft of this paper 


Sechren comments by D. T. Campbell and_ 
Final st for which we wish to express appreciation. 
dy and the opinions 


Cxpre, sPonsibility for the stu 


Ssed rests, of course, with the authors. 


2 subconditions. Aggressi 
ts of the clinically assumed hypothesis that it makes 
whether S is being creative or drawing stories 
lating scores from 2 types ©: 

balanced. Intraindividual consistency is low 
despite reliable scoring. Ag 
ta do not justify using the 


either following the usual (creative) 
or following “creative” 


on and Dependency scores 


f instructions, card 


gression rs are larger 
TAT as scored here 


sets of instructions were chosen in order to 
explore the proposition that remembered 
stories are subjected to the same motivational 
influences as are fantasied (creative) stories. 
Of course TAT testers typically do not know 
whether a given story is one which the 
subject remembers from a book, movie, or 
other source; oF whether it is a more 
genuinely creative product. Nevertheless, they 
seem to feel that this lack of knowledge is 
not a serious handicap, since their scoring 
and interpretive procedures involve no sys- 
tematic attempt to identify the sources of 
the subjects’ stories. Amazing as it may 
seem, an empirical test of this practically 
important example of the general assumption 
of response consistency has not heretofore 
been attempted. 

The findings pertinent to the main hy- 
pothesis concerning “creative” and “memory” 
influences must be evaluated against the test- 
retest reliability of the TAT when it is given 
under the usual creative instructions com- 
monly employed in clinical work. Accord- 
ingly, suitably instructed control groups are 
included in the present study. In view of 
the astounding scarcity of published reli- 
ability information for the TAT, these groups 
give data which are interesting and valuable 
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in themselves, as well as providing control 
data for the present experiment. 

In operational terms, the hypotheses tested 
in this study are: PP 

Hypothesis 1. There is a significant positive 
correlation between Aggression scores (or 
between Dependency scores) obtained from 
TAT protocols written by the subjects at 
one sitting and from protocols written by 
the same subjects at a second sitting ap- 
proximately a week later, when both sets of 
stories are written under the usual (creative) 
instructions. 

Hypothesis 2. There is a significant positive 
correlation between Aggression scores (or 
between Dependency scores) obtained from 
TAT protocols the subjects have written 
under the usual (creative) instructions, and 
from TAT protocols consisting of stories 
which the subjects choose from memory. 
This correlation will be as high as that 
between two sets of protocols by the same 
subjects done under creative instructions. 

In testing Hypothesis 2, two kinds of 
sources for the “memory stories” were used 
by different groups of subjects: stories pro- 
vided by the experimenter at sessions prior 

to those in which the subjects wrote their 
TAT stories, and stories which the sub- 
jects chose freely from their own personal 
memories of movies, books, short stories, 
case history materials, etc. 

From the clinician’s viewpoint, the more 
immediately relevant results come from the 
groups who freely chose their memory stories, 
because the situation more closely resembles 
what the subjects may do in the clinical 
testing situation. 

On the other hand, when the subjects are 
provided with a restricted, controlled set of 
possible memory stories, this means that 
they have all had relatively comparable in- 
fluences of immediately past learning upon 
their choice of memory stories, as contrasted 
to the situation in which each subject draws 
her memory stories from her own unique set 
of readings, movie viewings, etc. Intraindivid- 
ual consistencies which might be relatively 
obscured by sources of variable error in the 
“free choice” situation might appear more 
strongly in the situation where opportunities 
for choice of memory stories are made more 
comparable across all subjects. 
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Unfortunately, even the strongest trends 
in our results show disappointingly weak 
tendencies toward intraindividual consistency 
in Aggression and Dependency scores of p 
sort employed in this experiment. No ah 
port is given those who assume that suitably 
reliable discriminations among individuals can 
be made under any of the conditions used in 
this study. 


METHOD 


P r were 
Stimulus materials. Twelve TAT cards 


chosen and subdivided into two sets of six a 
called Set A and Set B. In the absence of publis to 
norms which would enable the experimenter: a 
choose two sets of cards comparable in their "ets 
ing power for Aggression and Dependency, z the 
were formed on the basis of the judgments 0 this 
two authors who are clinicians. Because roxi- 
method of equating card sets was only app for 
mate, the design of the experiment allowe E 
counterbalancing any effects of inequality oey 7B, 
the sets. Set A consisted of Cards 1, 3BM, "3G 
14, and 17BM. Set B consisted of 3GF, 7GF; 
13B, 17GF, and 20, ‘ned OF 
For each card three stories were obtaine two 
specially prepared. In the judgment of re cture 
authors who are clinicians, one story for each pe 
represented a strongly aggressive trend, tral 5° 
strongly dependent trend, and one was neu a i 
far as aggression and dependency were concern 
Two mimeographed booklets were prepare ’ con- 
each for Card Sets A and B. Each booklet vies 
tained 18 stories, one to a page, with the $ ran- 
pertaining to the various cards intermingle 
domly throughout the booklet. ds of 
Subjects. Results are based on the recor were 
108 female beginning psychology students WhO pro- 
divided into 10 groups, as described in the 
cedure Section and in Table 1. ether 
Procedure. Groups I and II were called tr Ses- 
in separate group meetings first (called Pre-' ipat 
sion). They were treated identically except Set 
Group I used booklets of stories prepared 
A, while Group II used the booklets of 
Prepared for Set B. In the Pre-1 Sessions, 
and II subjects received the following instrue t 
no 
When the experimenter tells you to do 5° a 
now), you are to open your booklet to etl 
one and read the story on that page 0 A 
while she reads it aloud to you. Concentr story? 
much as you can while she is reading the ry 
because your job is to remember the St? you 
well as you can until our next meeting: of 
don’t need to try to remember the wore ain 
the story, but pay special attention to fara 
events the story describes, what the ple ow 
think and feel, and the outcome, or proba ter 
come, described. As soon as the experimen a tell 
read the story on page one with you, she ea this 
you to turn to page two. She will then ™ once 
story to you while you read it silently. 
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TABLE 1 


DesiGN OF EXPERIMENT 
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Session 1 


Session Pre-2 


Session 2 


Group Session Pre-1 

I Read memory stories 
for Set A 

II Read memory stories 
for Set B 
Ill elt 
IV = 
y = 
gi” == 
Tir’ = 
Iv’ ee 
Vv = 
VI = 


Write memory stories 
for Set A 


Write memory stories 
for Set B 

Write creative stories 
for Set A 

Write creative stories 
for Set B 

Write “free” memory 
stories for Set A 

Write “free” memory 
stories for Set B 

Write creative stories 
for Set A 

Write creative stories 
for Set B 

Write creative stories 
for Set A 

Write creative stories 
for Set B 


Read memory stories 
for Set B 


Read memory stories 
for Set A 


Write creative stories 
for Set B 


Write creative stories 
for Set A 


Write memory stories 
for Set B 


Write memory stories 
for Set A 


Write creative stories 
for Set B 

Write creative stories 
for Set A 

Write “free” memory 
stories for Set B 

Write “free” memory 
stories for Set A 


Write creative stories 
for Set B 


Write creative stories 
for Set A 


N : 

h toe The subjects wrote TAT 

emoy rite stories selected from a 8 
source they wished to use. 


tion; 


t P 

e on this story so that y 

a things about it as you di 
Page one. We shall procee 


ne 18 stories in the book. Your jo 
to try to remember the main P 


Stories į 
Ories just as well as you can. 


patter a brief pause the 


them 
a thir to remember it. Finally, 


d silent reading in still a difi 


D thas 
their next sessions. 


n k š 
whieh sessions called Session 


e 
Was to use, a written copy of 


Writi 
limi $ TAT stories, and a watc 
Story herself to 7 minutes for Wr 
Ypie For Session 1, the direction: 
A ally used in administering th 
the subject was told to choosi 


a 
heey from the booklets she 


Pre-Session. She was told t 


Modif 


stories under varyin 
roup of stories they hi 
For full explanation see text. 


ou can remember the 
d about the story 
d in this way through 


ro! subjects read thi 
differs again, this time by themselves an 
Sending order from that of the 
each ng 1 minute on each story 
story in any way they thous! 
they gave ri 
ferent order, taking 
Shy eae per story. At no time W 
Y information that TAT pictures 


1 and Session 2, 


Sih e spaced about a week apar 
ee a week after the Pre-1 Session, 
ject y groups were treated individ 
Was seated at a table, given 


ually. 
the set of cards 
the directions for 
h with which to 


had read 
o improvise oF 


y if she could not remember clearly the 


b will always 
oints of the 


e stories 
d in a 
first reading, 
and studying 
ht would help 
the stories 


ere they given 
would be used 


t and begun 
members of 
The sub- 


iting of each 
s were those 
e TAT, except 
for each picture 
during 


story she thought most 
During Session 2, the dire 
same as for Session 1 
on being creative an 
from the mimeograp! 
sources such as mo 

Members of Groups I 
ually for Session 1, an 
Card Sets A and B resp 


(creative session) 
no mention was ma 
mimeographed booklets, 
not had experience wit! 
session called Pre-2 Sessi 
JII and IV met in separa 
stories for Set 
manner as descril 
Groups I and II. Of 
III and IV cou! 
be forthcoming in tl 
of Groups III and I 
identical with that o 
during Session 1. 

It is obvious that when motivational scores from 
are correlated with motivational 
ive stories across all 48 subjects 
the factors of card set and ses- 
ble between the two 


memory stories 
scores from creat! 
in Groups I-IV, 
sion number are compara 
arrays of scores correlated, 


the usual (creative) instruction, (b) with instruc- 


ad studied previously, D 
Sets A and B are two sets of six 


g conditions: (a) with sual in K ( 
(c) with instructions to write stories based on any 
cards each. 


appropriate to the picture. 
ctions for writing were the 
except that stress was laid 
d avoiding stories remembered 
hed booklets or from other 
vies or magazines. 

II and IV were seen individ- 
d wrote creative stories for 
ectively. The directions and 
procedure were identical for those of Session 2 
of Groups I and II, except that 
de of avoiding plots from the 
since these subjects had 
h the 
on, members of Groups 
te groups and read the 
A and B, respectively, in the same 
bed for the Pre-1 Session of 
course the subjects in Groups 
Id surmise that TAT cards would 
heir next session. The treatment 
V subjects in Session 2 was 
f Group I and II subjects 


booklets. For the 
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Groups I’ through IV’ were treated just as 
Groups I through IV, with the following important 
exception: in Groups I’ through IV’ no story- 


reading presessions were held. Instead, for the 
sessions in which the subjects were to produce 
memory stories, they were instructed to base 


their stories on something they temembered from 
real life, movies, books, case history materials, etc. 
The exact instructions given to these subjects were: 


We have said that this is a test of creative 
imagination. However, we should like to add the 
following requirement: Try to use, as a basis for 
each story, some situation or “plot” which you 
remember from a novel, short story, play, radio 
or TV drama, real life case history, or news 
story. You may write the story pretty much as 
you remember it, or you may modify it if you 
feel that is desirable in order to make it more 
appropriate to the picture than the literally 
remembered story would be. And, if you don’t 
remember very clearly the situation you are 
drawing on as a source for one of today’s 
stories, take what you do remember and use it as 
a point of departure in writing the story. The 
point is to base the story for each picture not 
on sheer imagination, but on some human situa- 
tion you have heard about before today... . 


In this Case, too, when motivational scores 
from memory stories are correlated with motiva- 
tional scores from creative stories across all 36 
subjects in Groups I’ through IV’, the factors of 
card set and session number are comparable be- 
tween the two arrays of scores correlated, 

Groups V and VI were treated like the other 
groups, except that they were given only creative 
instructions at both Sessions 1 and 2, as follows: 


Your task will be to tell as dramatic a story 
as you can for each picture, Tell what has led 
up to the event shown in the picture, describe 
what is happening at the moment, what the 
characters are feeling and thinking; and then 
give the outcome. You may find yourself 
wanting to use plots from movies, books, 
magazines, or radio or TV dramas. Please avoid 
that—be just as creative and original as you can 
possibly be. 


Group V wrote stories in response to Card Set A 
in Session 1, and to Card Set B in Session 2, 
while the reverse order of card sets was used for 
Group VI. Thus, when motivational scores from 
Session 1 are correlated with motivational scores 
from Session 2, across the 24 subjects in Groups 
V-VI combined, the factors of card set and type 
of instruction are comparable for the two arrays of 
scores correlated. , 

Before the experiment began, the subjects had 
been admonished to keep all aspects of the pro- 
cedure completely secret, and postexperimental in- 
terviews revealed that they had apparently co- 
operated in this to a very high degree throughout 
the experiment. Each subject was asked to write 
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her idea of the purpose of the experiment and ne 
subject had guessed it correctly. Prior to the ma 
ment the subjects had been assured their anony a 
would be preserved by a system of code numbers q 
their stories, and after their last session they ee 
shown that all handwritten records were type 
before being scored. 


RESULTS 


Each story from Groups I through IV 
typed on a separate page, together with i 
code number, decipherable only by the firs 
author who did not participate in the a 
The two authors who are clinicians took I; 
of these stories at random and independent y 
applied Aron’s (1949) scoring scheme for tw 
variables, Aggression and Dependency- or 
both scorers left a scoring unit unscored; ne 
both scored it for the variable under A 
sideration, this was counted as ——— 
Out of 306 total possible agreements, 
were 79% agreements on Aggression nn 
88% on Dependency. This was not se 
sidered satisfactory, so the judges scored E 
discussed some stories not being used in 
study, and then independently scored in 
more stories selected randomly from ae 
the study. On this second reliability. wet 
there was 91% agreement on Aggression a 
of 368 scoring units, and 93% agreem is 
on Dependency. It was decided that ry 
degree of reliability would be satisfactory? 
so the remaining stories from Groups en 
through VI were divided at random betwe 
the scorers. The final data for Groups, 
through VI are based on the combined ju oy 
ments of the two scorers on the stories they 
had both scored, and on the separate mi e 
ment of one or the other of the scorers 0” 
remaining stories. 

Data from Groups T through IV’ ie 
collected after the first six groups, a nese 
Psychology graduate student scored we 
stories, after practicing on stories of wae as 
I through VI until she achieved at leas the 
high independent agreement with each O yith 
original scorers as they had achieved nett 
each other. (A split-half reliability © ach 
was later made, within each session 0 vary 
group, and split-half relabilities did m0 jeak 
systematically as a function of group, "° op- 
ing that creative-memory correlations 
tained from Groups I’ through 


re 


z 
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safely be compared with those obtained from 
Groups I through IV.) 

Since there were marked differences in 
story length, each story was given a score 
obtained by dividing the number of scoring 
units judged as Aggressive (or Dependent) 
by the number of scoring units in the story. 

hese proportion scores were summed for 
each subject for her six creative stories and 
for her six memory stories, and the two 
Sums were used in testing the hypotheses, 
With respect to the motivational variable 
under consideration. 

Table 2 presents the findings relevant to 
the hypotheses. 

Hypothesis 1 is confirmed by the fact that 
data from Groups V-VI yielded a positive 
Correlation between Aggression scores from 
the two sets of stories, both of which were 
Written under creative instructions; and a 
Positive correlation between Dependency 
Scores in the two sets of stories, both written 
Under creative instructions. The correlation 
or Aggression is significant between the .05 
and .02 levels, on a two-tailed test; but the 
Correlation for Dependency scores reaches 
the .05 level of significance on a one-tailed 
a only, While these levels of significance 

fice to confirm Hypothesis 1, the small size 
S the coefficients gives no support to the 

lea that one may reliably use sets of six 
TAT stories for individual diagnosis of Ag- 
8ression or Dependency, even when the card 
Sets are chosen specifically for their supposed 
relevance to these needs, and the stories are 
Written under a procedure rather closely 


TABLE 2 


N MOTIVATIONAL ScoRES FROM 
UNDER IN- 


0 DRAW 


Corretarions BETWEE! 
Wo Sets or TAT Srortes WRITTEN 
STRUCTIONS TO WRITE CREATIVELY, AND T 

on Memory SOURCES 


Cie y= 
Group N Aggression Dependency 
Memory versus Creative 
HIV 48 .26* .06 
IAW 36 37 —.03 
Creative versus Creative 
V-VI 24 434 38* 


i a x + 
tx bS -05; one-tailed test. 
PLS .02; one-tailed test. 


resembling the usual clinical testing pro- 
cedure. 

In examining the results pertinent to Hy- 
pothesis 2, one needs to consider Groups 
IHV as a combined group, and Groups I’-IV’ 
as another combined group. This is so be- 
cause the memory stories wrtten by the 
subjects were drawn from restricted sources 
in Groups I-IV, while the subjects in Groups 
J-IV’ were instructed to draw freely on their 
personal memories when writing their mem- 
ory stories. Table 2 shows that, for Groups 
I-IV’, the correlations between Aggression 
scores from creative and memory stories 
is significantly positive, as Hypothesis 2 
predicted. This 7 is almost the same size as 
the r found in Groups V-VI between Ag- 
gression scores from the two sets of creative 
stories. Thus, the introduction of a different 
set of instructions (to draw stories from per- 
sonal memories) did not seem to lead to any 
lower intraindividual consistency than is 
found in two sets of stories written under 
the usual creative instructions. Nevertheless, 
the intraindividual consistency again falls 
short of that needed for reliable individual 
diagnosis. 

With respect to the Dependency scores of 
Groups I-IV’, the results are different. Here 
there is an insignificantly negative correlation 
between scores from memory stories and 
creative stories. There is no support for the 
hypothesis that intraindividual consistency in 
motivational influences occurs across two 
sorts of processes: choosing and writing a 
story from one’s own memories and creating 
a story imaginatively. 

When one examines the data from Groups 
LIV, one finds that Aggression scores from 
creative stories correlate +.26 with Aggres- 
sion scores from memory stories. Although 
this 7 is in the predicted direction, it is barely 
significant at the 05 level on a one-tailed 
test. It is not significantly lower than com- 

arable 7s from Groups I-IV’ and Groups 
V-VI; and like the other rs it is obviously 
far too small to warrant reliable individual 
diagnosis. For the variable Dependency, the 
correlation between creative and memory 
stores for Groups I-IV is insignificantly 
positive (7 = +.06). 

In short, for both Aggression and Depend- 


150 R. C. Wyre, B. D. Sisson, AND E. TAULBEE 
TABLE 3 
INTERNAL CONSISTENCY CORRELATIONS FOR MOTIVATIONAL SCORES witHIn SETs oF Six TAT 
i STORIES WRITTEN UNDER INSTRUCTIONS TO WRITE CREATIVELY, AND TO 
DRAW ON MEMORY SOURCES 
Aggression Dependency 
Group N Creative Memory Creative Memory 
I-IV 48 Ags —.03 63*** = 
Ty! 36 3g*** 35** 46*** .26 
V-VI 24 —.03 (Session 1) — -27 (Session 1) =< 
-11 (Session 2) -11 (Session 2) 
Note.—Values are Cronbach alphas, corrected by the Spearman-Brown formula. - 


** p < .02; one-tailed test. 
+*+ p = .01; one-tailed test. 


ency, the creative-creative correlations are 
significantly positive, even though too small 
for individual diagnosis. Intraindividual con- 
sistency is also apparent in the creative- 
memory correlations for the variable Aggres- 
sion, but for the variable Dependency intra- 
indvidual consistency disappears when com- 
parisons are made across the two differing 
situations: writing from memory sources and 
writing stories from creative imagination. 

How can one interpret the fact that the 
obtained rs are low or essentially zero? There 
are several obvious possible sources of intra- 
individual inconsistency within the experi- 
mental situation. First of all, it may be 
important that the two scores correlated 
were based on sets of stories written about a 
week apart. There are many chances for 
varying influences to have affected individual 
subjects during the elapsed time and at 
either of the two specific occasions when they 
wrote their stories. Then there is a pos- 
sibility that a particularly large amount of 
intraindividual inconsistency is associated 
with one or the other of the card sets used. 
Or perhaps the failure to get high correlations 
between creative and memory stories is due 
to intraindvidual inconsistencies associated 
with one or the other of the types of instruc- 
tions used. 

Table 3 presents internal consistency co- 
efficients for each of the three combined 
groups as a function of type of instructon 
(creative or memory). The values given are 
Cronbach alphas, computed by the method 
described in Guilford (1954, p. 385). This 
type of coefficient, being a function of all 


possible split-halves, is independent ae 
chance goodness of selection of test E “ 
Between any two data arrays corre n K 
session number and card set are compara no 
Inspection of Table 3 shows that there rail 
obvious tendency for intraindividual © ; 
sistency to vary as a function of YP rity 
instruction, with the exception that reliaD es 
is always zero for scores on memory sto by 
drawn from the pool of stories provide 
the experimenter (Groups I-IV). sible 
To examine more minutely the Po oric 
role of instructions, a split-half tetrac 
r was estimated within each session O 
of the 10 groups using the method des¢ 
in Edwards (1954). This yielded 2071 V onde 
for Aggression, and 20 r, values for Dep ee 
ency (data not tabled). Obviously ased 
coefficients are unstable, since each is 
on an W of only 9 or 12 cases. However pal 
20 correlations from a given motivat 
score (e.g., Aggression) can be divide e of 
visual inspection into two arrays, by tYP pet 
instruction, with card set and session nop rt 
comparable between the two arrays Her a 
values. When this is done, it appears rapal 
slightly larger number of positive SP Tories 
coefficients come from the sets of § than 
written under creative instructions truc- 
from stories written under memory Mojther 
tions. No significance tests seem to be © 


ribe 
alues 


warranted or appropriate. of the 
Returning to the possible influence half 7 
card sets, one can divide the 20 split-h®. 19 


values for a given motivational scot’ | od 
two arrays, associated with Card oat d 
B respectively. Now type of instruc 
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session number are comparable between the 
two arrays of rẹ values. Visual inspection of 
the two arrays makes it obvious that intra- 
individual consistency is not a function of 
card set. 

Finally, one can divide the 20 split-half r 
values for a given motivational score into 
two arrays, associated with Session 1 and 
Session 2, respectively. In this divison, type 
of instruction and card set are comparable 
between the two arrays of 7 values. Here, 
too, visual inspection of the two arrays makes 
it obvious that intraindividual consistency is 
Not a function of the session number at 
which the stories were written. 


DISCUSSION 


The most striking finding of the study is 
the Low degree of intraindividual consistency 
found in any of the conditons explored. This 
finding cannot be laid to unreliable scoring, 
Nor to an arbitrarily unfortunate choice of 
Split halves, in the case of the intrasession 
Correlations, 

f It appears that drawing memor. 

from an artificially prepared pool depresses 
Mtraindvidual consistency among the mem- 
ory stories wrtten at a single sitting. How- 
ever, these coefficients are no lower than 
Some of those based upon sets of creative 
Stories written at a single sitting. When the 
Subjects draw their memory stories from their 
own reading or other experiences, the degree 
f intraindividual consistency is not sig- 
nificantly lower than that occurring within 
most of the sets of creative stories. Thus we 
Cannot reject the clinician’s implicit hy- 
Pothesis that characteristic individual dif- 
erences influence free choice of memory 


Stories to the same degree as they influence 


Subjects? i i tories. The 
m nely creative S J 
ore genuinely o little 


rouble is, of course, that there is $ : 
Ntraindividual consistency within any 0 
€ sets of stories written at a single setting. 

ince the intrasession correlations for Ag- 
Bression are low, and in some cases insig- 
icant, it is surprising to find that all E 
Ntersession correlations for this variable 
ach minimum significance levels. Perhaps 


€ significant intersession 7S may be due to 


y stories 
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greater similarity between card sets than 
within card sets, but neither our data nor 
published information permit us to check this 
idea. 

In contrast to the findings on Aggression, 
the creative-memory correlations for Depend- 
ency are insignificant. The unexplainable fact 
that Aggression and Dependency variables 
yield dissimilar outcomes implies the need 
for caution in generalizing from limited 
research on the TAT. It may turn out that 
generalizations concerning intraindividual 
consistency must be quite specific to the 
scoring variable used. 

All in all, the results seem to be a specific 
case in point for Campbell’s (1960) recent 
statement. 

The scientific evidence justifying the introduction 
of projectives was solely the evidence of the 
“failure” of the structured approaches and not in 
the least evidence of the superior validity of the 
projectives. . - + Now belatedly projective tests 
are being checked in ways similar to those that 
invalidated the structured tests, and the evidence 
for projectives looks even worse (p. 551). 


Since clinicians do in fact use the TAT 
for individual diagnosis, it is not appropriate 
to rationalize or evade such evidence of weak 
or nil intraindividual consistency as this 
study provides. Instead, much more research 
of the sort presented here should be done, in 
order to see whether such weak tendencies 
toward intraindividual consistency are char- 
acteristic of various TAT scores for varying 
card combinations and subjects. 
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EVIDENCE REGARDING THE EQUIVALENCE OF IPSATIVE 


AND NORMATIVE PERSONALITY SCALES 


ALFRED B. HEILBRUN, Jr. 


University of Iowa 


The equivalence of ipsative and normative personality measures was studied 
both with regard to interscale correlation and relative validities. Need achieve- 
ment and nurturance were measured by normative check lists, forced-choice 
ipsative scales, and ipsative Q sorts, composed of identical items and 
administered to the same 197 college Ss. The interscale correlations ranged 
from 43 to .72, the highest correlations being between the normative and 
Q sort measures. Only the forced-choice ipsative measure of need achieve- 


ment related significantly to college GPA with ability partialled out, whereas 
the normative scale of nurturance showed the only reliable relationship with 


number of charitable activities. 


The results were interpreted as supporting 


the use of ipsative measures for normative predictions. 


Personality scales fall into one of two 
broad classes with respect to the nature of 
the predictions which can logically be gen- 
erated from them. If the self-ratings or 
observer-ratings on a given trait are based 
upon the comparison of that trait with other 
traits within a single individual, then the 
scores allow one to predict that this par- 
ticular individual is more (or less) likely to 
behave in one fashion than another. The 
“strengths” of the traits are measured rela- 
tive only to the strengths of other dimensions 
of personality within the same individual. 

The second class of personality scales in- 
cludes those which require the rater to 
specify whether trait-relevant behaviors are 
or are not characteristic of an individual or, 
perhaps, how characteristic these may be 
relative to some reference group. In either 
case, the scores which a person obtains can 
be compared across individuals and predic- 
tions made regarding the probability that the 
person will or will not behave in a particular 
way relative to other people. 

Cattell (1944) designated those measures 
whose scores reflect only the relative strengths 
of traits within a given individual as ipsative, 
whereas he termed those measures whose 
scores reflect the strength of traits for a 
given individual relative to other individuals 
as normative. Cattell felt that scores derived 
by these two procedures differed in meaning 
but allowed that “. . . normative treatment 
of scores primarily ipsative . . . seems legiti- 


mate and real” (p. 302). Guilford ie 
p. 528), however, has warned against on 
use of between-individual differences UP le 
ipsative measures, because there is no sing 
scale for all individuals. + alence 
The question of the degree of equivale is 
between ipsatve and normative measur a 
of practical as well as theoretical significan’™ 
since it has become rather common, om 
normative statements to be derived ae 
ipsative measures. The Edwards as 
Preference Schedule (EPPS) (Pawa 
1959) and the various Ọ sort procedu 
would be two examples. the 
As Block (1957) has pointed out, ase 
equivalence of ipsative and normative M° 
ures is open to empirical investigation 
well as rational argument. He compia 
observers’ ratings of personality obta! om 
from normative rating scales and ae 
ipsative Q sorts and found a remarkal 5 
high degree of equivalence (median aoc 
and .79 for the two conditions). P e- 
speculated that the high correspondence A 
tween measures was attributable to a 1° the 
tive frame of reference which existed ae 
expert judges even in an ipsative j 
procedure, Thus, in the ipsative rating 


-orent 
- the decision between qualitatively ae 
variables as to “salience” is made with refer tion 
a very large body of information and era ich) 
as to the psychology of human beings t 
- « is based upon some sum or integration a 
rater’s lifetime experience with and conceP 
tion of various Personalities (pp. 52-53)- 


jiza” 
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Pi and Fordyce (1962) compared the 
oe between three ipsative Q sort 
es s and a normative check list and found 
a ee to range between .63 and 
i ey concluded that the two types of 
easures had “practical equivalence.” 
we present study sought to broaden the 
= ce regarding the equivalence of ipsative 
‘a normative measures in two ways. Two 
a of ipsative scales were evaluated, 
iiei Block as well as Kogan and Fordyce 
eee only Q sorts. Also assessment of 
oe ence was extended beyond interscale 
of i onships to include the relative validities 
ipsative and normative scales. 


METHOD 


one The 197 subjects employed in the in- 
mora were volunteers from a large under- 
eaaa class at the University of Iowa. This 
er included 99 males and 98 females. 

Gore scales. The two personality traits 
art for study were need achievement and need 
mene aA The normative measures of achieve- 
select. (Ach-N) and nurturance (Nur-N) - were 
K ed from the Need Scales (Gough & Heilbrun, 
on press), which are scored from self-descriptions 
Go the ‘300-item Adjective Check List (ACL) 
i a & Heilbrun, in press). Evidence has ac- 
(Heilb supporting the validity of both scales 
tin om, 1958; 1959 1960, 1962). The correla- 
ang brtween Ach-N and Nur-N is low (r = -19), 
ade item overlap is negligible (3 out of 102 
stand ives included in both scales). The ACL is 
äl ae, administered to the subject with free- 
the = present or absent checking instructions, and 
daa T scores are based upon the number of 
ices oe endorsed as self-characteristic which in- 
numb the presence of a given need minus the 
this er of endorsed adjectives which contraindicate 
ett types of ipsative scales were developed as 
5 ures of need achievement and nurturance for 
nogen One type was a forced-choice measure 
hae after the Edwards Personal Preference 
ener to the extent that the subject was pre- 
ach with items which included two adjectives, 
to measuring a different need, and was required 
ee gee the more characteristic adjective within 
Pair. The other type was 4 Q sort procedure. 
forced-choice measure of achievement (Ach-FC) 
nurturance (Nur-FC) utilized the adjectives 


i 
‘cluded in the normative scales described above. 
e adjectives for col- 


le 


male and 100 female undergraduate subjects not 
included in the present study. Adjectives from the 
Ach-N and Nur-N scales were then paired so that 
each pair included either indicative or contraindica- 
tive adjectives, and so that the base rates were 
as similar for each pair of adjectives as was pos- 
sible. Different forms of the forced-choice measure 
were developed for each sex. The matching proved 
moderately successful in that the average deviation 
in base rate between paired adjectives was ap- 
proximately 5% for the male form and 11% for 
the female form. Both forced-choice measures had 
66 adjective pairs, including two pairs keyed to 
correct for the two overlapping achievement and 
nurturance adjectives which are scored in opposite 
directions. The forced-choice measure was ad- 
ministered with instructions for the subject to 
choose the more characteristic adjective in each pair, 
omitting none. Scoring proceeded by giving a plus 
score for each indicative achievement adjective 
endorsed and a minus credit for each contraindi- 
cative achievement adjective endorsed and subtract- 
ing the total minus from the total plus. Nur-FC was 
scored in identical fashion. This procedure re- 
quires that each subject will receive the same 
average score across Ach-FC and Nur-FC which 
corresponds to the identity of average raw scores 
on the ipsative EPPS across the 15 variables 
measured, 

The ipsative Q sort decks were composed of the 
64 Nur-N adjectives and 38 Ach-N adjectives 
printed on individual cards. The nurturance and 
achievement decks were administered to the subject 
separately with instructions to sort them into eight 
a function of how characteristic the ad- 


piles as 
jective on each was; a rectilinear distribution of 
cards was required. Scoring for Ach-Q and 


the algebraic summation 
of plus and minus credits over all cards in a deck. 
A +7 score was assigned to any card bearing an 
indicative adjective and placed in the “highly char- 
acteristic” cateogry down to a zero if an indicative 
adjective were put in the highly uncharacteristic 
pile. Similarly, a —7 was scored if a contraindicative 
adjective were placed in the highly characteristic 
pile down to a zero if such an adjective were 
sorted as “highly uncharacteristic.” 

It might be noted at this point that since the 
three types of scales include identical test items 
and the scale scores were obtained from the same 
subjects under similar testing conditions, the form 
of item presentation appears to be the only im- 
portant scale difference. 
apace 

1A rectilinear distribution of sorts was required 
because of an earlier finding by Jones (1956) that 
the average distribution of personality statements 
for college subjects under free-sort conditions 
differed significantly from the quasi-normal dis- 
tribution which is often presumed. Further, in- 
spection of his results suggested that their average 
much more closely approximated a rectilinear than 


a normal distribution. 


Nur-Q was based upon 
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Criteria. The criterion for need achievement was 
level of academic performance at the University of 
Iowa as measured by cumulative grade-point average 
with intellectual ability held constant. Ability level 
was estimated by the composite percentile obtained 
on the Iowa freshman entrance examination. 

The nurturance criterion was one used previously 
by Heilbrun (1959). The subjects were asked to list 
by name the “charitable, medical research, rehabili- 
tation, church, or educational activities to which 
they had contributed time, money, or personal 
effects within the previous two years.” The criterion 
score was the total number of listed acts (e.g. 
money contributed to a specified charity) or 
activities (e.g., “big brother” program at the school 
for physically handicapped children) which were 
judged to be both charitable and voluntary by the 
investigator. 

Procedure. The subjects were tested in two 
sessions. In the first session the ACL and the 
forced-choice measure were administered; the second 
session included the two Q sorts and the question 
regarding charitable activities. The order of tests 
given above was used for all subjects. Median 

testing time was well below an hour for each 
session, and the time between sessions was no more 
than 2 weeks for any subject. In a few cases, 
the subjects appeared for one testing session and 
not the other, and the results of their tests appear 
in only some of the analyses. For this reason and 
because the entrance examination scores were un- 
available for a small number of subjects, the Ws 
reported for the various statistical analyses in this 
study will vary slightly. 


RESULTS 


Interest relationships. The correlations? 
between the three measures of need achieve- 
ment and need nurturance are presented in 
Table 1. All rs were positive, of moderate 
magnitude, and highly significant. The cor- 
relations among the achievement scales did 
not differ reliably, although the correlation 
between Ach-N and Ach-Q (r =.50) was 
higher than that between the two ipsative 
scales (7 = 43). This same finding can be 
observed for the nurturance measures. The 
normative scale’s z with Nur-Q (.72) was 
higher than was the 7 between ipsative scales 
(.45), and, in this case, the difference was 
highly significant as evidenced by a t for 
correlated coefficients (Lindquist, 1940, p. 
218) of 5.32 (p < .001). 


2 Since there was no basis for expecting the 
intertest relationships or the test-criterion relation- 
ships to interact with sex, proportional numbers of 
male and female subjects were combined for all 
statistical analyses in the present study. 
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TABLE 1 


CORRELATIONS BETWEEN IpsATIVE AND NORMATIVE 
SCALES oF NEED ACHIEVEMENT AND 
NEED NURTURANCE 


Need achievement Need nurturance 


a Forced- 
tenet hoise, 
Normative ipsative Normative ipsativ 
Q sort 4 
ipsative 50 43 372 oe 
Forced- 
choice 
ipsative 43 50 


sah e 
Note.—The numbers for the rs reported in this table range 
from 189-193 and include equal proportions of males a! 
males. y = .20 significant at p < .01, 


Relationships with criteria. The partial 
correlations between the various achieve- 
ment scales and college GPA with ability 
held constant were as follows: Ach-N; 
r= 07; Ach-FC, r= .31; and Ach-Q, 7= 
.09. The only partial correlation which dif- 
fered reliably from zero was that betwee? 
Ach-FC and college performance (t = 4.46, 
Ż < .001). Although none of the partial 7S 
differed significantly from each other, the 
difference between r= .31 for Ach-Fc an 
7 = .07 for Ach-N approached significance 
(¢ for correlated coefficients = 1.91, p< .06)- 

The curtailed range of scores on the 
nurturance criterion (i.e, 82% of the ae 
and 60% of the females reported four : 
less charitable acts) made correlational anal- 
ysis inappropriate. Instead, high and lo y 
nurturance groups were defined by splitting 
the male and female criterion score distribu- 
tions as near as possible to their respective 
medians and combining the two high et 
the two low groups. The combined hig 
nurturance group (V = 93) included 
males (charitable acts = 3) and 36 females 
(charitable acts = 5 ), and the low nurturanc® 
group (N = 99) was composed of 39 male 
and 60 females, The mean scores for these 
two criterion groups for each of the nurtut 
ance scales were then compared. These res 
can be found in Table 2. Only Nur-N relate 
significantly to the criterion. is 

Since the preceding nurturance analy ne 
did not provide a direct between-scale wee 
parison of validity, one further analysis oie 
conducted. The distribution of scores 


| 
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TABLE 2 


Comparison oF Hick AND Low NURTURANT GROUPS ON IpsATIVE AND 
NORMATIVE NURTURANCE SCALES 


Nurturance scale 


Forced-choice ispative 


Q sort ipsative 


Criterion Normative 
= yv x = ; Na D E a ee 
pom 91 SLS 101 ay 03 120 122 92 98.31 33.00 
Ba gg on Be i 99 244 1258 100 95.87 36.88 a 
*p< o5, 

tributable 


each 
The e rane scale was divided near the 
Wo low r each sex and the two high and 
ow e ups were combined. These high- 
With the em ities were used in combination 
Mine the igh-low criterion groupings to deter- 
Ject falls Proporton of “hits” (e8. the sub- 
Stoup) the high scale group and high criterion 
M high s misses” (e.g., the subject falls 
~~ group but low criterion group) 
Ree each scale. The three proportions 
Vur- ae 110-79; Nur-FC 98-94; and 
tions by ~95. Comparison of these propor- 
Y sip Stuate tests failed to indicate 
the xX ificant differences in the proportions, 
values ranging from .06 — 1.97. 


DISCUSSION 
The i 


k tale correlations which were ob- 
OW th e present study presumably fall 
alth ug € reliabilities of the individual scales, 
k cients only the 10 week test-retest CO- 
No of the two normative scales were 
Coding AEN = 84, NUN = 75); Ae 
ale a it seems safe to assume that each 
sel ove show a higher relationship with 
pite a time than it will with other con: 
T this ogee of the same variable use 
rele Sate y. Despite this general caution 
ip ations ae equivalence, the intertest cor- 
w ative TA to support the contention that 
cae uncti normative personality measures 
tng eg interchangeable in making 
Rete a between individuals. This 1s 
thon the Fa the fact the correlations be- 
S Ose a wo ipsative scales were less than 
“ale Ween the ipsative and normative 


S 
Sleet k 
gesting that the uncommon vari- 


ance among scales is primarily at 
to factors other than the ipsative-normative 
mode of item presentation. This line of evi- 
dence must be regarded with some caution, 
however, because of the nature of construc- 
tion of the two ipsative measures. The forced- 
choice measures involved the relative com- 
parison of behaviors representing two needs, 
whereas the Q sorts involved relative com- 
parisons of behaviors related to a single need. 
The latter procedure may allow the test- 
taker to utilize a more normative model in 
self-evaluation. The correlations reported in 
Table 1 tend to support this, since the Q sort 
ipsative versus normative 7s were higher 
than the forced-choice ipsative versus norma- 
tive rs. Thus, it may be important to stipu- 
late the specific nature of the ipsative scale 
in talking about functional equivalence to 
normative measures. 
The other and, perhaps, more crucial line 
of evidence which suggests that ipsative meas- 
sures can be used to make normative state- 
ments is that the forced-choice ipsative scale 
of achievement did in fact correlate sig- 
nificantly with college achievement, ability 
t. Thus, the self-appraised 


being held constan 
strength of achievement motivation relative 
to need nurturance within the individual re- 
lated to the actual achievement of that in- 
dividual relative to other individuals. This 
finding deserves special emphasis since the 
reviously validated normative achievement 
scale failed to predict academic performance 
using the same subjects. 
The possibility remains that the relative 


success of Ach-FC was a consequence of the 
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particular variables which were inadvertently 
selected for the forced pairing. Achievement 
in a competitive academic situation may in- 
volve an actual element of decision between 
need achieving and nurturant behavioral 
alternatives for many students; certainly, 
attempting to outperform others and extend- 
ing benefits to others are to some degree in- 
compatible. If so, forcing the subject to 
choose between achieving and nurturant test 
alternatives would be consonant with a forced- 
choice between actual achievement and 
nurturant social alternatives. The ipsative 
scale would be matched with an “ipsative” 
criterion. 

The relationships between ipsative and 
normative ratings reported by Block (1957) 
(prior to correcting for attenuation) and later 
by Kogan and Fordyce (1962) were generally 
higher than the between-scale correlations 
found in the present study, These lower mag- 
nitudes are consistent with Block’s hypothesis 
that the high correspondence between ipsative 
and normative ratings in his study was due 
to a broader frame of reference used by the 
judges when traits were considered relatively 
within the individual. Since it can be pre- 
sumed that the trained psychologists employed 
by Block share a greater “body of informa- 
tion ++ + as to the psychology of human 
beings” and, thus, a more applicable frame 
of reference than college adolescents, lower 
coefficients would be expected in the current 
study . In any case, the present results, taken 
in conjunction with earlier findings by Block 
and by Kogan and Fordyce, provide a sub- 
stantial case for the equivalence of ipsative 
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and normative personality measures for 
making normative predictions. 
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HYPNOTIC SUSCEPTIBILITY AND MMPI PROFILES 


ROBERT E. SCHULMAN? ann PERRY LONDON 


University of Illinois 


4 groups totaling 87 undergraduate females of relatively different hypnotic 
susceptibility as determined by Form A of the Stanford Scale of Hypnotic 
Susceptibility completed the questionnaire form of the MMPI. The most 


susceptible group scored significantly 


lower than the others on the Pd scale 


(p< .01), but no other significant differences were found between groups. 
Mean scores of all susceptibility groups fell within the normal range, sug- 
gesting that hypnotic susceptibility is not grossly related to psychopathology. 


et revival of interest, in formal 
ility has ie ypnosis and hypnotic suscepti- 
in th een accompanied by renewed inter- 
adjustiye : personality characteristics and 
ivi ual ates of the hypnotically susceptible 
nineteenth The notion was popular among 
anet, th century psychiatrists, such as 
trait fòu ri susceptibility to hypnosis was a 
and to ibi primarily in hysterical neurotics, 
tended ¢ 's day psychoanalytic theorists have 
notic t describe the ability to enter hyp- 
ilder Sin terms of temporarily regressive 
al Ross if not permanently patho- 
~ havior (Fenichel, 1945). 
Unhinderes © in this area was almost entirely 
Search za by the results of empirical re- 
âttempt ntil Wells’ (1931) early work which 
Various ed to relate hypnotic susceptibility to 
SEE y personality traits, His results how- 
"liable. e generally contradictory and un- 
Perce unire (1937) used the Thematic 
Ceptibinin ion Test to demonstrate that sus- 
te base to hypnosis can be predicted on 
ad atti of an individual’s social attitudes 
‘ling; tudes towards the phenomenon itself, 
Sarg hg later confirmed by Rosenzweig and 
They (1942), y 
Sse or chach has also been used to study 
rein & 3 (Brenman & Reichard, 1943; 
Sults of eta 1942; Steisel, 1952), but 
ambi ese studies have been conflicting 
eon It is a fair conclusion 
ithe ctive tests have by and large 
Onalig Ccessful in identifying clinical and 
A n Y correlates of hypnosis. 
er of recent studies have found low 


c 
logic 


1 
B Pre 
Voh sent or 
holga,” 2 Postdoctoral Fellow in Adult Clinical 
EY at th 
e Menninger Foundation. 


but significant correlations between some 
scales of inventory tests such as the Min- 
nesota Multiphasic Personality Inventory, 
Cattell 16 PF, and California Personality 
Inventory (CPI) with volunteering as sub- 
jects for hypnosis and similar behaviors 
(London, 1961; London, Cooper, & Johnson, 
1962), but these studies did not specifically 
examine differences between hypnotically sus- 
ceptible and unsusceptible subjects. Hilgard, 
Weitzenhoffer, Landes, and Moore (1961) 
have used the CPI for the latter purpose, but 
generally with inconclusive results. 
With the exception of the work of 
Weitzenhoffer (1958), Hilgard et al. (1961), 
and Hilgard and Lauer (1962), most of the 
studies which used projective tests and factor 
analyzed personality inventories to study sus- 
thodologically limited by 


ceptibility were me 

such details as small sample size, etc. It is 
worth considering, however, that their results 
would in any case have to be limited to test- 


related statements about personality charac- 
teristics whose clinical implications were at 
best indirect. 
Among existing tests, only the MMPI is 
empirically and directly derived from norms 
obtained on relevant pathological populations. 
Three studies related to susceptibility have 
reported using the MMPI (Faw & Wilcox, 
1958; Sarbin, 1950; Secter, 1961). Their 
results appear generally unimpressive, but 
may be obscured by ambiguities in reporting 
criteria and results as well as by the use of 
small and biased samples. 
The present research was concerned with 
examining MMPI scale and profile differences 
for four groups of females of different 


relative hypnotic susceptibility. 
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METHOD 


The Stanford Hypnotic Susceptibility Scale, Form 
A (Weitzenhoffer & Hilgard, 1959), was individually 
administered to 87 female undergraduates who had 
volunteered to participate in a hypnotic experiment, 
The questionnaire form of the MMPI was subse- 
quently completed by each subject as follows: 

Subjects were classified into four groups of relative 
hypnotic susceptibility on the basis of their Stanford 
Scale scores. There were 16 subjects in the High 
(10-12) scoring group (H), 19 in the Medium (8-9) 
group (M), 33 subjects obtained scores placing them 
in the Low (5-7) group (ZL), and 19 fell in the 
relatively Unsusceptible (0-4) group (U). 

None of the subjects were informed of their precise 
susceptibility ratings, but members of the two rela- 
tively high susceptibility groups, H and M, as well 
as of the relatively unsusceptible group, U, were 
immediately assigned to another experiment. This 
experiment was finished within 3 weeks after 
the Stanford Scale administration, and the subjects 
then completed the MMPI. Members of the L group 
were not assigned to another experiment, however, 
and completed the MMPI within a few days of 
the Stanford Scale. 


RESULTS 


Inspection of the average profiles and means 
(Table 1) of the four groups plainly suggests 
that they are all more alike than different, 

A simple four-way analysis of variance 
(Walker & Lev, 1953) was performed sepa- 
rately on each MMPI scale. The Pd scale 
showed significant mean differences between 
groups (p<.01), as did the Pt scale 
($ < .05) (Table 2). In both instances the 
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TABLE 1 
MMPI Mean T Scores 


Susceptibility group 


Srle Medium High Low 
Scale (0-4) (8-9) (10-12) (5-7) 
L 46.53 43.63 47.00 46.00 
F 50.84 53.68 53.69 53.82 
K 56.58 57.05 56.19 54.33 
Hs 49.63 49.74 52.12 48.91 
D 48.47 49,74 47.12 51.48 
Hy 56.11 53.32 54.69 56.45 
Pd 57.16 59.16 53.94 59.67 
Mf 46.42 46.11 49.44 44.12 
Pa 55.74 53.42 54.75 57.27 
Pt 54.42 54.95 51.12 58.58 
Sc 55.79 57.42 55.12 60.03 
Ma 55.79 58.58 58.56 60.82 
Si 47.32 50.95 46.31 50.21 


H group was significantly lower than the 
other three groups. E 
Since the sequence of conditions for the 
group differed from that of the three groups 
who had participated in a separate is ae 
ment, a simple three-way analysis of varianc 
Was completed for the three groups who ae 
taken the MMPI 3 weeks after se 
been administered the Stanford Scale. i 
results were generally the same as those e 
the four-way analysis except that the Pt a 
difference did not occur; Pd scale differences: 
however, were still significant (Table 3)- 


A d 
Each profile was then scored for high 4” 


TABLE 2 
ANALYSIS OF VARIANCE OF Four Susceprinmity Groups 


Variable Total z Between 4 Within j F 
7 7 3 33 mae 

Ë 2,777 125 2,052 

F 3,519 124 3,395 i 
K 5,208 113 5,095 SS 
Us +.5K 4162 112 4050 = 
D 7,332 238 7,004 = 
Hy 6,197 135 6,062 = 
Pa+ AK 7,384 965 6,419 4.16" 
My 7,137 292 6,845 1.18 
Pa 7,391 194 7,197 = 
Pre IK 7,100 645 6,455 2.76%" 
Se+ 1k 6,529 355 6,174 1.59 
‘Ma + 2K 11,740 308 11,432 — 
Se 8,348 288 8,060 - 
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TABLE 3 
ANALYSIS OF VARIANCE OF THREE SUSCEPTIBILITY GROUPS 


Vari af df 
ariable Total 34 Between 2 Within ¢ F 
: 1,682 122 1,560 1.99 
x Pe 98 2,130 117 
P D 6 3,055 = 
a + 5K 2,798 67 2,731 = 
1 4,348 59 4,289 — 
y 2,902 73 2,829 = 
n + AK 4,067 810 3,257 6.34* 
ue 4,361 107 4,254 == 
p 3,663 51 3,612 — 
4 +1K 3,496 145 3,351 1.10 
Se + 1K 3,170 49 3,121 5 
Ma + 2K 8,520 95 8,425 = 
Si 4,612 214 4,398 1.24 
*?< o, 
low 
Poin à š 7 Sai 
Tevea] 4 codes, a procedure which could personality differences within the normal 
regardless erns of differences between groups range, it is hardly surprising that it is in- 
Ry... CSS Of the similarities of scale scores. sensitive to possible differences between 
ample. It is of interest also 


Minati, 
me of these codes did not reveal 
ingful groupings relative to the four 


ley, 
e 
of susceptibility. 


Tina DISCUSSION 
ne of oiis of the data indicates that only 
“isting h MMP1’s clinical scales seems to 
difer map consistently between groups which 
“Cores of protic susceptibility. The Pd 
Significant most susceptible group (H) were 

“OUD in y lower than those of any other 
“nees et both analyses. The absolute differ- 
Ser Pity them even for this scale, how- 
a face ga ones. Taking these differences 
i tie ue, one might allege post hoc that 
we e people are less aggressive, 
A s wie iant, and less negative about rela- 
tag a authority figures than are others. 
m Photie very willingness to respond to the 
o thet ee communicates this idea 

the aipy ively than the MMPI, and the size 
sta, 22Y erence on Pd is too small to argue 

atus ` Steat difference in either clinical 
ie behavior, 

i. performances of members of all 
Y all essentially normal ones, with 
ard devi means falling within a single 

t E ie of the mean of the Normal 

not ion group. Since the MMPI is in 

very adept at identifying subtle 


groups of this s 
that our results offer no support whatsoever 


for the connections between Hy and sus- 
ceptibility previously obtained with smaller 
samples. All told, the conclusion seems in 
order that susceptibility to hypnosis can best 
be viewed as a trait falling within the normal 
range of behavior rather than representing 
hysteria or any other gross psychopathology. 

A more general conclusion concerning both 
the present study and the host of work pre- 
ceding it may be worth noting as well: There 
may indeed be personality traits which dis- 
tinguish persons of relatively different degrees 
of hypnotic susceptibility, and these traits 
may be well worth discovering; but it seems 
quite clear that they are not going to be 
discovered by any of our existing gross 
y inventories, whether the factor 
d by previous workers or the 
d one used here. Neither our 
or current construct cate- 
Jevant to this trait. It is 
dies like this one and 


personalit 
analyzed ones use 
empirically derive 
old Kraepelinian n 
gories seem very Te 
time to stop doing stu 
ek a fresh approach. 


to se 
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REVISION OF THE MMPI K CORRECTION PROCEDURE 
FOR IMPROVED DETECTION OF MALADJUSTMENT 
IN A NORMAL COLLEGE POPULATION * 


ALFRED B. HEILBRUN, Jr? 


University of Iowa 


The problem investigated was whether a better system of K weighting could 
be developed for the 10 MMPI clinical scales to enhance their usefulness as 


2-group discriminant analysis was used to determine the K value for each 
scale to maximize discrimination between maladjusted and adjusted college 


Ss, the sexes being considered separately. The revised weighting system 


differed greatly from the standard system; major 
weighting of Hy and deletion of weights from Hs, Pd, and Ma. Lesser changes 
d D, Mf, Pa, and Si continued to be 
was shown for maladjusted 


differences were in negative 


in Pt and Sc weights were obtained, an 


ooe = O S 


unweighted. Cross-validation of revised weights 
d psychopathic hospital cases. 


college Ss and seriously maladjuste 


The M; 
Bento naesta Multiphasic Personality 
1951 eA (MMPI) (Hathaway & McKinley, 
and ieee the most extensively used 
for dia, ated psychometric device available 
Snosing psychopathology, yet for 


measures of adjustment level within a grossly normal college population. 


samples of diagnosed patients and normals. 
Five clinical scales were corrected by adding 
from .2 up to 1.0 of the raw K scale score, 
consistent with the logic that higher K scores 
mean greater guardedness which, in turn, is 


Seve, 

Rest th years evidence has accrued to sug- associated with unduly depressed (and 
plied the MMPI has been inefficiently healthier) profiles on the MMPI. McKinley 
Maladjy to the task of detecting less severe et al. (1956) warned, however, that 

lion Stment within a grossly normal popu- , . . these weights are optimal, within our sample, 
the te n the early stages of development of for the differentiation of largely inpatient psy- 
i chiatric cases of full-blown psychoneurosis and 


e CESI 
Clinica ’ it was found that certain of the 


sis from a general Minnesota “normal” group. 


psycho. 
her clinical purposes it is possible that other 


irimia showed an improvement in For ot 
Alling Pon between hospitalized patients \ values would be more appropriate. Thus, it seems 
and an thin various diagnostic categories likely that for the best separation of “maladjusted 
easy 4Dselected normal group when a scale normals,” such as those which abound in a college 
Dor. Ting test-taki z . r- counseling bureau and would be formally diagnosed 
H ate as a TEME attitudes Was: ee in a psychiatric clinic as simple adult maladjust- 
Wag Way a variable (MeKiniey, ment, other weights might be better (p. 115). 
lege eveloped Meehl, 1956). The K a - Since the person who obtains a high score 
Whits”eness ised or fndin a at tend on K does so by denying personal inade- 
og istinowiche y by finding MMPI 1 d quacies, the significance of a high score ob- 
dor izeq soe ed between normals an tained from a hospitalized psychiatric pa- 
the "al test i ae patients who showed tient and from a normal person making an 
the jie Scale os een ake ee on adjustment in the community might be ex- 
in atter grou the assumption being that pected to differ. In the former case, prima 
deta. Undul a nets portray. mg themselves facie evidence suggests a defensive distortion 
to mineg E avorable light. Empirically of fact, whereas in the latter case there is a 
: weights were then determined Considerably higher probability that the 


'Scrim] À 
l seg oe ee ee person is giving a factual account of his 
Ue amy tigation was supported by a Te- psychological adequacies. Research evidence 
eget nt from the Graduate College, State has been presented to substantiate the hy- 
vak author wi yothesis that the K factor is positively 
4 ESS erage a Norton f sychological strengths in normal 
it stati assistance in the planning and conduct related to psy S 


i 
Stical analyses of the data. 


populations. 
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Block and Thomas (1955) and Berger 
(1955) found a positive relationship between 
K scores and degree of self-acceptance. A 
picture of the high K normal as poised and 
comfortable in social situations emerged 
from studies at the Institute of Personality 
Assessment and Research (Block & Bailey, 
1955; Gough, McKee, & Yandell, 1955). 

Smith (1959) reported a significant nega- 
tive correlation of .39 between K scores and 
judgments of defensiveness for his normal 
subjects; further, higher K scorers were rated 
as more insightful. Heilbrun (1961) tested 
the hypothesis that K is a measure of psy- 
chological health in a grossly normal popula- 
tion and found some support in the case of 
college females where an adjusted group 
scored higher on the K scale than a counsel- 
ing service maladjusted group. He further 
demonstrated that the K scale is more highly 
correlated with test-taking defensiveness for 
both male and female maladjusted subjects 
within a normal college population than for 
their adjusted counterparts. Nakamura 
(1960) found that presumably more mal- 
adjusted disciplinary cases scored signifi- 
cantly higher on K when retested as part of 
an evaluation which might culminate in 
probation or suspension from school relative 
to test-retest comparisons for college non- 
disciplinary controls, 

The present study sought to determine 
empirically the K corrections for the 10 
clinical MMPI scales which would maximize 
the discrimination between adjusted and 
maladjusted persons in a college population. 
The importance of improving the MMPI’s 
ability to distinguish maladjusted persons 
within a college population goes beyond the 
clinical benefits to be obtained. Considerable 
research into the nature of behavioral mal- 
adjustment is conducted in college settings, 
much of it employing the MMPI as the 
measure of maladjustment. Obviously, the 
more precisely the psychometric instrument 
can measure degree and kind of maladjust- 
ment within this type of population, the 
more likely the research will provide lawful 
relationships. 

METHOD 


Subjects and procedure. The study was separated 
into two phases. The purpose of the initial phase 
was to determine the optimal amount of the raw 
K score to be added to or subtracted from the 
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raw scores of the 10 MMPI clinical scales to 
maximize the discrimination on each scale between 
maladjusted and adjusted subjects in a college 
population. Maladjustment was defined as having 
sought help at the University Counseling See 
University of Iowa, for personal adjustment prod- 
lems. Since the contribution of K was to be in- 
vestigated separately by sex, groups of 50 une 
graduate male and 50 undergraduate female alien 
were included in the initial samples. Adjusted, for 
turn, was defined as not having sought help 1° 
any type of problem at the Counseling Service Th 
until the time of data collection for the study. e 
order to specify more clearly the type of eola 
population to which the results of the study ars 
be generalized, a base rate for occurrence ie 
maladjustment was estimated and the appropria 
ratio of adjusted to maladjusted subjects neice 
in the size of the respective groups. It wasana, 
that 10% of college undergraduates experience p T- 
chological difficulties of sufficient intensity to Rie 
rant personal adjustment counseling. According 
450 male and 450 female undergraduates W° 
included in the adjusted groups, maintaining s, 
nine-to-one ratio of adjusted to maladjusted Ibe 
The second phase of the study included two fs 
dependent cross-validations of the optimal K Wet as 
obtained in the initial phase. The first or ual 
validation was a direct replication of the aa 
phase conditions. MMPI records of 30 male Jing 
30 female undergraduates, seen at the Coun tes 
Service because of personal adjustment ginean e 
were compared with those obtained from zo by- 
and 270 female adjusted undergraduates. Te ales, 
pothesis tested was whether the 10 clinical ‘mitia 
using the optimal K weights obtained in the } 
phase, differentiated these two groups ae 
fectively than the clinical scales as star = 
weighted. ings 
In addition, the generality of the initial orl 
was tested by comparing the newly develoP®" he 
weighting system to the standard system Husted 
discrimination between more seriously malad) The 
college groups and adjusted college students. | re 
maladjusted subjects in this cross-validatio® ersitY 
patients at the Psychopathic Hospital, pi ment 
of Iowa, which specialized in short-term me male 
of acutely disturbed psychiatric cases- Fifty were 
and 50 female patients (about 85% of whom pal- 
inpatients) constituted the more seriously as 
adjusted groups, all of these being unselec! ae 
to psychiatric diagnosis. The adjusted suber male 
this comparison were 450 male and 45°. c 
undergraduates. These adjusted a 


a 


rdly 


samples 557 
the 270 males and 270 females in the other 
validating comparison but did not overlap initial 
the 900 adjusted subjects employed in the 
phase of the study. for G 

Test administration. The MMPI records y of 
of the 2,060 subjects included in the aor rior 
obtained under group testing conditions J at the 
to the beginning of their freshman Jer rec 
university, The exceptions were the ital f 
obtained from the Psychopathic Hosp? part a 
MMPIs were individually administered a5 adjust 
a diagnostic battery. Adjusted and ™ 
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rae a in the initial and first cross-validation 
and aa closely matched for age (about 18 years) 
is Se ager (12 years). No attempt was made 
Mean se the Psychopathic Hospital samples for 
parison ge and education with the adjusted com- 
the a The only restrictions imposed for 
Gti ee subjects were that they 
ad at oe y young (<39 years of age) and that they 
The Ma time attended a college. 

Were dr PI records for the 1,800 adjusted subjects 
MMPIs pie from the 1959 freshman class. The 
subjects on the 160 Counseling Service maladjusted 
ing pro Were obtained as part of the freshman test- 

gram in all years from 1958-1961, inclusive. 


Ta RESULTS 
Nitial Phase 


en group discriminant analysis was per- 
ormula o determine the multiple regression 
otom which would best predict the di- 
adjust Us criterion of adjustment versus mal- 
With ent for each raw score combination of 
e n of the 10 MMPI clinical scales. 
timal] tiple biserial correlations between the 
Ment tee combined raw scores and adjust- 
orrela. were compared with the biserial 
ardly i between the same scales as stand- 
hea in T units, with the traditional 
es im ations included on five scales. Two 
"eVisio ere followed in deciding whether a 
n in K weights was warranted. 


(a o 
‘al . x the X scale was assigned a statisti- 
Significant beta weight in the multiple 


fo 
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regression formula (i.e., made a contribution 
to prediction beyond that contributed by the 
MMPI clinical scale) and the multiple R was 
significantly greater than the r between the 
standardly scored scale and the adjustment 
criterion, the new optimal K weight for that 
scale was adopted. 

(b) If the K scale was not assigned a statis- 

tically significant beta weight in the multiple 
regression formula and the standardly scored 
scale did not have a significantly higher 7 with 
the criterion than did the K uncorrected raw 
scale score, no K correction was adopted for 
that scale. 
The results of the analyses of the initial phase 
data are given in Table 1. The only scales for 
which K corrections significantly improved 
the discrimination of adjusted and malad- 
justed college subjects relative to raw scale 
scores alone were Hy, Pt, and Sc, these results 
holding true for both males and females. 

The differences in correlations with the ad- 
justment criterion between the standardly 
scored clinical scales and either the new K- 
corrected Hy, Pt, and Sc scales or the raw 
scores on the seven remaining scales were then 
tested, and the F values are reported in the 
final two columns of Table 1. It can be seen 
that the three scales with revised K weights 


correlate significantly higher with the adjust- 
do these same scales as 


ment criterion than 


TABLE 1 


OMP, 
ty ARISON OF OPTIMAL AND STANDARD 
IFFERENTIATING LEVELS OF ADJUS 


K Scare WEIGHTS FOR THE 10 
TMENT (CRITERION) wW 
CoLLEGE POPULATIONS 


MMPI CLINICAL SCALES 


Irun MALE (M) AND FEMALE (F) 


R Raw 
r Raw scores 
r Te scores scores pE F Values” (1) 
versus versus bites versus 7 (2) 
r K versus -riterion criterion criterion Beta for duwt ERO 
MMp i a oe as Q) @) raw scores eta fo! 
> scales 1 F 
Scalea! £ - 5 M F M F M F 
: M F ar ME ME [qm 
Us - D ae ae 23 12.67% 11.29%* 
D -41 09 16 AT 22 Si a jose 29%" 2.95 2.26 
Hy aas A0 at A es 16s 25% 6.65* 8.71 
Pa = m a a aa U A Te amh B9 10.77% 
Mf —.27 18.27 a7 31 a7 3 ak: 08 24 26 
Pa Dos g os 27 ort 23 O Yog 8 67 00 
Pi =e Gf as Aias 22 TE ee 25.88%" 8.34** 
a7 : s 3 
Se —69 25 24 28 24 = ee aget 36% 7.38" a 
=, pi oe ee 93 1. 
s ie ai A Se Ai 133 4398 
‘49 i ‘06 16 ll io al steer’ 10 
=. T : k 
Sia, ps EP i i gf =masculinity-femininity, Pa =para- 
» Pps Sh D ` nae =psychopathic deviate, Mf=m 
t < pa Y Pochondriasis, D =depression, Hy =hyster, Pd =P jal introversion. 


b gp Ds; 

Stang chasti 
+ Stand. 
w2 <s Y 
ATA 


henia, Sc =schizophrenia, Ma =hypomania, 
employed scales scored in T score units. 
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id the stal 
no case did sober 
| currently scored. In late reliably high 
| ale correla main- 
ardly scored sc on the re 
ae did ghessimple ose sence on met “iid 
d in some in di 
tot hobb tho ing seven scales, an the criterion than 
= s\Kessekseae better to 3 
££ 2ZIAf“SarR AG scores related Ea 1 K weights. +: be 
z BE Sls “8 SAS the scales with traditiona’ a analysis in t 
a4 The final stage of the ee ie rules for | 
AE a 
2 E es initial phase was to ap EY de T, a 
oe re = 5 h. i 
z gai BES ARSs8aseeR revision of K weights ta these samples Z 
Zz 24 \283/44 Ee sie or thes -i 
3 e258 ane E rected clinical A be tested bY ero 
=Z 9 a . cou e em 
S 57 PEN gag|ë college subjects scales inet the require che 
g g 24 WH SSE SaaS 3 validan. png th males and female "ined 
a Se i 1 for bo deter 
a Ne ğa = for Rule hese scales d to 
£ 2 e z n ights for thes unde Í 
3 g = optimal weig ion formula and ro 
= |e = from the regression icons: 
5 ja £ 8 re as 
S Bt = sessilis t tenth we 
2 Es sigs 5 5 5 S TAR v the neares Females 
a2 g PD S et > Males 5K 
Be oO a -, 
Ss a =a < -1K a 
A o O da = Hy JK 8K 
A 32S = OK x 
aa acoj, É œ |e pi I, IK 
a s 10 r 
gal [eee BES|S2eget5uge d S TK of a} 
H San es = D e 
sga 95,15 5 a the us A 
ggsj? a st PPR negated fa, 2 
E 28 = Application of Raile a Pd, Mj, Pa, M% 
oH 4 3 E > . Pa, 
ae 2 Zs] ~— SSS8SSS5/2 K correction for S, female samples: 
Be ggn = Si for either male or 
a5 i g2 ; 
[S 7 
oO ` š e 
E g ross-Validation Phase : 
e's i gee id anid is. GE cross-vall counsel / 
A 5 $ > s A c ‘of 
4 egeB8hancbbesle mne resu ighting system în both COS 
Ag E SOR EE “ls revised K weigh ng s; hospital discri” er? | 
gal |B $ service and psychiatric ble 2 If revision ose 
z SES 3 Fern iven in Table 2. for Oe. 
ag oe $ situation are giv ed that st 
fi = i on 
ae 3 eile Z8| Asaa % effective it would be sea newly othe 
=| ARANA ; 
a8 Bala a 4 scales where K weights v ome value f ad 
E 5 a s tuted (Hy) or aa of leve ° pan 
6 E S icti er GA 
a |e Bs ase Pt, Sc), pre et d 
3 E z s PRSES H a ant bie be section, weighty 
e 
g i 5 for the same scales as standard standard y 
T 2 3 Further, for those scales Lage “ores 
sala ** z 3 d ed in the w Sd 
8a Lolit 2233/2 weights were ropped it om raw ite 
n EEIEEREE 237381} (Hs, Pd, Ma) prediction ont wei 
# 8 BS Sia gw z should beas good as from stan haws 7 i 
E iie 2 scores, Inspection of ee volving K ful 
; ep & = he 24 comparisons? inv wer 
E wI] g &aw2|E. 20 of the } ations 
5 5 2! E Eg SSSA bi a= revised scales, these expect ak co 
2 aa 7g Sy ive sae! 
= i i iv is 
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=a he fo 
4 2 o Baggage rection in neither the stan 13 of the 16 "yg 
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Bs os SE8Saeess oV SS i a k j5 
Z SIASA SAA Ša (1 —r2)/N — p ber of subjects 
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REVISION OF THE MMPI CORRECTION PROCEDURE 


ST F prediction or that the scale raw 
le faces more highly correlated. The Pa 
the stand, e a to perform more effectively using 
effective ar T scores, whereas D was a more 
predictor using raw scores. 


DISCUSSION 


Poe the revised system for applying K 
rathn o 0 the MMPI clinical scales held up 
3 Abe upon cross-validation, its use when 
ion a PI is employed in a college popula- 
ore lade warranted. Needless to say, be- 
is ste, established psychometric technique 
at Pea by another, more evidence than 
Evera] ered by a single study is necessary. 
One ab still remain unanswered. For 
student the optimal weights derived from 
Useful S within a single university prove 
ence a other colleges? What relevant evi- 
Mets (Black, 1956; Goodstein, 1956) 
for ee that when the modal MMPI profiles 
dents oe of male and female college stu- 
Signifcarr diverse schools are compared, no 
might quat eewional differences are found. This 

Wei 3 lcate that over large groups optimal 
Scho 02S Would also be homogeneous from 
Weighti to school, In any case, if optimal 
dig tng should turn out to be specific to 


iscy 
equal” school populations, this would be 
includi, Problematic for any universal system 
for ee the traditional one. Revised norms 


the ne 10 MMPI clinical scales incorporating 
Wish |> Æ Weights are available to those who 
Corre... Use and further evaluate the revised 
mee System.* 
ee question which remains is whether 
ang | ved discrimination between maladjusted 
any gs ted students upon a given scale has 
aq “tng upon the usefulness of that scale 
yer Sasure of some specific psychiatric dis- 
KR io {OF example, does the change from 4 
olle zero weighting on the Hs scale for a 
More p Population mean that we can make 
ane pneg inferences regarding bodily over- 
Chan, © from scores on that scale or does the 
TeRardin Weight allow only better prediction 
So rs “degree of adjustment”? Since there 
vance a limited degree of diagnostic rele- 
arn 2 Such terms as “hypochondriacal,” 
be moid» cos : 
“h viora] schizophrenic,” etc., when the 
tg Co pa problems of grossly normal ado- 
ale ret the college norms and the conversion 
Mab Taw K scores to specific weights are 
Without charge from the author. 
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lescents are considered, cross-validation of the 
revised scale weights within counseling service 
samples is not good evidence that the psychi- 
atric meanings of the individual scales have 
been more clearly specified. However, the re- 
vised weights were also cross-validated within 
a sample of psychiatrically disturbed, college 
educated subjects for whom such diagnostic 
classes have more relevance. Accordingly, 
somewhat more confidence seems warranted in 
the assumption that the optimal weights not 
only permit greater precision of assignment to 
broad adjustment classes but also a more 
accurate estimate of the specific nature of the 
psychopathology as given by scores on the 
individual scales. 
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RELATIVE IMPORTANCE OF TEST ITEM CONTENT+ 


WARREN T. NORMAN 


University of Michigan 


The “Deviation Hypothesis” and some related notions presented by Berg 
including the contention that test item content is unimportant in personality 
measurement are critically examined. Some previous findings cited in support 
of these formulations are briefly reviewed. 2 additional sets of data com- 
paring the effectiveness of test stimuli with different degrees of content 
relevance are presented. The results clearly indicate that for the criteria 
and subject populations considered, there exist marked differences in the 
validities obtainable from different classes of test stimuli—those with the highest 
degree of judged content relevance producing the most satisfactory results. 
The theoretical or explanatory value ascribed to the Deviation Hypothesis is 


called into question. 


Irwin Berg (1959, 1961) has argued for the 
unimportance of test item content in the 
development of measures to identify what he 
calls “deviant subgroups” in the population. 
This proposition, “. . . that particular con- 
tent of objective personality and interest tests 
is unimportant .. . [that] virtually any 
content of any sense modality should be 
suitable . . .” (Berg, 1959, p. 86) is actually 
presented as a corollary to a more general 
principle which Berg (1955, 1957) has 
dubbed the Deviation Hypothesis. 

In the sequel, I propose first to analyze 
critically Berg’s formulation of this principle 
and its derivatives. I will then cite (but only 
briefly) some of the data he offers in support 
of his position. I will next present some new 
data which I believe bear on Berg’s thesis 
and will argue that, at the very least, the 
Deviation Hypothesis and its attendant no- 
tions have been overstated. Finally, I will 
contend that the theoretical or explanatory 
value which Berg has ascribed to the Devia- 
tion Hypothesis is at best trivial, and that at 
worst, this position leads to an orientation 
toward personality assessment of a most 
militantly atheoretical variety. 


1Portions of this paper were presented at the 
meetings fo the Michigan Academy of Science, Arts 
and Letters, March 24, 1961. Material included in 
this report stems in part from a project sponsored 
by the Personnel Laboratory, Wright Air Develop- 
ment Center, Air Research and Development Com- 
mand, Lackland Air Force Base, Texas. 


ic 
DEVIATION HYPOTHESIS AND ITS ATTENDAN 
Notions: AN EXPLICATION AND CRITIQUE 


The following analysis of Berg’s position z 
based entirely upon his most recent papi 
(Berg, 1961) except where otherwise no 
Berg does not, however, seem to have ma : 
any major conceptual changes in this pe 
statement from those previously pubis 
Thus we find on page 335 the Deviation ae 
pothesis quoted directly from an ear 
paper; viz., 


Deviant response patterns tend to be gets 
hence those deviant behavior patterns whic thus 
significant for abnormality (atypicalness) and 60° 
regarded as symptoms (earmarks or signs) are Pich 
ciated with other deviant response patterns W are 
are in noncritical areas of behavior and usei 
not regarded as symptoms of personality ab pP 
o as symptoms, signs, earmarks) (Berg, 199% 
159). 


First, some purely semantical matters G 
serve mention if only because certain pa 
ingly crucial terms employed in this tater at 
are apparently intended quite differently — 
a casual reader—especially a psycholos* e- 
might otherwise expect. For example, rily 
spite the pathological connotations ori 
carried by such terms as “abnorma® id 
“deviant,” “personality aberration, 
“symptoms” both in the natural language © ah 
in technical psychological discourse, ng 
intension is necessarily here implied. 
merely the behavior deviations such as e 
phrenia are included but any abnorma 
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porion that is, ab-normal in the literal sense 
eing away from normal” (p. 334). 
poas even this latter usage appears to 
Ria E to suit Berg, for in reply 
« "chin s (1959) Be explicitly disavows that 
depatiun Eponge necessarily implies a 
ity ies e rom a general normative or major- 
the a The important thing is that 
Mielec ore of the criterion group differ 
given Ht from the deviant group at a 
(p. T of statistical significance . . .” 
Eaten Nor, apparently, is there any im- 
asymmet to be attached to the seemingly 
“critert ric relation suggested by the use of 
grou ae Fg oi in contrast to “deviant 
al W hile Berg acknowledges that the 
the ge group is always a special segment of 
group neral population and that the criterion 
sample usually people in general or a random 
atter from the “normal” population, this 
Which need not be the case; “. . . any group 
e S as a basis for comparison may 
this co for criterion purposes” (p. 342).° In 
a oe also it should be noted that 
should by ion as to whether a test response 
Parent] e included in a scoring key 1s ap- 
sis th Y to be based on a test of the hypothe- 
dlative t the two populations have the same 
ee frequency for that response against 
Ypo ndirectional alternative to the null 
thesis, 
agy by a deviant response Berg appa- 
ans simply any response category (or 
r and un- 
ls for the 
riterion 
d” has 
n pro- 


2 
B 
Recess; A Seems to have created a furthe 
wre, Confusion by his choice of labe 
poup, Ups—and especially by his use of c 
been ae phrase “criterion group metho 
nil es itionally applied to test constructio 
thetase where data external to the test responses 
WO Bro €s are used to identify members of the 
Ate t iy Ps between which differential test responses 
th Dorman Eht. Where the responses of a general 
ase see population are being contrasted with 
Within > 22 identifiable subpopulation contained 


s ji . 

we Toa is not uncommon to refer to the 
pich ion as the criterion group—a convention 
tison, “TB has completely reversed. Where com- 


0 
Oe Sten to be made between mutually exclusive 

th external, differential criteria are 
Soup, a to denote or identify members o 
y conf, n this latter case it would seem desirable 
Mey? ter Use the situation by alluding to one group 
thoq m properly applicable by virtue of the 
On, employed to either member of the com- 
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class) for which the relative frequency of 
occurrence in one group differs from that in 
some other group being considered, and 
nothing more. 

One final semantic clarification is called 
for—that which stems from the implicit dif- 
ferentiation of “critical” and “noncritical” 
areas of behavior in the statement of the 
Deviation Hypothesis. By critical (or “sig- 
nificant”) areas of behavior are meant those 
particular qualities which serve to determine 
membership in a given class (the criterion 
or the deviant groups) whereas noncritical 
behaviors are all those which, while (pre- 
sumably) also containing valid differential 
indicators for the class, are not so conceived 
nor ordinarily used by the community for this 
The basis for distinguishing between 
kinds of behaviors is apparently 
of the social importance 
attached to each. “Putting it another way, 
response patterns in critical areas of behavior 
have value attached to them in a particular 
culture and they usually bear classificatory 
labels” (p. 339). In addition to the cultural 
importance criterion (usually with an attend- 
ant verbal labeling of the class), it is ap- 
parently also required that all, or at least 
many, of the critical behaviors be jointly 
present in all members of the class. For’ 
example, “criminals,” “delinquents,” “execu- 
tives,” and “supervisors” are all given as 
examples of culturally valued and labeled 
classes but ones in which behavioral hetero- 
geneity of the members is so large that such 
d not serve as either criterion 


purpose. 
the two 
partly a function 


categories coul 
or deviant groups. 
961) does not actually give explicit defini- 
tions of either critical or noncritical as areas 
of behavior. The above is an attempt ata somewhat 
more precise statement of the criteria by which the 
two kinds of behavior are distinguished as I under- 
stand it from the informal discussion and examples 
Berg does present. One gets the uncomfortable feel- 
ing, however, that what really serves to define valid 
behavioral categories” in particular cases is not a 
prior application of both or either of these m 
criteria. Rather, what seems to be crucial is whether 
or not some given aggregate of persons, however 
related and labeled, can be separated from some 
her group by an empirically constructed scoring 
T for responses to some set of superficially ir- 
an (noncritical) stimuli. If this 7 a a 
the Deviation Hypothesis, whatever its 0 a virtues 
or faults, becomes very tightly circular indeed. 


3 Berg (1 
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Let us now consider some of the attendant 
notions or consequences of the Deviation Hy- 
pothesis which Berg (1961) presents (pp. 336- 
337, 365-372). First, he argues, that one of 
the consequences is “A definition of deviant 
responses in simple and operational statistical 
terms—no hunches, clinical intuition, judg- 
ments, ratings, or other subjective appraisals 
are involved” (p. 336). This statement must 
certainly pertain primarily, if not exclusively, 
to deviant responses to potential test stimuli, 
i.e., to those in the noncritical behavior area. 
As just one example, in the study by Barnes 
(1955) which Berg relies on heavily to make 
his case, judgments or subjective appraisals 
were rather intrinsically involved in the so- 
called critical behavior area. The fact that a 
patient has a recorded diagnosis of schizo- 
phrenia or character disorder or something 
else which can be objectively transcribed in 
no sense make this designation independent of 
the basis on which the original diagnosis was 
made—a very highly subjective and judg- 
mental process in most psychiatric settings. 

A second proposition is presented as 

A deduction [sic!] drawn from a vast array of 
evidence that virtually any relatively unstructured 
stimulus content can be used to elicit deviant re- 
sponses. This is not, of course, central to the hy- 
pothesis proper. . . . This is not saying, it should be 
emphasized, that any item is as good as any other 
for a particular purpose. Such an assertion would be 
absurd. But it is saying that for a given verbal item, 

for example, one would [could?] find a design or 
sound that would do as well. At least all of the 


evidence we have thus far garnered forces us to 
that conclusion (p. 336). 


This particular derivative of the Deviation 
Hypothesis, whether taken as a “deduction” 
from the statement that “Deviant response 
patterns tend to be general (italics mine)” or 
whether considered as an inductive inference 
based on the past empirical findings by Berg 
and his associates, is the primary focus of the 
present paper. 

It is somewhat difficult to see why Berg 
feels that this proposition is not central to 
the Hypothesis proper. Assuming that the 
Deviation Hypothesis says anything at all 
which is nontrivial and empirically meaning- 
ful (i.e, assuming that the definitions of 
critical and noncritical behaviors can be 
given independently of the defining character- 
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istics of the criterion and deviant groups In 
some such manner as that suggested above); 
then it would seem that the Deviation et 
pothesis asserts just this proposition—tha 
deviant response patterns tend to be goer d 
i.e., that they occur in both the critical an 
noncritical areas of behavior. It is true that 
the Hypothesis proper does not assert anys 
thing about the relative degree of separability 
that can be achieved between the ceng 
groups by the various possible classes > 
test stimuli. In this regard at least, the 
statement that one could find a design oF 
sound that would do as well as would i 
verbal, content relevant item does k. 
something beyond what is expressed in t : 
Hypothesis itself. It could be argued, how- 
ever, that the Deviation Hypothesis n 
scientific (or better, technological) veg 
tance only to the degree that behavior 1n t 
noncritical areas can be used to effect ne 
relatively substantial separation of the CO 
trasted groups as compared to that obra 
by use of more highly content relev 
stimuli. A somewhat similar analysis of t- 
asserted generality of deviant response P% 
terns and some empirical findings peran 
to this issue have been presented earlier 
Sechrest and Jackson (1962). sation 
The third implication of the Deviatio’ 
Hypothesis which Berg presents first en. 
the generality or pervasiveness of den 
responses but then adds another point— me 
members of any deviant group have a oe 
mon response patterns in both the critical 4 
noncritical areas. “Thus, an engineel, eal 
schizophrenic, or a genius should each P 
atypical responses which form a PAA 
peculiar to his special class. The Pa ets 
would be largely shared by other mem f 
of the same class” (pp. 336-337). The pee 
point mentioned is that the Hypothesis ow 
a conceptual framework which suggests 
a variety of problems, not just those of A 
chopathology, may be attacked. He m 
that the method is applicable to “any “alid 
of behavior which is identifiable by viy 
external criteria” {p. 3637). And ma to 
Berg contends that the Hypothesis serve 
explain a wide variety of findings of & 5 uni- 
ingly diverse sort and thus provides an re- 
fying principle which accounts for suc 
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oo of this later. Let us turn now to 
tief review of some of the empirical find- 


i A 3 A p 
ngs which Berg cites in support of his 
position. 


Some 
ME PRIOR EMPIRICAL RESULTS IN SUPPORT 
OF THE DEVIATION HYPOTHESIS 


Moere reviews a fairly large number of 
ae Investigations, the results from 
tion he interprets as support for his posi- 
crite „Most of the studies employed the 
aen a. group method and for the most 
er, ithe findings are based on responses to 
na Perceptual Reaction Test (PRT). 
on E aa dings are reported, however, based 
items itory stimuli or on more conventional 
ee as self-descriptive adjectives, self- 
ences Statements, and occupational prefer- 
tom Most of these latter findings arise 
Other studies which also employed designs 
rief than the criterion group method. A 
in pe Vey indicates that groups differing 
. Psychiatric diagnosis, in chronological age, 
egree of mental retardation, in amount 
emotional disturbance, and in severity 
Ma ote of physical illness all have been 
PRT _to differ in their responses to the 
iti While many of the findings are 
est ed only in terms of a value for some 
trop istic or the corresponding Type I 
e Probability, in some cases enough data 
the reported either, in Berg’s paper OF in 
original sources cited by him to make 
e © estimate of the degree or magnitude of 

relationships or differences involved. 
comes all means the most dramatic support 
Menti, from the study by Barnes (1955) 
4 Mier earlier, In this study PRT re 
Btou S were obtained from a heterogeneous 
of YP of 1,700 normals and from 546 patients 
fema ed psychiatric diagnoses. Males an 
o were treated separately. From the 
identigo ormas a subset of psychotics was 
Mon ed within each sex group and from 
female these, smaller groups of male and 
additio, schizophrenics were also isolated. In 
ers n, a group of 120 male character dis- 
Seharat, ne also identified and were used 
Partiti ely in some of the analyses. Barnes 
Piee each of the identifiable groups of 
a Sii into two parts. Then by means of 
le cross-validation design he developed 


of 
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empirical scales for contrasted groups on 
each pair of subsamples separately and esti- 
mated their validities on the remaining cases. 
The results reported indicate clearly that 
for all scales developed, the contrasted groups 
were well separated. Even for the one truly 
differential diagnostic key—that contrasting 
male psychotics with male character dis- 
orders—the data reported indicate that by 
use of an optimal cut score it is unlikely 
that the false positive or false negative rates 
would be much higher than 25% to 30%, 
assuming comparable base rates. Considering 
the brevity of the PRT, the apparent ir- 
relevance of its content to psychiatric no- 
sology, and the unreliability and often 
fortuitous nature of differential diagnostic 
classification, this latter finding, at least, is 
indeed impressive. If subsequent studies 
should indicate that equally discriminating 
scales can be built to separate, say, neurotics 
from psychotics, or process from reactive 
schizophrenics, Or neurological organics from 
functional psychotics, then indeed the PRT 
and the Deviation Hypothesis will have to 
be reckoned with by diagnosticians with 
other pet instruments and methods. 

The study to be reported below, however, 
does not pursue this aspect of the problem. 
It is concerned with differential classifica- 
tion, but within the domain of nonpath- 
ological personality organization. 


INVESTIGATION OF THE RELATIVE IMPOR- 
TANCE OF TEST ITEM CONTENT FOR 
THE ASSESSMENT OF NONPATHOLOG- 

ICAL PERSONALITY ATTRIBUTES 


The critical behavior domain employed 
in this investigation derives from peer ratings 
among close associates. An extended series 
vestigations reviewed elsewhere by the 
(Norman, 1963b) which employed 
wn from Cattell’s reduced per- 
set have each demonstrated 
the existence and relative orthogonality of 
five personality factors. The populations on 
which these results are based range from 
Air Force personnel, through college under- 
eraduates to applicants for graduate school 
programs in clinical psychology. The five 
factors have tentatively been designated 
(each by the name of one of its two poles) 


of in 
writer 
scales dra 
sonality sphere 
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Surgency (or Extroversion), Agreeableness, 
Dependability, Emotional Stability, and Cul- 
ture. Details of the structure, comparabilities 
across samples, salient scales for each factor, 
etc. are presented and discussed in the review 
previously mentioned (Norman, 1963b). 
Suffice it to say here that each of these five 
factors is quite clearly defined and easily 
interpreted in terms of polar clusters of 
highly homogeneous behavioral descriptors 
and that this structure was found in each 
of the samples described below. 

The present study was intended to com- 
pare the relative effectiveness of responses 
to content irrelevant stimuli with those made 
to more highly relevant test items in terms 
of their ability to predict the status of in- 
dividuals on each of the five peer rating 
factors. In the first analysis three kinds of 
test stimuli were employed: personality 
descriptive adjectives, occupational titles, and 
figural drawings. The adjectives, constituting 
a set of maximally content relevant stimuli, 
were drawn from a large pool of trait de- 
scriptive terms on the basis of judged high 
relevance to one or another of the five factors. 
The terms so chosen were then paired such 
that each member of a pair was relevant 
to a different factor but was matched with 
the other term in the pair on a particular 
rated desirability stereotype. The 200 forced- 
choice, self-report items constructed in this 
manner were typed in booklet form and 
called the Descriptive Adjective Inventory 
(DAI). 

A set of occupational titles was chosen as 
an intermediately relevant set of stimuli. 
Judgments of the factor relevance of prefer- 
ence responses to each of these stimuli proved 
to be very difficult to make and the judges 
expressed little confidence in their ability 
to do this. Again ratings of the desirability 
of expressed preferences for each occupation 
were also obtained and a set of 60 binary 
forced-choice items were constructed in the 
manner described above. These items were 
then typed in booklet form and named the 
Occupational Preference Inventory (OPI). 

Finally, the set of content-irrelevant 
stimuli used were the 400 figural designs 
which are contained in the Research Edition 
of the Welsh Figure Preference Test 


WARREN T. NORMAN 


(WFPT). In this test, the respondent simply 
records whether he “likes” or “dislikes” each 
of the designs. 

The samples of subjects employed were 
paid male volunteers from a number © 
fraternities, residence halls, and one coopera- 
tive housing unit at the University of Mich- 
igan. Sample I was composed of 82 men 
mostly seniors, from whom the peer nomine- 
tion ratings and test performances were er 
lected in the Spring semester of 1960. ss 
II consisted of 215 fraternity men, mös 
sophomores and juniors, tested in the Se 
semester of 1960. Sample III (used toget 
with Sample II in a second analysis reporte 
below) was composed of 241 residence e 
men, primarily freshmen, sophomores, ri 
juniors, tested in the Spring semester of oe 

Contrasted criterion groups on each ee 
were formed in Sample I by splitting i 
distribution of factor scores (derived fro s 
the peer ratings) at its median. Percëñtots 
of each subgroup on each factor respondin 3 
to each item alternative on each of the Pa 
were computed. From these values Pr 
centage difference discrimination no 
(item validities) and percentage endorsem 
indices were computed. A set of five aa 
liminary empirical scoring keys was co! 
structed for each of the three inventories Ke 
simply including all responses in a given re 
for which the item validity indices we 
greater than a selected minimal value.* 

The answer sheets for each subject 
Sample II were then scored on the sets ai 
preliminary empirical keys just describe 
The correlations among these scales an! 
criterion factor ratings are given in Table re 
The values in boldface and brackets @ 


in 


*This minimal value corresponded approximates 
to a 5% significance level. However, since & ee 
of validity was used instead of a significance, e 
statistic for such a measure, some responses for E 
with nearly even response splits probably had mê, 
nificance levels which exceeded .05 and sates 
especially those with extreme endorsement row 
certainly had Type I error likelihoods well be 
01, even for these small samples, The use © 
measure of validity (instead of merely a test 
Significance) for purposes of selecting keyable ti 
sponse categories for empirical scales is a Lae 
which can be strongly defended for reasons á 
too numerous and too complex to be expanded UP 
here. 


re- 


171 


"papuo syujod pewpaq *(STZ = N 1303) I] adureg uo paseg—aIoN 


e- zs īe— os— (60) s80 T= «SLO (D n 


1I— W- 10 [vo-] 40 _ or— ~LO- 10 A 

OP— ¥2— 90— 60- (90—) 80  60— £0 91— (Þ0—) so mo r= t0- [0-1 <0 =O) LO- M 
so— zz— o 60 (@I-) It  SO- 80  (99—) :0— z0o— z= 90 [ee-]¥I— 10- M IdIM 

SL or 60 L0- (40-) €0- 00 10 T0— (60) +0 st 0 80 [s0] £0 II 


eo zo— zo— 9I— (£0) OI- 60- 60 TT 


»£— 89— 6 b (se) +0 sz- 


z €T S4— 0 (Þ@ z0 so— 4z— z0- [so-] FO _ 10 a= A 
st¢— æ=- 4- o- (0) so r 92- eo- [s0-]+0 10 m 40 

4I- 80 so q (20) 10 eo oo- 6o [40] w- H 

60 60- 90— z0- (ez st 0- 10— 90 [e0 I 


9¢ w- ğa [oT] et w= LO- n A 
s9— ól- 9 60— [yz] ez- 0- u AI 
ee w= TT v- 90 e- IM Iva 
w © @ zZz is] vt i 
m— 80— oe— ez- Lor] I 


(£0) z o= or z0 [oo] I 


S è &9 Mo W A 

or se o AL 
$ LM- mM BA 

30 IL 

I 


RELATIVE ImMporTANCE oF TEST ITEM CoNTENT 


A AI WI H I A AI I TW I A AMI Ww n I A Al 
Ida IdO 


m n I 


Iva sune 


Ld:IM ONY ‘Ido ‘Iva AHL YOA SAIY IVONIARY AAVNINITAAJ THI NO SHAXOOS ANV Sauoog AOLoV] ONILVY IHL ONONWY SNOILVITAUOD 


I JTIVL 


172 


cross-validation estimates of validity of the 
scales against the peer nomination rating 
factor scores. The values in boldface and 
parentheses are the correlations between in- 
struments for corresponding factor keys. 

The first thing to note is that the keys 
for the WFPT have essentially zero validities, 
The largest magnitude, that for the Factor 
III key, is in fact negative. The validities 
for the OPI. keys are likewise essentially 
zero with the one exception of the .29 value 
for Factor V. By way of contrast all five of 
the validity coefficients for the DAI keys 
are positive and statistically significant using 
a .05 Type I error level. 

Taken together, these results appear to 
argue rather forcibly against Berg’s conten- 
tion that test item content is unimportant in 
the construction of valid empirical keys. Nor, 
I believe, can these findings be dismissed 
as irrelevant because of some defect in the 
design or procedures employed. If it is 
argued that the size of Sample I is too small 
to be used to develop criterion group empir- 
ical keys, or that the use of median splits 
on the rating factor score distributions mis- 
classifies criterion cases, then how does one 
explain the significant and, in some cases, 
appreciable validities obtained for the DAT 
scales? If the relatively small number of 
available response alternatives on the OPI 
is mentioned, it must be pointed out that 
the WFPT has 800 potentially keyable re- 
sponse categories whereas the DAT has only 
half that number. Thirdly, if it is suggested 
that perhaps the peer nominations do not 
yield valid, “operationally clean” criteria for 
identifying the contrasted groups, one must 
point out that the factor structure based on 
these ratings is exceptionally clear-cut and 
that the sole basis for classifying subjects 
into groups is on the basis of explicitly 
labeled (and undoubtedly socially valued) 
behavioral homogeneity as judged by groups 
of intimate associates. And finally, the point 
made by Berg that the stimuli must not be 
too highly structured so that response sets 
are free to operate has certainly been met 
by all three devices and especially by the 
WFPT whose stimuli are at least as un- 
structured as those on the PRT. 

Although the analysis just discussed was 
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hardly encouraging with regard to the 
potential value of further work with empir- 
ically keyed figural preference responses for 
the measurement of the five rating factors, 
there were several reasons to pursue this 
approach at least a bit further. Berg and 
his associates had, after all, generated a 
rather large number of apparently successfu 
demonstrations of this approach—to be RE 
with other groups, and in other kinds 0 
settings, but with similar stimuli. A secon 
and more immediate reason was the nature 
of the research program in which we vo 
engaged. The general problem under attac 
was to develop individual performance meas- 
ures of these five factors that would be 
Suitable for use in a personnel selection con- 
text where the peer nomination method was 
administratively unfeasible. Because appli- 
cants for the training program for whic 
these tests were to be used as selectors are 
highly motivated to be accepted, some 7a 
surance was needed that applicants woul 
not be able to fake their responses to oe? 
admission. Since the peer nomination scal 
defining the factors each have a ae 
discernible Positively valued pole (in th 
setting, at least), simple self-ratings on thes 
scales or self-reports on typical invento 
scales were expected to be highly vulnerable 
to any attempts to dissimulate responses: 
One possible approach to this problem, © 
course, would be to employ test stimuli whic 
bear no apparent relation to the variables t° 
be measured.’ If such stimuli could be use 
to develop scoring keys with substantia 
validities, then the problem presented by ĉ 
set to “fake good” in a respondent would Þe 
greatly diminished, Accordingly a secon 
analysis was done. d 

The 456 subjects from Samples II an 
III were divided by peer rating groups int? 
a pair of cross-validation samples of 228 men 
each—hereafter referred to as Samples 
and B. These two samples were matched 28 

"A quite different approach to this problem and 
one which permits the use of content relevant tes 
stimuli, has been described elsewhere (Norman, 
1963a). The evidence so far available indicates that 
the method presented in that paper holds consider- 
able promise for an effective solution of the proble™ 
of faking in at least certain kinds of assessment 
contexts. 


| 


{ 


RELATIVE IMPORTANCE OF TEST ITEM CONTENT 


oud as possible in terms of residence 
ied or dormitory), academic class 
ioe and similarity of their criterion 
data Pe en based on the peer rating 
mous] ach sample was then split trichoto- 
30% Sia each criterion factor with about 
an the sample in each of the extreme 
grou na the remaining cases in the middle 
the aa each item response category on 
two oth E a on the DAT (as well as on 
Bh er inventories which need not con- 
and th z ere); overall endorsement indices 
iscrimi Ne matrices of percentage difference 
mous thd indices among the trichoto- 
Dishes one for each of the criterion 
Separately fo computed. This was done 
teste 7 for each sample and for each 
yo e discrimination indices (between the 
extreme criterion subgroups) for the 
ata aay as calculated from the Sample A 
ample E Plotted against those obtained from 
ndicateq ` a inspection of the scatter plots 
ig and the relationship to be moderately 
actors EOS HVE TOP each of the five criterion 
items h n addition, appreciable numbers of 
% i ad discrimination indices exceeding 
Some n magnitude in both samples and in 
to Bee they extended as high as 30% 
ian For this test, items for which a 
One g ination index of 15% or higher on 
sam ample reversed in sign on the other 
ese dl were extremely rare. Quite clearly, 
Empiri ata could be (and were) used to build 
ical scoring keys for which validities on 

Vali Ye criteria would be maintained in cross- 
bee, tion. The results of that effort have 
a tae elsewhere (Norman, 1963a) 
as not be repeated here. y 

Wag ever, when the corresponding analysis 
One for the items of the WFPT, the 


Teg, 

ony S were completely discouraging! a 

tha, Were the discrimination indices gow 
in 


for items 

otha. OF the content relevant iten 

rate Mventories analyzed, it was extremely 
to find one as large.as 15% between e 


8 oe 
unggah subject actually took each dent ties: act 
acey, standard instructions tO be as Hon dae 
to f ate as possible and secondly under ue n n 
ad H G the most desirable responses er e T 
We Sion to the training progra™ zere wa > 3 
the Will be concerned here only with a anal yis o 

ata obtained under the first Set ° instructions. 
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high and low criterion groups. What is 
worse, such items as were found with 
validities in this range on one sample were 
as likely as not to show a drop in validity 
to zero or even a reversal of sign in the 
other sample. The scatter plots of the item 
discrimination indices between the two 
samples were, in fact, almost perfectly cir- 
cular and the marginal distributions looked, 
for all the world, like random error dis- 
tributions. It was perfectly clear that any 
attempt to construct scoring keys from such 
data in the hope of being able to cross- 
validate them was a fools enterprise and 
consequently the effort to utilize such test 
materials for our purposes was abandoned 


at this point. 


DISCUSSION OF RESULTS AND THE QUESTION 
OF THE THEORETICAL SIGNIFICANCE OF 
THE DEVIATION HYPOTHESIS 


Whatever other implications may be drawn 
from these findings, one thing certainly is 
clear—the term “general” in Berg’s formula- 
tion of the Deviation Hypothesis cannot be 
taken in anything like the unqualified sense 
in which he has used it. And in at least 
certain kinds of personality assessment con- 
texts, the use of relevant test item content is 
of rather considerable importance. 

But if one grants these points (as I believe 
he must in the face of the data presented 
above), the far more intriguing questions 
only briefly alluded to earlier follow at 
once. In what particular kinds of assessment 
contexts, for which classes of subjects, and 
with what degree of success can which 
specific kinds of noncritical behaviors be 
employed to measure personality or interests 
or any other critical attributes or status 
variables? For purely pragmatic purposes, 
such detailed knowledge is of the essence if 
we are ever to emerge from the “let’s-try- 
everything-and-see-what-works-this-time” ap- 
proach to test development. In this connec- 
tion, the extensive bibliography which Berg 
and his associates (Berg, 1961) have gen- 
erated is bound to be useful—especially as it 
continues to accumulate and is reported in 
increasing detail. 

But in a wider sense is this sufficient for 
anything but the most immediate applied 
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or technological purposes? Does even a very 
extensive compendium of such findings—even 
when subsumed under some such statement 
as that presented in the Deviation Hy- 
pothesis—constitute an adequate theoretical 
structure for an empirical science? In my 
opinion the answer is, No! Even assuming 
that enough data are eventually forthcoming 
so that we will know under what conditions, 
for whom, and with what degree of success, 
which stimuli can be depended upon to 
yield valid assessments of which behavioral 
phenomena—at that point we will have our 
“engineer’s handbook” and little else. What 
is more crucial in my opinion is that by any 
consistent adherence to the distinction im- 
plied by the conceptions of critical and non- 
critical areas of behavior, we will have ex- 
plicitly precluded the development or con- 
struction of any theoretical system capable of 
yielding an adequate “explanation” of be- 
havioral phenomena. What is the distinction, 
after all, between the terms critical and 
noncritical when applied to areas of behavior 
but an admission of the incompleteness of 
our present knowledge? An adequate empir- 
ical theory, from the present viewpoint at 
least, must be more than a mere catalogue 
of what goes with what. Nor does it seem 
particularly fruitful to codify and perpetuate, 
in the major postulate of a formulation pro- 
posed for this purpose, a distinction that 
simply reflects our current naivete and social 
values. On these grounds, it seems to me 
that whatever pragmatic virtue the Devia- 
tion Hypothesis might have for goading 
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researchers to explore new possibilities for 
useful test stimuli, its purported theoretical 
value to account for or explain specific = 
havioral phenomena or relationships is, 2 
the very best, illusory. 
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CLIENT LIKABILITY: 
A VARIABLE IN THE STUDY OF PSYCHOTHERAPY * 
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Wisconsin Psychiatric Institute, 


A study designed to investi 
and the relationship of CL t 
as to their likableness by 10 raters 
the therapy of each of the clients. 
successful and less successful ca 


familiarity with the cases were employe 
reliably by the familiar raters beyond 
clients were liked significantly greater t 
the .05 level, and (c) CL correlated sig 


of Rogers’ Process Scale. It was concl 
bly related to this variable. 


and that success in therapy is possi 


na background for this study is a very 
Tip and personal reaction to clients. 
clients Stated, this reaction is that for some 
Passionate can feel immediately more com- 
One PA and optimistic, while to others 
More nE less personally moved and also 
study iscouraged about being helpful. This 
of thi attempts to determine the generality 
own is reaction, by having raters give their 
teco a onal liking for clients, as heard in 
0 is ed therapy interviews. The purpose is 
action if such a personal and subjective re- 
vestis, can be reliably measured and to Im- 
in „eate the relevance of client likability 
Psychotherapy. 
Pos operationally stated the research 
i rted here investigates the following: 
Pi The reliability of a scale to measure 
nt likability. 
- The relationship between 
ngs of clients and the succes 
nts in therapy. n 
- The factor of familiarity with the clients 
the Pe raters and the likability ratings for 
se clients. 
es The change in likabilit 
Interviews. 
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the likability 
"ati s of these 
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gate the rateableness of client likability (CL), 
o success in psychotherapy. 10 clients were rated 
from 2 recorded segments taken from 
The clients were classified into more 
tegories prior to this study. 2 levels of rater 
d. It was found that (a) CL was rated 
the .01 level, (b) the more successful 
han the less successful clients, beyond 


ificantly with the Experiencing Strand 
uded that CL can be reliably rated 


5. The relationship between the client’s 
rated likability and a measure of the client’s 
in-therapy behavior as reflected by Rogers’ 
Process Scale (Walker, Rablen, & Rogers, 


1960). 
PROCEDURE 


ere given 20, 2-minute, taped seg- 
d client interactions drawn 
Prior to this 


Ten raters Wi 
ments of therapist an 
from 10 recorded therapy cases.” 
study five of the cases were classified as more 
successful and five as less successful, on the 
criteria of therapist ratings of outcome, patient 
ratings of outcome, and a self-concept Q sort. 
Two segments were used from each case, one from 
an early interview and one from a late interview. 
The 20 segments were presented to each judge in 
a random order; this was made possible by having 
each 2-minute segment on a separate tape spool.$ 
The raters were then given the following instructions 
with the 20 segments, 


Instructions for the Rating of Client 
Likability 

After listening to the taped segment of the 
client and therapist interchanges, make a rating on 
the continuum from liking to disliking of the 


client. 
You may make a mark at any place along the 
scale, you are not confined to the points that are 


numbered. 


2The 10 clients rated in this study were from 
10 fully recorded cases from the University of 
Chicago Counseling Center. 

3 These 20 segments were drawn from a large 
pool of segments of these 10 recorded cases used 
by Tomlinson and Hart (1962) in a validation study 


of the Process Scale. 
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Scale point number one (1) is for a positive 
liking reaction to the client, while a check mark 
at number six (6) would be a disliking reaction 
to this client. Marking any place along the scale 
between these two points will represent the mag- 
nitude of your liking or disliking, depending on 
the closeness to the end of the scale. 

Often it is our experience that we have feelings 
and reactions to people, but do not necessarily 
draw our attention to these feelings. This rating 
task asks of you to look at the Specific liking or 


disliking feeling that this client brings out in you, 


and to mark on the scale the point that best 
describes your reaction along this continuum. 
The 10 raters that made their likability judg- 


ments were all people with some experience as 
therapists. Five of these raters had Previously made 
ratings on the Experiencing Scale (one of the scales 
in the Process Scale) on the same clients and had 
heard many segments from the recorded interviews, 
The other 5 raters were hearing these clients 
for the first time. The distinction will be made 
between these two groups of raters as the familiar 
and unfamiliar raters, respectively. 

A summary of the design of this study is as 
follows: there are 20 segments drawn from 10 
therapy cases, on several criteria prior to this study, 
5 of the cases were classified as more successful 
and 5 as less successful. From each case there 
is a segment from an early interview and one 
from a late interview. Of the 10 raters, 5 were 
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RESULTS 


Table 1 presents the summary of be 
analysis of variance of the likability ratings. 
It was found that the more successful clients 
received a significantly higher mean likability 
rating than the less successful clients Ca 
versus 3.5). In addition, it was found a 
the five familiar raters gave significantly 
higher likability ratings than the five W 
familiar raters (2.8 versus 3.3) 8 s 

No significant change in likability W2 
found from early to late therapy interviews: 
The more successful clients were more like 
early in therapy by raters and they main 


à 3 ul 
tained this advantage over the less success! 


clients. ; pa 
The results of rater reliability of this SU 
jective rating task are indicated in Table he 
Table 2, below the diagonal, presents a 
interrater reliability for the familiar se 
which indicates that the raters who bie 
familiar with the clients showed good agree 


A nder 

4 The author wishes to acknowledge A. Alcea 
of the Wisconsin Psychiatric Institute for his 

Sultation on problems of design and analysis. 


ith 
familiar with the clients from previous process 5 The Likability Scale ranges from 1 to 6 W 
rating and 5 were unfamiliar with these clients, liking at 1 and disliking at 6. 
TABLE 1 
SUMMARY OF THE ANALYSIS oF VARIANCE OF THE LIKABILITY RATINGS = 
Source df SS MS F 
Group (A) * 
(More successful-less successful) ii 36.98 36.98/a 8.56 
Time (B) 
(Early interviews-late interviews) 1 41 A1/c = 
Rater familiarity (C) 1 16.25 16.25/b 6.09" 
AXB 1 -08 -08/c TE 
AXC 1 2.42 2.42/c a 
BXC 1 -40 -40/c ae 
AXBXC 1 12 .72/c Fi 
Error terms 
a Subject’s within groups 
(More successful-less successful) 8 34.60 (a) 4.32 (a)/c — 
b Raters within familiarity 8 21.39 (b) 2.67 (b)/c = 
c Residual 176 1,390.45 (c) 7.90 (c) 
Total (V — 1) 199 


1,503.70 


*p< .05. 
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TABLE 2 


INTERRATER RELIABILITY OF FAMILIAR AND UNFAMILIAR RATERS ON THE LIKABILITY SCALE 


F G H I J 
A .08 36 ad AT F 
X B | .52** 9 04 26 |G 
Familiar Unfamiliar 
C | .47* 53** -60** $3 |H 
raters raters 
A D | .73** .70** 42" 4o* | I 
E | .75** 36 wart 73° T 
A B € D E 
Note.—af = 
"> 18 
BE 


ian as to whom they liked and whom they 
is ed. The average rater reliability for 
is b group of familiar raters was .58, which 
eyond the .005 level. 
able Contrast, the unfamiliar raters were un- 
e to show this degree of reliability. Table 
+ above the diagonal, presents the matrix 
peed reliability coefficients found among 
relight miliar raters. The average rater 
si ibility for this group is -30 which is 
nificantly lower than the average reliability 
e familiar raters. DE. 
Cal © familiar raters had made Experiencing 
ti € ratings on these clients several months 
is to the time of the likability ratne 
. IS made it possible to determine the re'a- 
penship erwan client likability and ae 
tes SS Scale variable. Before presenting the 
pilts. of the correlations between these two 


TABLE 3 


Co 
Tet LATIONS FOR THE FAMILIAR RATER 
E Likaprury AND THE EXPERIENCING R 


S BETWEEN 
ATINGS 


Correlation 


Familiar a 
raters coefficients 
A —.63** 
B —:61** 
Q —.37* 
D —.51* 
E =,10"* 
Note. gF = 
we < ome 


b <io. 


scales a word of introduction to the Experi- 
encing Scale. 

The concept of experiencing is one of the 
central notions in Rogers’ Process Scale. The 
details of this concept can be found in 
Gendlin and Tomlinson (1962); Gendlin 
(1961) Walker, Rablen, and Rogers (1960), 
and Tomlinson and Hart (1962). In brief, 
the Experiencing Scale deals with the degree 
to which the client focuses on his personal 
experiencing and this ranges from “remote- 
ness from experiencing and unawareness” 
at the low end of the scale with “decreas- 
ing remoteness and increasing awareness” at 
the middle range and “lives in process of 
experiencing and uses it as a major referent” 
at the upper end of the scale (Gendlin & 
Tomlinson, 1962). 

The five familiar raters who made the 
Experiencing ratings and the Likability 
ratings had a significant correlation between 
these two scales. That is, the more the client 
is liked the higher the process rating and the 
more he is disliked the lower the process 
rating. Table 3 presents these correlations 
which ranged from —.37 to —.70. 


Discussion 


The problem of scaling something so vari- 
able and subjective as the degree of liking of 
a client just from a taped segment of the 
interview raises the question of reliability, 
The average rater reliability of .58 for the 
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familiar raters indicates that some consistent 
cues are being used in making these judg- 
ments. This degree of rater reliability in- 
dicates that client likability may be a general 
characteristic on which raters can agree, that 
is, once they have become more familiar with 
the client. 

The finding that the familiar raters liked 
the clients significantly more than the un- 
familiar raters may indicate that raters 
change in their attitude toward the clients 
as they hear more of their therapy. 

In addition, since the raters have all had 
experience as therapists and are familiar with 
a variety of client behavior, their judgment 
of liking may depend on what they consider 
to be “good” in-therapy behavior. This inter- 
pretation is supported by the significant cor- 
relations found between the client’s rated 
likability and his rated level of process on 
the Experiencing Scale. These findings point 
to a possible source of biasing in the rating 
of psychotherapy material. 

The clients whose experience of psycho- 
therapy is less successful may be the people 
who are much more difficult to like and it is 
this behavioral characteristic of degree of 
likability that may be related to the less 
successful outcome of therapy. Rogers (1957) 
has presented the theoretical formulation that 
if certain therapeutic conditions are provided 
by the therapist, and in turn if these con- 
ditions are perceived by the client, then con- 
structive Personality change will occur. 

It is possible that the therapist may be 
unable to provide effective therapy to a 


Norton STOLER 


client he cannot like and the client who has 
had little experience in life in being liked 
may be unable to perceive that the therapist 
really cares for him. Future concentration 
on the phenomenon of Client Likability as 
an interactive factor in the relationship be- 
tween the therapist and the client may ac- 
count for some of the variance that con- 
tributes to the effectiveness of psychotherapy- 

In conclusion, this study raises two ques- 
tions for research in psychotherapy, One, 
Whether the variable of Client Likability is 
a “halo effect” that contaminates the validity 
of outcome criteria and process measures; 
and two, whether Client Likability is @ 
relevant personality variable which is & 
causal factor in the success of psychotherapy: 
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NOTES AND COMMENTS 


PSYCHOTICISM OR TEN PSYCHOTIC SYNDROMES? 


H. J. EYSENCK 


University of London 


Iı . 
met recent article, Lorr, McNair, Klett, 
rating pct Poets report a factor analysis of a 
cluding Ta ule for 296 psychotic inpatients, con- 
Were ire ten postulated psychotic syndromes 
ey also oo in varying degrees” (p. 189). 
Support of eclare that “there is no evidence in 
single s Eysenck’s contention that there is a 
F bape factor of psychoticism” (p. 
data, a : neither conclusion is justified by their 
the tle examination of their failure to use 
} meee que of factor analysis properly may be 
“ie Universally agreed that to obtain evidence 
sented et factor, this factor must be repre- 
e. equately (a) in the tests used, and (b) 
Second. Population studied. It is clear that no 
found ee factor of general intelligence will be 
IQs. vE all the subjects tested have very similar 
k Varia ere there is no variance there can be no 
fatter ce The proper test of the existence of 
Whic of psychoticism requires a population In 
on Fa ae have high scores, others low scores 
a S factor; the hypothesis cannot be tested 
Population in which by definition every 
On k 's high (probably almost uniformly high) 
S factor. Clearly no correlations can be 


Bene 
Optimal @ under these conditions, which m 
do not on discovering group factors, a 


existence permit of a proper examination — 
Such gp Of a general factor of psychoticism. In 


choti Studies as have disclosed this factor, psy- 
ubjects 


have” reurotic, and (usually) normal s 
1953p Ways been included (Eysenck, 19522, 
and p, 1955, 1960; Trouton & Maxwell, 1956). 


by ip adequate sampling than that provide 
Accept b et al. (1962) is a prerequisite if any 
Me ei le conclusions are to be derive 
a aia or otherwise of @ factor ° 
Ar bee used to define this factor have usu- 
a, = of an objective, performance type, 
Eyson ey CMiniscence in pursuit-rotor learning 
visa © 1961), mental speed (Eysenck, 1953), 
Senge Pity and accommodation (Eysenck, 
Placing ” & Brengelmann, 1955), pegboard 
fom 8 (Eysenck, 1952b), object sorting, per- 


n . 
ce speed and overinclusiveness tests 


d about 


(Payne & Hewlett, 1960), and so forth; they are 
characterized by the fact that they give scores 
on which there is continuity between normal, 
neurotic and psychotic subjects. Lorr et al. used 
ratings on the majority of which a positive entry 
would suffice to classify the subject as grossly 
abnormal; such ratings are inadequate to dis 
close the existence of a factor of psychoticisr: 
should it exist, which is based on the hypothesis 
of continuity (Eysenck, 1952a). In both respects 
(tests used and population studied) the experi- 
ment under consideration fails to be relevant to 
the conclusion reached; it was not designed to 
investigate Eysenck’s hypothesis, and it did not 
enable any conclusions to be reached regarding it. 

Neither does the discovery of 10 “syndromes” 
appear to be a justifiable conclusion from the 
data reported. Most of these factors seem to be, 
not syndromes validated, but symptoms shown 
to be rated reliably. Tests and items used for a 
factor analysis should be logically independent; 
there is no such independence in the items here 
used. Consider the following items which not 
unnaturally hang together in one factor: “Speaks 
to voices”; “Reports voices accuse or blame 
him”; “Reports voices tell him to do things”; 
“Reports voices threaten him.” While not en- 
tirely identical, these items are so similar that 
they cannot be regarded as anything but 
tautologous; correlations between them have the 
status of reliability coefficients. It is not advisable 
to include such reliability coefficients jn what 
purports to be a factor analysis devoted to the 
discovery of “syndromes.” It is possible that 
similar syndromes would still be discovered even 
after the removal of repetitive items of this 
kind, which abound in the scale, but it is im- 
possible to arrive at any secure judgment on the 
basis of the report. We must infer that the 
conclusions drawn by Lorr et al. are not justified 


by their own data. 
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CANONICAL VARIATES AND SECOND-ORDER FACTORS: 
A REPLY 


MAURICE LORR, DOUGLAS M. MCNAIR, C. JAMES KLETT, 


r 1 
and JULIAN J. LASKY 


Veterans Administration Hospital, Washington, D, Ç. 


Eysenck argues that our statement, “There is 
no evidence in support of Eysenck’s contention 
that there is a single second-order factor of 
psychoticism,” js unjustified. His arguments are 
for the most part based on a confusion or 
blurring of the difference between (a) second- 
order factors in the Tucker-Thurstonian sense 
(Thurstone, 1947, 


obtained from the correlations among the first- 
order factors, and i 


needed to describe differences 
on gq quantitative variables 
Eysenck has utilized 
virtually all of his demonstrations concerning 
psychoticism as an independent dimension. 
Eysenck (1962, p. 50) at times recognizes the 
differences between the two approaches when he 
approvingly quotes Slater as saying there is little 
justification for expecting factor analysis vectors 
to coincide with those defined by discriminatory 
analysis of the same data. With this we agree. 
But in his writings Eysenck repeatedly refers to 
both types of dimensions as though they were 
identical (vide Eysenck, 1953, p. 53; 1960, p. 
15). For example, of Trouten and Maxwell’s two 
first-order factors he Says, “These are glearly 
identifiable as neuroticism and psychoticism, 


1950). 
variates in 


ffiliated with Veterans 
1 thors are also a c 
eoa Hospital, Perry Point, Maryland. 


tors 
implying the presence of second-order fac 
at the first-order level (Eysenck, 1960, Tec 

The present authors, in their analysis, factors 
strated the presence of three second-order dromes 
among the correlations of the primary syn qorder 
and simply inferred that a single secon inter- 
factor was insufficient to account for ae was 
correlations. On the basis of this inference erning 
concluded that Eysenck’s assertions cone were 
psychoticism as a second-order dimension por 
not tenable. Eysenck’s other arguments in i jr- 
of his between-groups dimensions are simp ingle 
relevant to the issue of the existence of a a be 
common second-order factor. It would the his 
well for Eysenck to clarify his thinking a 
language in discussing psychoticism in the fu has 

In passing it can be said that Eysenck a 
never demonstrated that a single psychot! e 
factor is sufficient to account for differences 
tween normals, neurotics, and psychotics. He or 
merely shown that for the particular ton a 
five arbitrary tests administered two canon a 
variates are sufficient to account for difer 
between the groups. Introduction of say six aa 
Cognitive tests could easily require two ee a 
canonical variates, As Cattell has pointed A 
meaningful typing demands a sampling Pa 
specified domain, which Eysenck does aa 
The canonical variate dimensions identifie ups 
Eysenck are possibly useful in separating Ete 
but tell us very little about the nature of 
groups separated. 


NOTES AND COMMENTS 


panes second line of argument is broader, 
Rea eet intended to sway the reader un- 
to oh ni test theory or factor method, and 
ee the enemy. Any novice knows that, 
rae of chance error, a certain degree of 
iring and is essential in measurement. In meas- 
a test ability to multiply, items included in 
peie the to a certain extent tautologous. One 
ens y reliable item would be suflicient to dif- 
Scale ar persons at any point along an ability 
Syndr ikewise in psychopathology an excitement 
od rome is manifested in hurried speech, rapid 
a Movements, elevated mood, attention seek- 
voice controlling behavior, a loud, _ boisterous 
Ese and by an attitude of superiority. Are 
Brou elements tautologous? Inspection of the 
Ree ee reported in the article (Lorr, McNair, 
» & Lasky, 1962) and of the original scale 
a Klett, McNair, & Lasky, 1962) should 
or ies any skeptic. The internal consistencies 
i75 e scales defining the syndromes range from 
a AeA Eysenck’s assertion would require 
(42 gher values. The majority of the scales 
dichote of 77) were nine-point continua and not 
is a omous as Eysenck erroneously states. It 
Sari Snt that a dichotomous item does not neces- 
Which Tequire an assumption of discontinuity 
Compl 5 not made by the authors. Eysenck also 
ynde etely ignores the important fact that the 
sent Oomes were postulated in advance and repre- 
the findings from more than a dozen reports 
(lost) E (1952), Guertin (1952), Wittenborn 
orr » Lorr, Jenkins, and O'Connor (1955), and 

: O'Connor, and Stafford (1957). 
Carele mary, Eysenck has asserted (perhaps 
(called 2, that (a) a single canonical variate 
Psych, _Psychoticism) is sufficient to separate 
is a Otics from normals and neurotics} (b) there 
Shee ee second-order factor common to syn- 
Wo : descriptive of psychotic symptoms. These 
assertions are not identical and have been 
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confused by Eysenck. Our assertion concerning 
psychoticism refers only to the latter. The asser- 
tion that the syndromes isolated are “invalid,” 
mere tautologous congeries, has no substance. A 
similar charge could be made, with as little 
justification, against virtually all factor studies. 
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BRIEF REPORTS 


INTRAINDIVIDUAL VERBAL-NUMERICAL DISCREPANCIES 
AND PERSONALITY } 


ARNOLD E. DAHLKE AND 


University of Minnesota 


Intraindividual differences in verbal and numer- 
ical abilities have been related to personality 
characteristics and to psychopathology, Dana, 
Dahlke, and Mueller (1959) used extreme Verbal 
(V) and Quantitative (Q) SCAT scores; 
“control” subjects had smaller difference scores, 
Groups were compared on 22 MMPI scales; 14 
of 176 ¢ tests were significant and no estimation 
of expected number under the null hypothesis 
was given. Blind analysis of group profiles indi- 
cated greater subjectivity, repression, projection, 
and distorted thinking for extreme verbal subjects 
regardless of sex. These results were consistent 
with past research, However, percentile difference 
Scores were used with no control for inequality 
of percentile units, Contrasting particular per- 
sonality variables rather than entire Profiles 
makes determination of chance expectation dif- 
ficult. Percentile Scores may be used with careful 
selection of groups. A split-plot analysis of vari- 
ance design for repeated measurements employs 
the entire profile, 

Using this design on 425 freshmen, SCAT V 
and Q scores were converted to Percentiles, and 
percentile distributions were divided into three 
parts, V and Q low (0-29), medium (30-70), 
and high (71-100), Combinations resulted in six 


selected randomly from each of these 12 groups. 
MMPI records were scored for the validity and 
clinical scales using K corrections and T scores, 


1 An extended report of this study may be obtained 
without charge from Richard H. Dana, Department 
of Psychology, West Virginia University, Morgan- 
town, West Virginia, or for a fee from the American 
Documentation Institute. Order Document No. 7402 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress; Washington 

5, D. C. Remit in advance $1.25 for microfilm 
A for photocopies, and make checks payable 
to: Chief, Photoduplication Service, Library of 


Congress. 
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West Virginia University 


separate profiles, Profile heights were tested = 
the ratio of the Between Groups mean seba 
to the Subjects Within Groups mean aes 
Profile shapes were tested by the ratio oh en 
Groups X Scales mean square to the Subj Be- 
X Scales Within Groups mean square. The into 
tween Group mean square was broken oen a 
a set of eight orthogonal contrasts. Each con ean 
was tested by the Subjects Within Groups E 
square. None of the contrasts approached n 
nificance except between the extreme verbe P 
numerical groups for the clinical scales W her 
the extreme verbal profile is slightly aes 
(b < .10). The Groups X Scales sum of ae 
was broken down into Scales x each of the ae 
set of contrasts and each component was te ea 
by the Scales X Subjects Within Groups pal 
square. The validity profiles of the kignen ee 
and numerical groups are significantly di cs o 
in shape (p < .025); the difference in siap a 
the clinical profiles for these groups eee 
significance (p < .10). The difference in 5 ups 
between the extreme verbal and numercial Sidity 
also approaches significance (p < .10). VAT 
and clinical profiles of the high numerical qan 
in the upper percentile range and the high are 
merical group in the lower percentile sanea sis 
significantly different (p < .005). Blind ane - 
of profiles and identification of V or Q class 
cation occurred, ults 

Previous studies are supported. The res 0 
cannot be explained by the absolute standing, 
the verbal percentile since only one of three Je 
V classifications attained significance. The on 
size permits generalization to the population fae 
which it was drawn although broader general 
tion is questionable, 
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DISCRIMINATIONS WITHIN THE DELINQUENCY CONTINUUM 
ON GOUGH’S SOCIALIZATION SCALE* 


GAYLORD L. 


Fort Worden Diagnostic and Treatment 


hae (1960) claims that discriminations 
Using hi e continuum of delinquency can be made 
study te. Socialization (So) scale. The present 
inquent stea this claim in a sample of 58 de- 
dicted th oys aged 14-9 to 17-10. It was pre- 
vists th at So scores would be lower on recidi- 
commit ee nonrecidivists; that high So scorers 
and that Ta serious offenses than low scorers; 
Placed in igh So scorers are more likely to be 
interna] ən open institutional program where 
ized, controls and responsibility are empha- 


I ae 
re ot, two questions of interest were 
ret First, would there be any difference 
delinquents. between “social” and “solitary” 
ic aii s, following the research of Randolph, 
e So on, and Johnson (1961). Second, does 
tween acd possess discriminant validity be- 
and betwe ligence test scores (WAIS or WISC) 
(SD), een a socially desirable response set 
Tesults showed no difference in So scores 
€n recidivists and nonrecidivists (¢ = .92), 
secure. difference between those boys requiring a 
ings Setting versus those placed in open set- 
(t= 1.25). There was a complete absence 
1 An 
taine; a 


ex: 


extended report of this study may be ob- 


without charge from G. L. Thorne, 1141 East 


Os 
th © Lane, Phoenix 14, Arizona, or for a tins 
Order Docu- 


merican Documentation Institute. 
Bale 7401 from ADI Auxiliary Publications 
Bross. ' Photoduplication Service, Library of Con- 
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Make for microfilm or $1.25 for photocopies, and 
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of relationship (r = .001) between So scores and 
a scale measuring severity of delinquent acts. 
Those getting high So scores were as likely to 
commit such offenses as assault, fire setting, and 
rape as those getting low scores. 

There was a difference found between social 
and solitary delinquents, the latter getting lower 
So scores (f= 2.01, p < .05). This would sup- 
port claims that there are reliable differences 
between these two types of delinquents, and it 
appears the dimension warrants further study 
and ultimately integration into a theoretical 
framework. 

The So scores correlate only .08 with IQ scores 
(WAIS or WISC), but an r of .51 (p < .001) 
was found with SD scores. This suggests that the 
concept of intelligence cannot be substituted for 
the present one of socialization, whereas the 
concept of a socially desirable response set can 
be substituted moderately well for that of 


socialization. 

The present findings would not support claims 
that the So scale can discriminate between insti- 
tutionalized delinquents. It appears that the 
clinician working in a delinquency institution 
might find it difficult to use the So scores even 
for screening purposes. 
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TYPE OF BRAIN DAMAGE AND INTELLECTUAL FUNCTIONING 
IN CHILDREN? 


DANIEL V. CAPUTO, WILLIAM E. EDMON 


STON, Jr, LUCIANO L’ABATE, 


AnD SAMUEL R. RONDBERG 


Washington University School of 


Recent studies of children with minimal brain 
dysfunction and with diffuse brain damage have 
indicated the need for a more consistent assess- 
ment of their intellectual functioning. By elimi- 
nating the influence of intellectual leye] from 
subtest patterning, through covariance analysis, 
one may obviate the masking of subtest differ- 


turbance in children, children with seizures dis- 


than children with 
nonseizure brain damage, and (b) that there are 
differences in Particular areas of intellectual 
functioning between two specified types of brain 
damage and controls, 

Brain damaged subjects for this study con- 
sisted of two independent samples of children 
from an outpatient clinic (Sample I) and from 
a county health department (Sample II) who 
had been administered the WISC or WBI. Each 


1An extended report of this study may be ob- 
tained without charge from Daniel V, Caputo, 
Department of Psychiatry and Neurology, Washing- 
ton University School of Medicine, 4940 Audubon 
Avenue, St. Louis, Missouri, or for a fee from the 
American Documentation Institute. Order Document 
No. 7400 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress; 
Washington 25, D. C. Remit in advance $1.25 for 
microfilm or $1.25 for Photocopies, and make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 


Medicine, St. Louis 


t 
functioning, the results tend to support ot Pm 
hypothesis. Seizure groups were e i 
between control groups and nonseizure ned 
damage groups, although some of the differe for 
between means are not significant. Evidence 
our second hypothesis is equivocal. The pei 
Concerning particular subtests, after cover ae 
analysis, indicate that the seizure groups 1n from 
samples were Virtually indistinguishable was 
their controls. Only one significant difference ring 
found in Sample II, the seizure groups pat 
significantly higher than controls on the Obj o 
Assembly subtest, Children with seizures es, 
not appear to suffer marked decrement in 
Specific area of intellectual functioning. howed 

The nonseizure brain damaged groups § i in 
a significantly lower Picture Completion ae 
comparison with controls in both Sample 0 
Sample II after covariance analysis (5 yor 
Two other subtest scores, Arithmetic an lower 
cabulary, were found to be significantly i 
in the nonseizure brain damaged rol o 
Sample I. The differences between our tors: 
samples may relate to at least two FAE 
(a) the socioeconomic level of the subj 
and (b) the referring agency. the 

In summary, our data lend themselves te call 
following interpretations: (a) on Wechsler “dren 
Scale, Verbal and Performance IQs chil on- 
grouped as “seizure brain damaged” and E 
seizure brain damaged” do not differ tor aA 
another, although both groups tend to differ hil- 
matched groups of emotionally disturbed dae 
dren, (b) children with “nonseizure brain ptest 
age” yield lower Picture Completion su hil- 
scores than matched emotionally disturbed l 
dren, and (c) Block Design and Digit Sym 
subtests, while they may be quite sensit vi to 
organic pathology, seem to be highly relate 
Psychopathology as well. 


(Received June 20, 1962) 
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I 2 
it a Peigi; study (Wallach & Strupp, 1960) 
erapy erved that as patients’ motivation for 
parent in creased, concomitant shifts were ap- 
investigato, nber of dependent measures. Other 
ieee ave also pointed to the dimension 
; motivation as being one of the most 


E rel 
abl A 
Other ee to correlate with a variety of 
fur €asures. This study aimed to explore 


t 
ion Copel and use of the concept of 
Š for therapy. It is important to note 
ny concern was to elucidate the 
fedical $’ conceptualizations of the concept. 
al students, 71 interviewing 151 patients, 
tients St ts residents interviewing 182 pa- 
Of the oat questionnaires upon termination 
Calleq Asus aaa interviews. The questionnaire 
Plans, a clinical estimates, proposed treatment 
toward = the interviewer’s personal reaction 
Š e patient, Lastly, there was the estimate 
tateq apc motivation for psychotherapy 
tyne ite a seven-point scale and a completion 
Cues ee in which the interviewer specified the 
Patienp,  Pioved in deriving his estimate of the 
S motivation for psychotherapy. 


intervie 


1 Thi 
Hans IS research was supported by a grant (M-2171, 
National sienn; Principal Investigator) from the 
Service, 'nstitute of Mental Health, Public Health 


n 
Wit oun ned report of this study may be obtained 
of Dey anten from Martin S. Wallach (Department 
Chapel Hat North Carolina Memorial Hospital, 
A erica: ill, North Carolina) or for a fee from the 
No, 7309 Documentation Institute. Order Document 
Photody imu ADI Auxiliary Publications Project, 
Was in P gation Service, Library of Congress; 
microfilm a 25, D. C. Remit in advance $2.50 for 
Payable t or $1.75 for photocopies, and make checks 
gi Tek Chief, Photoduplication Service, Library 
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JUDGMENTS OF MOTIVATION FOR PSYCHOTHERAPY: 
SOME FURTHER EXPLORATIONS 1 


MARTIN S. WALLACH 


School of Medicine, University of North Carolina 


A factor analysis yielded two major factors, 
and it was demonstrated that estimates of motiva- 
tion for therapy are a summary construct in the 
complex of impressions about a patient. However, 
this central role appears limited to situations in 
which the decision has been made to initiate 
therapy, and the task at hand is of evaluating 
the patient as a therapy participant rather than 
as a potential candidate. 

Medical students tended to rate patient motiva- 
tion as greater than did residents. This finding 
is believed to reflect the medical students’ rela- 
tive face value acceptance of patients’ overt pleas 
for help. Psychiatric residents, on the other 
hand, seem to have learned that the defenses 
of emotionally disturbed persons tend to oppose 
the correction of painful symptoms or patterns 
of behavior, and thus they are more skeptical of 
overt pleas for assistance. Medical students were 
concerned with assessing need for help whereas 
residents were more set to assess the patient’s 
ability to participate in therapy. Members of 
both groups equally often made reference to the 
same kinds of cues and dimensions in estimating 
motivation. The 5 (of 31 coded) categories of 
cues most often cited were the patient’s stated 
desire for help, the referral source, the patient’s 
conception of his psychiatric illness, reference 
to keeping the diagnostic appointment, and the 


patient’s willingness to expend effort to get 


therapy. 
REFERENCE 
Warracs, M. S., & Strupp, H. H. Psychotherapists’ 
clinical judgments and attitudes towards patients. 
J. consult, Psychol., 1960, 24, 316-323. 
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A FURTHER FOLLOW-UP COMPARISON OF 
PSYCHOTHERAPEUTIC PROGRAMS? 


GEORGE w. FAIRWEATHER 2 AnD RALPH SIMON 3 
Veterans Administration Hospital, Perry Point, Maryland 


In an earlier publication (Fairweather, Simon, three dia 
Gebhard, Weingarten, Holland, Sanders, Stone, the hospital (y? = 11.45 p< 01) employment 
& Reahl, 1960) four psychotherapy programs (2 = 9.40 pe 01) ho fends ( 2 = 11.63, 
were compared for nonpsychotic, short-term and pe 01) “and ‘a i — 3 cess (= 7.98; 
long-term psychotic patients. The four contrasted pe 05), resent aa A th continue during 
treatments were: a control consisting of indi- the nee ie monie f aami living. AS ” 
vidual work assignments; individual psycho- the 6-month talion ae adaptive com- 
therapy and individual work assignments; group munity adjustment mer the short-term psy- 
psychotherapy and individual work assignments; chotic group with iting a E sychotic patients 
and a group living situation consisting of group showing the sabe ates pa and the non- 
activities. Comparisons were made on a number psychotics fallin bet = eae a Correlations 
of diverse inventories, projective devices, be- between 6- and 5 i i = 5 items show 
havior rating scales and 6-month follow-up com- that eight of th monty fa kow-up o significan 
munity adjustment. This study continues the at the 01 level Thee. i ag e 53. Such 
follow-up comparisons at 18 months. The ques- high correlations i Ma drom, “fi st 6 months 
tionnaire consists of nine items administered to of communit tin chan tae al and consti- 
86 of the original 96 subjects. The items concern: tute a sudi s menl Ei o establish 
(a) employment, (b) rehospitalization, (c) alco- significant RP ength of TF O iet study 
holic indulgence, (d) antisocial behavior, (e) Fairweather p trends. In tig oa meas? 
friendships, (f) interpersonal communication, correlated Senile Ain 1300s, Iy ont com- 
(g) general adjustment, (%4) problem behavior, munity adaa SEY miih ange 6 months. 
and (i) degree of illness. a Again, at fhe i ge a aa na one measure 

By the eighteenth month, no significant dif- correlates significant] mon S, O is follow-UP 
ferences obtain among the four treatment pro- criteria. The ] t ntly with any o t correlates 
grams in any area of community adjustment. The 50 (p < 05) a gee therapy ery 2 spital a” 
significantly greater percentage of patients in the -49 (p< 05) m remaining out of e isap“ 
three psychotherapy conditions employed during pearance , f sie being employed. ifferenc® 
the first 6 months when compared with the y iar nth nificant employment di ye 
work condition disappear by 18 months. On the fects en of oh oe that psychoihatry for 
other hand, significant differences between the chronic r T ant ee a who 

*An extended report of this study may be ob- ‘eturn to the hospital willie 18 months at H 
tained without charge from George W. Fairweather, Tates of 72.5 and 55.6 percent, respectively. a 
Chief, Social-Clinical Psychology Research and Serv- seems that for these groups a social syst€ 
ice Program, Veterans Administration Hospital, Palo aimed at maintaining them ia the community 


Alto, California, or for a fee from the American needs to 
Documentation Institute. Order Document No. 7457 he developed, 


from ADI Auxiliary Publications Project, Photo- REFERENCE 


P io of 
gnostic groups on remaining out 


duplication Service, Library of Congress; Wash- Eu 
ington 25, D. C. Remit in advance $1.25 for micro. FAIRWEATHER, G. W., Svon, R, GEBHARD, M D 
film or $1.25 for photocopies, and make checks ee REN E, Hottanp, J. L, SANDE ver 
payable to: Chief, Photoduplication Service, Library es of atl pam J. E. Relative! S Cauti- 
of Congress. ae sychotherapeutic programs: hree 
2Now at Veterans Administration Hospital, Palo diferent onaparison of four programs = ma 
Alto, California. atient groups. Psychol. Monogr 
3 Now at National Institutes of Health, Bethesda, 74(5, Whole No. 492). 
Maryland. 
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PATTERNS OF PSYCHOPATHOLOGY: 
Case Studies of Behavioral Dysfunction 


B 5 
Y Melvin Zax, University of Rochester and Geor 


Ak files of numerous specialists and 
high) one the authors have assembled a 
Gr peo and vitally interesting collec- 
demonst case histories, over 100 in all, that 

rate a wide variety of psychopatho- 
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B 
eat E. Gordon, 
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tindine 0? recent theoretical and research 

clinical 2 porh experimental psychology and 
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Sonality prde a detailed analysis of per- 
Y tunctioning and formation. 


P 
ERSONAL ADJUSTMENT: | 


of ypPProach Through the Study 
ealthy Personality 
“cond Edition 


By ç; 

i Sidney M. Jourard, 

leg ny of Florida 

tal k JS an effective, positive eoncept of men- 
b calth, The author begins his discussion 
n Presenting some illuminating commentary 
H jenifcant modern views of personality. 
Constitui elaborates on his own theory of what 

utes a healthy personality. 


D 
ARNAMICS OF 
OLESCENT ADJUSTMENT 


P Thomas F. Staton, 
ù ntingdon College 

k i practical manual on adolescent be- 
think? Dr. Staton explains how teen-agers 
to » and feel, why they act as they do, how 
€cognize when they need help, and how to 


he] = 
m, 


The Macmillan 


6 ‘ 
0 Fifth Avenue 


ge Strickler, Goucher College 


logical phenomena. Organized into chapters 
enting various diagnostic syndromes, 
the text discusses each syndrome, follows with 
several pertinent cases, and concludes each 


chapter with integrative material. 
Paper, $3.50 
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MENTAL HYGIENE 
FOR TEACHERS: 


A Book of Readings 


Edited by Lester D. Crow, 
Brooklyn College, 

and Alice Crow, 

formerly of Brooklyn College 


These readings provide a practical and chal- 
lenging presentation of the principles of men- 
tal hygiene. The prevention of mental illness 
and the preservation of mental health are 


stressed. 


STUDIES IN 

ADOLESCENCE 

By Robert E. Grinder, 

University of Wisconsin 

Forty-five selections from the writings of 
authorities on psychology, sociology, and an- 
thropology introduce a variety of viewpoints 
on the analysis of adolescent behavior. 


THE PSYCHOLOGY 
OF ADOLESCENCE 
Second Edition 

By Arthur T. Jersild, 
Columbia University 


Dr. Jersild, continues, in this edition, his 
critically acclaimed analysis of the adolescent 
through the youngster’s personal experience 
of his own existence. 


Company 
New York 11, N.Y. 
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of 


RESEARCH in PSYCHOTHERAPY 


Proceedings of the Second Conference on Research Contributors 
in Psychotherapy held at Chapel Hill, N.C., in May Barbara J. Betz 

1961. Leading research workers in psychology and eae 
psychiatry deal with selected research issues and John M. Butler 

assess new research developments. Their efforts Kenneth Mark Colby 


Jerome D. Frank 
Ernest A. Haggard 
Leonard Krasner 


place research in psychotherapy more squarely 
derstanding interpersonal processes. Henry L. Lennard | 
à 


within psychological research concerned with un- 


Daniel J. Levinson 
oe Maurice Lorr 
The psychotherapist’s tase Labasie 
contribution to the treat- Morris B. Parloff 
Laura N. Rice 
ment process Irwin C. Rosen 
3 s Nevitt Sanford 
Measuring personality change George Saslow 
a David Shakow 
in psychotherapy John M. Shlien 
ate Richard S. Siegal 
Definition, measurement, and William U. Snyder 
A x ST lan Stevenson 
analysis of significant Hans H, Strapp 
iables Alice K. Wagstaff 
var se 
The book is a follow up of Research in Psycho- plus Indexe? 
therapy, published by the American Psychological $3.5 
Association in 1959, which reported on the First Order from: N 
Conference on Research in Psychotherapy held jn AMERICAN PSYCHOLOGICAL ASSOCIATIO 
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Brief Reports 


The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 
page and prepared according to the specifications 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability, 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 


4. Agrees not to submit the full report to another 
journal of general circulation. 
Specifications 


Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 

To insure that the Brief Report will be no 
longer than one printed page, its typescript, 
including all matter except the title and the 


author’s lines, must not exceed 85 lines a 
eraging 42 characters and spaces in ie 
Set the typewriter margins for short lines in 
42 characters, which are 3.5 inches dons 
elite typing, and 4.2 inches long in pica. ust 

The manuscript of the Brief Report ie 
be double spaced throughout. Except for o 
short lines, it follows the standard style, n 
the 1957 revision of the APA P ubice 
Manual. Headings, tables, and references i 
avoided or, if essential, must be counte ac- 
the 85 lines. Each Brief Report must be W 
companied by a footnote in the style pey f 
which is typed on a separate sheet and 
Counted in the 85-line quota: 


2 An extended Teport of this study may b the 
tained without charge from John Doe (giving on 
author’s full name and address) or for a fee ocu- 
the American Documentation Institute. Order Don 
ment No. —— from ADI Auxiliary Publica o0- 
Project, Photoduplication Service, Library of 


e ob- 


$— for photocopies, ang 
checks Payable to: Chief, Photoduplication 
ibrary of Congress, 


Extended report, Because the extended fe 
port is intended for Photoduplication, 4° Je 
not copy to be sent to a printer, its siia 
should differ in several ways from that y 
other manuscripts: (a) The extended ag 
should be typed with single spacing fig- 
economy in duplication, (b) Tables and zt 
ures should be placed adjacent to the s 
which refers to them. A caption shoul 
typed below each figure. (c) Footnotes sh 
be typed at the bottom of the page on W Ha 
reference is made to them. In other respet“ 
the full report is Prepared in the style SP®° 
fied by the Publication Manual. 
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RELIABILITY OF RETROSPECTIVE REPORTS 
OF ADOLESCENCE 


IRENE ROSENTHAL 


Institute of Human Development, University of California, Berkeley * 


with parents, and social and 


Accuracy of memories regarding relation 
were studied for 100 Ss of a longi- 


emotional characteristics of adolescence, 
tudinal study, Interviews, tests, and ratings 


had been obtained for the Ss as 


adolescents and as adults of age 38-40. Product-moment correlations were 
adolescent, and adult variables. Partial cor- 


computed between retrospective, 


relations were computed between retrospe 
adult measures constant, when memori 
data, and also with adult measures. Me 
represented adolescent self-pictures an 


with a considerable degree of accuracy, 
t. In some cases, 


past, an 
her than the correspon! 


adolescence to a lesser exten! 
highly related to the present than the 
with adolescent items ot 


ports of memories of childhood expeti- 
Ment ave served as a basis for the develop- 
and a" f theories of personality formation, 
Cation techniques of psychotherapy- Verifi- 
OWe of the accuracy of these memories, 
Wiss R has been limited to the studies by 
Hag » Stolz, and Macfarlane (1935) and 


of eet, Brekstad, and Skard (1960), both 
whi 4 nestic 


Teporte investigate 
evelo concerning 
Stug Pent of their 
ing ih of the accuracy of mem 

€ types of material obtaine 


d mother’s anam 
the birth and early 
children. Empirical 
ories regard- 
d in psycho- 
elationship 


er, Des i 
With peutic interviews, such as T mt 
Parents, and social and emotiona 

d adolescence; 


{acteristics of childhood an 
me. 


kai agioe at the present ae 
though ies have dealt pr 
Wi gh theories ly childhood and 


relations between ear i at, some 
equent nality developmen 
the personai have involved the 


optical formulations ; 
escent period. An opportuni ; 
athe accuracy of adults’ memories ° 
ergy scence was presented in the 
Suq © connection with the Oaklan ia 
macy. The subjects of this long acs 
nv, who had been studied exte 
ee adolescence, participated } 2 
ay Study which gathered material 
esent lives, and their memories ° 
í i dies 
wN i for Mathematical Stu 
p titute £0 emat 
y RE Stanford University- 


es C 
n’s retrospective account of adolescence 


d observers’ ratings during adolescence 


ctive and adolescent variables, holding 
orrelated significantly with adolescent 


while the women’s memories reflected 
the women’s memories were more 
d in others they were associated 
ding measures. 


lescence.? Thus, retrospective accounts of 
adolescence, obtained after a lapse of 20 
years, could be compared with data which 
were obtained during the adolescent years. 


METHOD 


Subjects 

The Oakland Growth Study began in January 
1932, with subjects who were then in the fifth 
grade of elementary school. These subjects were 
studied intensively during the next 7 years until 
their graduation from high school. In the years 
1958-60, when their average age was 38-40, many 
of the subjects of the original study participated 
in testing sessions, and in a series of interviews 
averaging 8-12 hours in length for each subject. 
Detailed descriptions of the study have been 
published by Jones (1938, 1939, 1959). 

The present study involved 48 men and 52 women 
who were members of the Oakland Growth Study 
during their last 2 years of high school, and who 
participated in the follow-up study. Since all sub- 
jects did not participate in both the interviews and 
testing sessions, the number of subjects studied 
in connection with any one variable was less than 
the total. 


Adolescent Variables 


The adolescent variables of the present study 
consisted of observer’s ratings of the subjects, made 
during their last 2 years of high school, and the 


2 Data for the present study, which is part of 
a long-term longitudinal project of the Institute 
of Human Development, University of California, 
were gathered by Harold E. Jones under a grant 
from the Ford Foundation. 
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TABLE 1 


CONSISTENCY OF QUESTIONNAIRES OVER 1 YEAR 
AT THE Enp or Hicu SCHOOL 


Product-moment 


correlation 

Á 

Questionnaire Men Women 
5. Has fun 43 „55 
6. Is a leader , 58 74 
7. Likes to be alone, imagines 75 98 
8. Parents like the subject’s friends 3 35 


9. Parents treat the subject as child 39 86 


10. Not punished by parents -70 .79 
11. Doesn’t mind parents 58 .82 
12. Wishes-Parents -60 43 
13. Wishes-Neutral 44 33 


14. Fears 63 
15. Tensions 67 71 


Note.—all correlations are significant at the two-tailed .05 
evel, 


subjects’ responses to 
during the same period. 

Ratings. The adolescent ratings, based on observa- 
tions in a free social situation, were made inde- 
pendently on a 5-point scale by three observers. 
For the present study, the average of the two sets 
of ratings, made during the last 2 years of high 
school, was used. These ratings are described fully 
in an article by Newman (1946) where the reliabili- 
ties, ranging from 56-.78 in terms of product- 
moment correlations, are given. The ratings with 
assigned numbers, were: Leader (1), Uninhibited 
(2), Relaxed (3), and Cheerful (4). 

Questionnaires. The study utilized questionnaires 
administered during the last 2 years of high school. 
Since reliabilities were not available for the ques- 
tionnaires, product-moment correlations between 
the measures, given a year apart in adolescence, 
are given in Table 1 to provide an indication of 
the reliability. 

1. Seven items, drawn from the Tryon (1933) 
Adjustment Inventory, gave descriptions of þe- 
havior, and asked the subject to rate himself on 
a 5-point scale ranging from “yes” to “no,” as 
to how much he resembled the boy (or girl) 
described in the statement. (For example, the first 
item, Variable 5, consisted of the Statement “A. is 
a boy who seems to have a lot of fun.”) An average 
was computed for the two adolescent responses to 
each of these items for the last 2 years of high 

hool. Abbreviations of the questionnaire items, 
anil the numbers assigned to the variables were: 
ee t has fun (5), Subject is a leader (6), Sub- 
pul to be alone, imagines (7), Parents like 
acer x = friends (8), Parents treat subject as a 
subject's Subject is almost never punished (10), 
child (9), aes 3 t to mind parents (11), 
and Subject doesn’t wan 


questionnaires administered 


IRENE ROSENTHAL 


” 
2. Two adolescent variables, “Wishes-Parent 
(12) and “Wishes-Neutral" (13) were derived frox 
a test where the subjects were asked to check ea 
they “wished for.” The score, computed for k 
of the last 2 years of high school, was the 
number of items of each type checked. 3 (12) 
The items, comprising the Wishes-Parents 


: re fewer 
test, dealt with the subject's wish to hav tention 
conflicts with his parents, to receive more nts. 


from his parents, and to have more cheerful PAE ave 
The Wishes-Neutral (13) test consisted + i 
items not dealing with parents. Included 
test were wishes concerning recreation, P öl 
or school. For example, one of the items swim- 
Wishes-Neutral test was “I wish I could e score 
ming every Saturday.” The Wishes-Neum sp e 
was used in the study to determine whet nden! 
Wishes-Parents score represented a general “than 8 
for the subject to change his life rather ith 
specific desire for changes in the relation 
his parents. i ey 
3. Subjects were asked to check thing ie 
feared from a list of 61 items which concerne Jaces, 
of things such as injuries to self, high Bosted 
specific persons, being lost, or being Tot 
(Tryon, 1939, lists these fears). The score was 
test, which was the sum of the items checked ool. 
computed for each of the last 2 years of high signed 
The designation for this variable with its 45 
number was: Fears (14), of the 
4. The “Tensions” test, which is a part xieties 
Tryon (1933) Adjustment Inventory, listed lect 
moods, and tensions, and asked the iiy 
reply yes or no in terms of the applica? d by 
the item to himself. The score was ae tests 
finding the sum of the yes responses. In t happy? 
the subject was asked whether he was “estion 
lonely, nervous, easily upset, and other qU eac! 
of a similar nature, Scores were obtained ere 
of the last 2 years of high school. The was? 
used for the test with the assigned number 


F b 
5. A score for social adjustment was obtained ) 
responses (made on a 5-point 
to those items of a longer questionnaire 
dealt with social relations, and which interco" 
significantly with each other in adolescence. 
items dealt with the subject’s feelings of 
ness, difficulties in making friends, and hes! 
in talking to strangers, 
—— 


" the 
ê To obtain data on the concurrent validity of an 
retrospective Wish-Parents test, product-mv" 
correlations were computed between the SCO with 
this test and a rating of “counter identification tated 
Parent” which denoted that the subject had 5 
that he had chosen a way of life opposite ities 60 
Parent of the same sex, The correlations wee ting: 
for the men, and .52 for the women, This "at pe 
devised by Suzanne Reichard and based 0n ntly 
follow-up interviews, is part of a study curre 
being done by John S. Hatfield and Albert 
Shapiro. 


RETROSPECTIVE REPORTS OF ADOLESCENCE 


Adult Variables 


hey adult follow-up study, the subjects re- 
tered ie the questionnaires that had been adminis- 
e kaite th adolescence. They were also rated on 
e adult t! at had been measured in adolescence. 
ures of FEA ee fell into two classes: (a) meas- 
the adult 3 subjects as they were at the times of 
Measures ae study, and (b), restrospective 
atin at dealt with memories of adolescence. 
ratings e of the subjects as adults. The adult 
average made on a 7-point scale) were an 
Y the 3 two sets of ratings: (a) ratings made 
With the nterviewer after completing the interviews 
individuat ee and (b) a rating made by another 
View, R ane basis of the typescript of the inter- 
correlation iabilities, measured by product-moment 
e rati ns, averaged .58 and ranged from 48-70. 
07), y ngs, with assigned numbers, were: Leader 
2o)? ninhibited (18), Relaxed (19), and Cheerful 
R P 
ties pp Pective ratings. Ratings of subjects’ memo- 
Were ba adolescence (made on a 7-point scale) 
contained = on typescripts of the interviews which 
’scribeq a lengthy section where the subjects 
Mary of their adolescence. After reading the sum- 
Subject sane interviews, the rater judged how the 
traits, oo have rated himself on the various 
Mterviey he rater was not the individual who 
th the the subject.) Since the typescripts of 
ings erviews were very long, only one set of 
Cheek Pi: obtained for most of the sample. A 
owever. {fhe reliability of the ratings was made, 
second OF nine men and nine women by having 
e subj individual read the interviews and rate 
een oa The product-moment correlations be- 
The ret e two sets of ratings ranged from 48-87. 
thibtted trees ratings were: Leader (21), Un- 
ult (22), Cheerful (23), and Relaxed (24). 
Present pue ora answered in terms of the 
o ti t hese adult questionnaires were identical 
Were q ests administered during adolescence, and 
Questi, escribed above in the section on adolescent 
th The following questionnaires de- 
e subjects as adults: Fears (25) see 
a variable 14; Tension (26) see adoles- 
adol vtiable 15; Social Adjustment (27) see 
Adult y variable 16. 
tive a retrospective questionnaires. The retrospec- 
during estionnaires were identical to the tests given 
adult ae reseenices but were administered to the 
haq at fra with instructions to respond as they 
tata, are CTA oF senior high school. The following 
had an re answered by the subject as though he 
Swered them during adolescence: 


Dicteg 
adolesce: 


AT 
o r 
obtain data on concurrent validity of the 


Ensio, 

ns Os A 
btained eee product-moment correlations were 
and between the score on the Tensions test 


Si 
for mais of the MMPI which had been obtained 
The Aa in the follow-up testing sessions. 
and Te test correlated .70 or higher with the 
aylor Anxiety scales. 
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Subject has fun (28) see adolescent variable 5. 

Subject is a leader (29) see adolescent variable 6. 

Subject likes to be alone, imagines (30) see ado- 
lescent variable 7. 

Parents like the subject’s friends (31) see ado- 
lescent variable 8. 

Parents treat the subject as a child (32) see ado- 
lescent variable 9. 

Subject is not punished (33) see 
variable 10. 

Subject doesn’t mind parents (34) see adolescent 
variable 11. 

Wishes-Parents (35) see adolescent variable 12. 

Wishes-Neutral (36) see adolescent variable 13. 

Fears (37) see adolescent variable 14. 


adolescent 


RESULTS 
Methods of Data Analysis 


The basic analysis of the study involved 
comparison of memory of adolescence with 
actual adolescent data. To determine whether 
memories were influenced by the present 
rather than the past, comparisons were also 
made of retrospective accounts with the 
present, and of the present with adoles- 
cence.’ To study the relation between the 
retrospective, present, and adolescent vari- 
ables, product-moment correlations were 
computed among all the variables. The vari- 
ables which were included in the correlation 
matrix, with assigned numbers, are presented 
in Table 2. Partial correlations were com- 
puted between retrospective and adolescent 
variables, holding adult measures constant, 
when memories correlated significantly with 
adolescent data, and also with adult meas- 
ures. A level of significance of .05 (two- 
tailed) was chosen as a criterion for testing 
whether correlations differed significantly 
from zero, and the word “significant” refers 
to this criterion throughout the present 
study. The magnitude of correlation needed 
for significance varied according to the data 
being studied. The correlations between 
ratings were based on about 46 cases for 


5A comprehensive analysis of “Personality Con- 
tinuity in the Oakland Growth Study” is in progress 
under the direction of John A, Clausen. A study 
959) of constancy of personality 
jlizing part of the present sample, is 
arable to the present study both because 
data were drawn from the time that 
the subjects were jn junior high school, and because 
the adult ratings were based on shorter and less 


comprehensive interviews. 


not comp: 
the adolescent 
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TABLE 2 


VARIABLES IN THE CORRELATION Matrix 


Adolescent 


Adult 


Ratings 


1. Leader 

2. Uninhibited 
3. Relaxed 

4. Cheerful 


Questionnaires 


5. Subject has fun 

6. Subject is a leader 

7. Subject likes to be alone, imagines 
8. Parents like subject’s friends 

9. Parents treat subject as child 
10. Subject is not punished by parents 
11. Subject doesn’t mind parents 
12. Wishes-Parents 

13. Wishes-Neutral 

14, Fears 

15. Tensions 

16. Social adjustment 


Ratings 


17. Leader 

18. Uninhibited 
19. Relaxed 
20. Cheerful 


Retrospective Tatings 


21. Leader 

22. Uninhibited 
23. Relaxed 
24. Cheerful 


: 5 3 nt 
Questionnaires answered in terms of the prese! 
25. Fears 
26. Tensions 
27. Social adjustment 


Retrospective questionnaires 


28. Subject has fun 

29. Subject is a leader 

30. Subject likes to be alone, imagines h 
31. Parents like subject’s friends 

Parents treat subject as child 

33. Subject is not punished by parents 
34. Subject doesn’t mind parents 

35. Wishes-Parents 

36. Wishes-Neutral 

37. Fears 


each sex while correlations involving ques- 
tionnaires had sample sizes ranging from 
34—43 for the men, and 31-39 for the 
women. 


All adolescent variables covered the last 


2 years of high school except the test, 


“Social adjustment,” for which only 1 
year’s results were available. For the ado- 
lescent questionnaire items, and ratings 
(which were scores on a single variable), 
scores were composited for the last 2 years 
of high school before the correlations were 
calculated. For the tests whose scores were 
sums of a number of items (i.e., the Tensions, 
Fears, and Wishes tests), the scores for each 
of the last 2 years of high school were 
included in the matrix of data used to obtain 
inc tions, and the two correlations for 
correla sate were then averaged to obtain 
ee for the correlation used in the 


study. 


t 
Relations between Memories of Adni 
and Adolescent and Adult Measur 
of Personality 


g Ado 
Memory of Relation to Parents during 
lescence 


As shown in Table 3, responses O” 
retrospective questionnaires dealing wi 
parental relation correlated significantly S of 
responses on several adolescent meas uia , 
parental relation in the case of the ch of | 
but to only a few for the women. PA 

e men’s retrospective questionnaires, © ei 
the one dealing with punishment are 
related Significantly with the identical l 
tionnaire of adolescence. For the women 
two items dealing with memory of disc Pie 
were unrelated to any adolescent mea Ie 
and the remaining questionnaires ea c wi 
related with one questionnaire dealing the 
adolescent parental attitudes, but not tO 
questionnaire which was identical. 


RETROSPECTIVE REPORTS OF ADOLESCENCE 


a sae} whether reports of memories 
A uenced by the present rather than 
. correlations had been computed 
able ro iene and adult measures 
Flach ). The men’s reports of memories 

cantly “A to parents were related signi- 
possibilit some adult variables suggesting the 
sonalities that aspects of their adult per- 
ties of Pie be distorting their memo- 

Possibilit eir parents. To „investigate this 

Duted h partial correlations were com- 

e een the retrospective parental 

ables ane the adolescent parental vari- 

af the adult measures held constant. 
= A ple correlations deviated by less 

Or the rom the original correlations except 

subject odmelation between ‘Parents treat 

a as child and “Not punished” which 

-19 with adult Tensions held con- 


of 
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stant. Thus, the correlations obtained for 
the men between parental retrospective vari- 
ables and adolescent parental variables 
could not be explained on the basis of adult 
personality measures. 

Inspection of Table 3 reveals that the 
men’s retrospective questionnaires were 
associated with adolescent variables other 
than those dealing with parental relations. 
This suggests the hypothesis that the men’s 
memories of adolescent parental relation 
might reflect a general picture of their ado- 
lescent life rather than the specific parental 
relation. Another explanation of the signifi- 
cant correlations obtained between retrospec- 
tive parental variables and adolescent non- 
parental-variables might be, however, that 
both the memories and the adolescent vari- 
ables were related to some adult variables. 


TABLE 3 


2 
RELATION BETWEEN MEASURES OF MEMORIES 


oF ADOLESCENT PARENTAL RELATIONSHIP 
AND ADULT PERSONALITY 


AND MEASURES OF ADOLESCENT 
Product- Product- 
Retr p moment A moment 
OSpective questionnaire Adolescent variables r Adult variables r 
Men 
3 ; x : 
1. P like friends—Q 8. P like friends—Q 54 26. Tensions—Q —.39 
5. Has fun—Q .39 25. Fears—Q —.36 
32 9. P treat as child—Q —.35 
+ P treat as child—Q 12. Wishes-Parents—Q 56 26. Tensions—Q 48 
15. Tensions—Q 46 
| 9. P treat as child—Q 36 
33 10. Not punished—Q —.33 
- Not punished— .32 20. Cheerful—R 39 
ished —Q 5. Has fun—Q 17. Leader—R 37 
34, , 
Doesn’t mind P—Q 11. Doesn’t mind P—Q oo 
l 35 6. Leader—Q : 
| » Wishes-Parents—Q 9. P treat as child—Q 
12. Wishes-Parents—Q : 
Women 
3 P 
a P like friends—Q 12. Wishes-Parents—Q Al 
s P treat as child—Q 8. P like friends—Q —.39 
33. Not punished—Q 
34. Doesn't mind P—Q 20. Cheerful—R —A7 
= 45 18. Uninhiùited--R 45 


35. Wishes-Parents—Q 8. P like friends—Q 


l Note. — R denotes rating; Q d i 
Se ents; enotes rating; Q denotes i 
denotes parents; it, p <.05, two-tailed. questionnaire, 


All correlations are signi 
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To determine whether the correlations be- 
tween the retrospective parental question- 
naires and the adolescent nonparental vari- 
ables could be explained on the basis of 
adult personality measures, partial correla- 
tions were computed between the retrospec- 
tive and the adolescent variables with the 
adult variables held constant. These partial 
correlations were not significant. (For Parents 
Like Subject’s Friends versus Subject has 
Fun, and Parents Treat Subject as Child 
versus Tensions, the correlations became .26 
and .28, respectively, with adult Tensions 
held constant. The correlation between Not 
Punished and Subject had Fun became .18 
with Cheerful held constant.) The fact that 
the correlations of the retrospective parental 
variables with adolescent nonparental vari- 
ables could be attributed to the effect 
of adult variables while those with parental 
adolescent variables could not, indicates that 


memories of adolescent parental relation 
probably reflected the actual adolescent 
parental relation. Further evidence that 


memories of adolescent parental variables 
were related specifically to parental ado- 
lescent relation came from the results ob- 
tained with the Wishes-Neutral test. The 
score on the retrospective form of this test 
correlated with the corresponding score on 
the adolescent test (r =.44) but not with 
the adolescent Wishes-Parents test, just as 
the restrospective form of the latter test 
correlated only with the corresponding ado- 
lescent measure. 

For the women, there was less evidence 
that their responses were determined by ado- 
lescent relation with their parents. Although 
the retrospective items tended to be related 
to adolescent parental items rather than to 
nonparental variables, they did not cor- 
relate significantly with the identical items, 
and the number of correlates was low. In 
addition, one of the retrospective question- 
naires, Wishes-Parents, was related to the 
adult variable “Uninhibited” as well as to 
the adolescent variable. To determine 
whether the adult variable Uninhibited 
could account for the correlation between the 
retrospective Wishes-Parents and the ado- 
lescent Wishes-Parents tests, the partial cor- 
relation was computed with adult Uninhibiteq 
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. as 
held constant. This partial correlation W® | 
30, a nonsignificant value.® 


Adolescent Memories—Dealing with 
Aspects of Adolescence other than 
Parental Relationship 


t 
Tables 4 and 5 present the adolesan 
and adult variables which correlated A 
cantly with retrospective measures of sence 
and emotional characteristics of ade ae 
(The retrospective questionnaire, jll be 
which is not included in the tables, viol 
discussed Separately in a subsequent Se highly 
For both sexes, memories were fairly ence 
related to data obtained during adoles an, 
The retrospective questionnaires, He cont 
Leader, and Alone, Imagines, had sign! cen 
Correlations with the identical adoles iy 
variables (Table 5). Two of the r etrosP i ie 
ratings, Leader and Uninhibited, ae [ 
significantly with the corresponding 4): 
lescent variable for both sexes (Tahe a 
The Tetrospective rating for the 
Cheerful, correlated significantly with 
lescent Cheerful for the men, but ne 
the women. The retrospective rating, ond- 
laxed, failed to correlate with the corresP nd 
ing adolescent trait for both men 


r 
t for 


rela 
women. The fact that some of the co% g 


tions between the retrospective ratings c 
the adolescent ratings were insign! ture 
might be explained by the different met” 
of these two sets of ratings, Thus, the 7 the 
spective ratings were self-pictures ° wer’ 
Subjects, while the adolescent ratings 
observers? impressions, wee 
Significant correlations obtained berate 
retrospective variables and adolescent racy, 
ables could not be attributed to acct!” of 
of memory without eliminating the eed 
adult personality on the reports Of nt 
lescent memories. The fact that signi wee? 
Correlations had been obtained Þet” | 
memories and adult measures of person 
Suggested the possibility that memorles 
being distorted to be compatible wit 

| 


lity? 
re 


the 


The women’s Wish-Neutral test behaved 
manner similar to the Wish-Parent test in t 
to correlate with the identical adolescent teh) Bo 
only significant correlation of this test mpoes” 
adolescent variable was with the item 
mind parents” (r= .51). 
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TABLE 4 


RELATION BETWEEN RATINGS 0 


WITH PARENTS, AND MEASURES OF Abo! 


F MEMORIES OF ADOLESCENCE OTHER THAN THOSE DEALING 


LESCENT AND ApuLT PERSONALITY 


Product- Product- 
R moment moment 
trospective rating Adolescent variables r Adult variables r 
Men 
21. Leader—R 1. Leader—R 38 18. Uninhibited—R 31 
2. Uninhibited—R 38 
6. Leader—Q 36 
4. Cheerful—R 33 
22. Uninhibited—R 2. Uninhibited—R .53 18. Uninhibited—R .49 
16. Social adjustment—Q 45 27. Social adjustment—Q AT 
1. Leader—R 43 
5. Fun—Q 39 26. Tensions—Q — 44 
4. Cheerful—R 39 17. Leader—R 30 
23. Relaxed —R 5. Fun—Q 42 26. Tensions—Q —.50 
24. Cheerful—R 16. Social adjustment—Q 52 20. Cheerful—R 38 
5. Fun—Q 43 18, Uninhibited—R 32 
4. Cheerful—R 40 
2. Uninhibited—R 36 
3. Relaxed—R 34 
1. Leader—R 33 
Women 
21. Leader— inhibited—R 46 17. Leader—R 67 
ii A TR 44 18. Uninhibited—R 59 
6. Leader—Q 41 27. Social adjustment—Q 57 
ú 26. Tensions—Q 43 
22. Uninhibited—R 1, Leader—R 59 a — H 
2. Uninhibited—R 55 + Leader 55 
s Fun—O 49 20. Cheerful—R 37 
6, Leader—Q Ee 27. Social adjustment—Q 34 
4. Cheerful—R 37 
8. P like friends j 
23. p 35 20. Cheerful—R 30 
i oe : es ‘37 20. Cheerful—R 34 
+ Cheerful—R 1, Lea 18. Uninhibited—R 47 
27. Social adjustment—Q 43 
17. Leader—R 33 


fap t 
Note,—P denotes parents; R denotes rating; Q deno! 


tigate this pos- 
were compute! 


adolescent vari- 
constant. 


ae life situation. To inves 
ility, partial correlations 
thee retrospective an 
Dles holding adult measures Ce dif. 
Ince the results (Table 6) followed 4 a 
erent pattern for the men than 
Women, they will be discussed separate y- 
For the men, all partial correlati 
were significant except e 


i tin 
Correlation between the retrospective rating 


es questionnaire. 


All correlations are significant, p < .05, two-tailed. 


Relaxed and the adolescent questionnaire Has 
Fun with adult Tensions held constant. Thus, 
the relation found for the men between retro- 
spective and adolescent variables could be 
attributed to accuracy of memory. 

The partial correlations obtained for the 
women present an inconsistent picture of the 
effect of adult personality on memories of 
adolescence. The retrospective rating and the 
retrospective questionnaire for the trait, 
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TABLE 5 


RELATION BETWEEN QUESTIONNAIRE MEASURES OF 


MEMORIES oF ADOLESCENCE, OTHER THAN 


THOSE DEALING WITH PARENTS, AND MEASURES OF ADOLESCENT AND ApULT PERSONALITY 


Product- Produc 
moment momen 
Retrospective questionnaire Adolescent variables r Adult variables r 
Men 
28. Fun—Q 16. Social adjustment—Q 42 
1. Leader—R 38 
4. Cheerful—R 37 
5. Fun—Q 37 
2. Uninhibited—R 35 
9. P treated as child—Q —32 
29. Leader—Q 6. Leader—Q AL 
30. Alone, Imagines—Q 7. Alone, Imagines—Q .55 27. Social adjustment—Q “i 
9. P treat as child—Q 46 18. Uninhibited—R 39 
4. Cheerful—R —.42 26. Tensions—Q $ 
2. Uninhibited—R —.41 
1. Leader—R —.41 
1S; Tensions—Q 34 
Women 
41 
28. Fun—Q 1. Leader—R 47 27. Social adjustment—Q : 
5. Fun—Q a7 
2. Uninhibited—R -36 5 
3 
29. Leader—() 6. Leader—Q 64 17. Leader—R 35 
1. Leader—R 44 18. Uninhibited—R 35 
2. Uninhibited—R 42 27. Social adjustment—Q ; 
30. Alone, {magines—Q 10. Not punished—Q -50 
9. P treat as child—Q 37 
7. Alone, Imagines—Q 36 


Note.—P denotes parents; R denotes rating; Q denotes questionnaire, 


All correlations are significant, b <.05, two-tailed. 


Leader, and the retrospective rating for Un- 
inhibited had insignificant partial correlations 
with adolescent ratings but had significant 
partial correlations with adolescent question- 
naires. For the retrospective questionnaire, 
Has Fun, and the retrospective rating, 
Cheerful, all correlations became insignificant 
when adult measures were held constant. 
For the retrospective variable Relaxed, how- 
ever, the partial correlation With the ado- 
Jescent rating Leader, holding the adult 
rating Cheerful constant, was a ee 
Since no consistent picture emerged Tom 
these partial correlations, the question 
whether adult personality affects memories 
of social and emotional characteristics for 


women remains unanswered in the prese”t 
study. be 
The last retrospective questionnaire tO ne 
discussed, the Fears test, differed from t g 
questionnaires presented in the precedin 
sections both in content and in relations Wi 
adolescent and adult measures. The retro 
spective form of this test was not correla 
significantly with the adolescent test, but bis fs 
highly related to the adult test for both a 
(r= .64) and women (r= 775)... The nes 
Correlation between the adult and retrosper 
tive forms of the Fears might be due ee 
tially to the fact that the two forms wé 
given successively in the testing session. 


$ 
\ 
i 
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DiscussIon 


oe memories of their relationship 
aan eir parents, and of their social and 
= ie characteristics during adolescence 
adoles ound to be significantly related to 
an cent self-pictures, and to observers 
sas obtained during adolescence. For the 
d x however, the results of the study 
a. give a consistent picture of the 

n between memory and actual data of 
adolescence, 


The lack of association found for the 
women for the retrospective measures of 
parental relations, as contrasted with the 
considerable association found between 
memory and adolescence for nonparental 
variables, indicated that inaccuracy in 
memory was related specifically to the area 
of parental relation. Since the number of 
parental measures was low, however, this can 
only be a tentative conclusion. In addition, 
the fact that the relation between the 


TABLE 6 


PARTIAL CORREL. 


ATIONS BETWEEN RETROSPECTIVE AND ADOLESCENT 


VARIABLES WITH ADULT VARIABLES HELD CONSTANT 


-e 
Variables being correlated* y 
Partial 
Retrospective Adolescent Adult variable held constant* correlation 
Men 
30. Alone R z A : 53* 
» Imagines—Q 7. Alone, Imagines—Q 27. Social adjustment—Q j 
a lone, Imagines—Q Y Alone, Imagines—Q 18. Uninhibited—R Sit 
30, Alone, Imagines—Q 7. Alone, Imagines—Q 26. Tensions—Q 52 
30, Alone, Imagines—Q 4. Cheerful—R 27. Social adjustment—Q 36* 
ab. Alone, Imagines—Q 4. Cheerful—R 18. Uninhibited—R 34* 
a Alone, Imagines—Q 4, Cheerful—R 26. Tensions—Q 35* 
1 Leader—R 1, Leader—R 18. Uninhibited—R .32* 
21, Leader-R 6. Leader—Q 18. Uninhibited—R 32" 
A Uninhibited—R 2. Uninhibited—R 18. Uninhibited—R 40" 
2. Uninhibited—R 2. Uninhibited—R 27. Social adjustment—Q 39% 
ae Uninhibited—R 2. Uninhibited—R 26. Tensions—Q 33% 
SA Uninhibited—R 16. Social adjustment—Q 18. Uninhibited—R A6* 
oy Uninhibited—R 16. Social adjustment 27. Social adjustment—Q 50" 
2 Uninhibited—R 16. Social adjustment 26. Tensions—Q 42 
a Relaxed—R 5. Has fun—Q 26. Tensions—Q 26 
oe Cheerful—R 16. Social adjustment—Q a Chemi mR a 
~ Cheerful—R 4, Cheerful—R . Cheerful 
Women 
28, E 27. Social adjustment—Q 32 
28, E ao 1. Leader —R 27. Social adjustment—Q .28 
29. Lead pog 5. Has funQ 17. Leader—R .59* 
29, pena ó. Leader—Q 27. Social adjustment—Q .57* 
29, TE 6. Leader—Q 27. Social adjustment—Q 32 
21. T.. t—Q 1. Leader—R 17, Leader—R 26 
21 eet 1. Leader—R 17. Leader—R Kips 
21 iene G bee 8 27. Social a e P5 
21, Tesda = ae 26. Tensions— 3 
rR 6. Leader—Q Benga: 
oD Saint hirer: 2. Uninhibited—R 18. Uninhiite’ ae 
22. Uninhibited -R 5. Has fun—Q 18. Uninhibiise = oo 
22 wipe 7. Social adjustment Q 43 
+ Uninhibited—R 5. Has fun—Q 27. aoe 
Relaxed—R 1. Leader—R 20. Cheerful—R kyi 
24, Cheerful—_R 1. Leader—R 17. Leader—R 28 


X denotes rating; Q denotes gera i 


a 
* 
< .05, two-tailed. 
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women’s retrospective and adolescent ratings 
could be attributed to a considerable extent 
to adult personality leaves fewer data than 
were available for the men on which to base 
the conclusion that nonparental memories 
tended to be accurate. (Although the rela- 
tions between retrospective and adolescent 
ratings could be attributed to adult person- 
ality, the interpretation that memories were 
accurate was not excluded by the results.) 


Comparison with Previous Studies 


Adults’ memories of adolescence in the 
present study- corresponded more closely to 
the past than mothers’ memories concerning 
the birth and early development of their 
children, as studied by Pyles et al. (1935) 
and Haggard et al. (1960). The results of 
the previous studies suggested that retro- 
spective reports would have been less ac- 
curate in the present study than in the 
other studies since, in the previous studies, 
memories for facts were more accurate than 
for measures such as attitudes, wishes, and 
anxieties, which are more comparable to the 
measures of the present study. There are, 
however, important differences between the 
study reported here and the other studies. 
First, the memories in this study concerned 
the subject himself, rather than his children. 
Secondly, both previous studies dealt only 
with women, and the results of the present 
study indicate that women’s memories are 
related less to the past than the memories 
of the men. 

Unlike the previous studies of anamnestic 
reports, the present demonstration of some 
validity of memories lends some support to 


IRENE ROSENTHAL 


theories of adolescent personality develop: 
ment which have been based to a large oe. 
on retrospective interviews and que 
naires. The demonstration, however; of E 
accuracy in memories in some areas, sici 
that of the adolescent parent relation in ing 
case of the women, suggests caution 1n sei 
Tetrospective data both in the formula 
of theories of personality, and in i 
which attempt to relate adult persona 

characteristics to adolescent development. 
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48 female Ss ranked photos before and af 
andardized. S controlled the verbal exchange. 


found when P disagreed than when he agreed. 
t the degree of this change. All Ss allotted more 


(P) whose comments were st 
Greater opinion changes were 
Social orientation did not affec 


time to P than to themselves. Self-oriente: 
talked less, and interaction-oriented Ss ta 
Ik-time differentially. Disagreement did not change 
Ss’ speech patterns over trials. Agreement ii 


action-oriented Ss over trials while self-oriente 
tudes among groups toward the experi- 


agreement did not affect ta 


questions yielded differences in atti 
ment and P. 


an ic research is of special interest to 
tributio; psychologists for its potential con- 
clinica] ns to a better understanding of the 
endea process. The present experiment 
with vors to link verbal conditioning research 
on studies on the effects of reinforcement 
siid, articipation in small groups in order to 
andy the role of a subject as both speaker 
In stener in a controlled dyadic interaction. 
ent most studies on the effects of reinforce- 
of aoe behavior, the subject’s range 
Su e verbal behaviors is. limited and 
ua Me ge by the experimenter. The 
mits a yadic situation differs in that it per 
Teinfor speaker to affect the behavior of the 
Speaker © partner in at least two ways: The 
Peinfore may obtain increased numbers © 
tion 0 peng when he increases the propor- 
the co a critical response class, changing 
ol ice of his speech but not the overall 
rearty of his verbal output (Kanfer & Mc- 
Yy, 1962). A second strategy, often ob- 


1 Th; 
Ga ai study was suppor 


nt M-3700 (C1) from 


ted in part by Research 
the National Institute of 


ental Health, U. S. Public Health Service, F. H. 
or. The data were col- 


anfen, principal investigator. “ne č y 
ed at Louitiana State University in Spring 1961. 
e authors wish to thank O. Hubert Campbell and 
ete S. Leventhal 

ysis of the data. 


ter discussion with a male partner 


d Ss talked most, tas! <-oriented Ss 
Iked least. P’s agreement or dis- 


served in two-person interactions, consists of 
solicitation of reinforcing statements from the 
listener. These verbal behaviors, classified as 
mands by Skinner (1957), may be manifested 
by the speaker’s seeking of reassurance, Or 
confirmation of his opinions. Such “reinforce- 
ment-soliciting” behaviors may be subject to 
modification by the same variables as speech 
content (opinions), Or they may relate to 
relatively invariant personality characteristics. 

From a viewpoint of group dynamic theory, 
the progression toward solution of a group 
problem also takes on importance. When the 
subject is exposed to comments in which the 
partner states agreement about the subject’s 
opinions, the subject’s attempts to influence 
are reinforced. Especially if both participants 
are of equal peer ratings and a specific task 
is to be solved, a rational model of group 
interaction suggests that the subject is en- 
couraged to communicate more frequently 
and for longer durations if his partner ap- 

roves his efforts (Bass, 1960, Ch. 7). While 
reinforcement theories stress the social and 


emotional effects, group dynamic theory em- 
phasizes the value of contributions toward 


a solution. 
The purpose O 
gate the effects © 


f this study was to investi- 
f social reinforcement, and 
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the subject’s social orientation on (a) dyadic 
speech patterns, and (4) verbal content 
(opinions) when the partner’s opinions were 
predetermined, and the subject was given full 
control over frequency and durations of the 
partner’s contributions. The relationships 
under investigation were studied by examina- 
tion of the changes in the subject’s opinion 
after a series of interactions and the amount 
of time which the subject allotted to himself 
and his partner during the verbal exchange, 
for the subjects whose partners agreed or 
disagreed with their statements, and for the 
subjects who differed on Orientation In- 
ventory scores (ORI) (Bass, 1962). 


METHOD 
Subjects 


The subjects in this study were 48 volunteer 
female undergraduates. 


Apparatus 


The experimental room is shown in Figure 1, 
On the table before the subject were two keys. 
The left key was marked Talk and the right key 
was marked Listen. Plywood partitions separated 
the subject from her partner, who was the experi- 
menter’s confederate. On the partner’s side a slide 
projector was on the table. When a picture was 
shown, the projection surface was a 25X25 inch 
area on a light green wall, approximately 6 feet 
from the subject. Below the projection area two 
lights were mounted. A red light was wired to 
the key marked Talk and a green light was con- 
nected to the subject’s Listen key. The subject had 
a free view of the picture but she could not see 
the partner or the experimenter. Behind the experi- 
menter’s partition a large clock, accurate to 5 
second, timed each trial. Two smaller clocks were 
connected in series with the subject’s Talk and 
Listen keys and their respective lights. Adjacent 
to the smaller clocks were two signal lights, in 
series with the keys. The small clocks cumulated 
the time on each key for each trial. In this way 
the experimenter recorded duration of each key 
depression, frequency of key depressions, and dura- 
tion of silence (when neither key was depressed), 


The Experimental Pictures 


i xperimental pictures of faces were chosen 
R used by Stritch (1954). A trial con- 
sisted of a 4-minute period for discussion of the face, 
At the beginning of each trial the subject ranked 
10 adjectives typed with regard to the face. At the 
end of each trial the subject ranked the same 
adjectives again. The rank difference correlation 
obtained for each subject served as an indication 
of the subject’s shift as a result of discussion with 
the partner. The adjectives had been used jn 
previous research by Campbell (1961) and were 
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taken from Osgood’s list of traits. They va 
intelligent, friendly, relaxed, mature, trusting, adapt- 
able, thrifty, fair, sociable, and calm. The sequence 9 
Picture presentations over five trials was randomize 
for each subject. 


Orientation oj the Subjects 


Bass (1960, Ch. 8) proposed three ideal typ 
of group members: self-oriented, interaction-orien u H 
and task-oriented. He argued that these types ve 
be reinforced by different effects, The task-orien ri 
subject would behave in a manner directed pE 
getting the job done; attempts to influence aj ed. 
toward group effectiveness would be reinfore i: 
The interaction-oriented subject would direct jous 
attention to maintaining conflict-free harmon ea 
interactions; behavior producing pleasant ieee 
action would be reinforced. The self-oriented su a 
would be reinforced by effects satisfying his pers 
needs for domination and ego enhancement rega! o 
less of whether his behavior was contributing 
the group's efiectiveness, The ORI attempts o: 
isolate these types, The subjects choose the Wi 
and least Preferred of 27 triads of alternatives vats 
test-retest reliabilities of .75 for the three ii e- 
scores. Systematic differences in volunteering 10 
havior, in discussion group behavior, and at W 
have been found, supporting the validity © 
ORI (Bass, 1961) 2 

In the present study the subjects used top 
those 48 of 65 subjects tested who scored in the ian 
quartile of one of the scales but below the me ac 
on the other two scales, They were type d) 
cording to their highest score (eg., self-oriented/ 


Procedure h 

Three groups of subjects were preselected: nee 
Scorers on the Self scale, high scorers on the 4 0 
action scale, and high scorers on the Task sche 
ORI. Half of the subjects were given agreer cls 
(positive reinforcement), the remaining sub) 
were given disagreement (negative reinforcemó e 
by the partner, On each of five trials the su ne 
discussed the picture with the partner petm a 
her initial and final rankings. The measure 0 n 
dyadic speech pattern consisted of the duration falk 
frequency of the depressions of the subject’s ‘by 
and Listen keys, Opinion change was measured l 
the change between the subject’s first and the 
ranking of the faces. In order to standardize nts 
partner’s behavior three male graduate a ther 
Practiced 18 pilot runs which were discarded. Ne ge 
the partner nor the experimenter had any knowle 
A ls 


3 ry = asi 
? The test in Bass (1961) includes such iteng 
1. One of the greatest satisfactions in life 1s: 


A. Recognition of your efforts (5) 
- The feeling of a job well done (T) 
C. The fun of being with friends (1) 


2. I think I do my best when: 
A. I work with a group of people who 
are congenial (1) 
B. I have a job that is in my line (T) 
C. My efforts are rewarded (S) 
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OP 


Projector 


Clocks 
O Listen Light 
@ Tolk Light 


Fic. 1. Sch 


o 
a ie Subject’s ORI scores. Assignment to the two 
rar imenta] treatments was balanced among the 
ners, 
eat Subject was met in the hall. 
sub; menter indicated that he had 
Ject first. He said: 


On the way the 
to get another 


We are using two subjects in this experiment 
a We would prefer that you don’t see the 
her one right now. ` 
i om 
When the subject entered the experimental ro 


the A tition. After 
Partner was seated behind beura ‘asked the 


the sup; 
„Subject was seated, the exper TA 
ibject and the partner for their names. and se 
dividually and presented the | mAN aS. 
aki Was told that the experime’. i 
E co nige personality ai the subject 
s. A was projected, ? 
PSM ithe paa are. feao aae eroh 
hought the face revealed about Penisi: on A 
bi the person by ranking r ied: 
nk card, The experimenter con! 


In order to minimize personal ine toe 
Obtain greater objectivity we you (pointing to 
People work together as 4 te of this team. You 
the Subject) will be in Sn aur judgment and 
Can improve the causa wing Way- When I say 


of your team in the 


ematic diagram of experimental room an 


d apparatus. 


go ahead, both of you should discuss your in- 
dividual impressions about each picture, You 
will do this by using these two keys. When you 
want to talk, press the key marked Talk and keep 
it down while you are discussing your impressions 
of the picture. In order to get the help of your 
partner release your key and press the key marked 
Listen. Your partner can only talk as long as you 
press the key. Notice that each key is connected 
to a set of lights. When you press your key, the 
red light will go on and you and your partner 
will know that it is your turn to discuss the 
picture. When you press your partner’s key a 
green light will go on and your partner will have 
the opportunity to discuss the picture as long 
as you keep his key depressed. 


The team was told that they would have 4 
minutes to discuss each picture. They were asked 
to rank the picture on the basis of their own im- 
pressions and discussion with the partner. The in- 
structions were ended with: 

Evaluation of your ability to judge personality 
accurately will depend on two things, your in- 
dividual score and also the score obtained by 
your team. Try to be as accurate as you can, as 
an individual and as a member of a team. 


The experimenter then turned to the partner and 
said: 
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Your job differs from that of your partner 
only in that she is in charge of your team and 
she will press the appropriate keys. You have 
no keys and you must take your signal from 
these lights here. 


The experimenter demonstrated the use of the keys 
by having the subject and the partner interact 
briefly, and projected the first face. After 1 minute 
for the initial rankings the experimenter picked 
up the partner's card (which was actually: a blank) 
and the subject’s card, passed the subject’s ranking 
to the partner surreptitiously and signaled the 
beginning of a trial. At the end of 4 minutes the 
experimenter asked the team to stop and to rank 
the traits again on a green card. The same Procedure 
was repeated for five pictures. After the last trial 
a questionnaire was given to the subject and the 
partner. 


Rules for the Partner’s Behavior 


Reinforcing statements by the Partner were di- 
rected only toward the subject’s judgment and dis- 
cussion of the faces. In the agreement group the 
partner selected from the subject’s rankings the 
traits ranked first, second, third, eighth, ninth, and 
tenth, and expressed (as if they were his own 
opinions) statements which generally confirmed the 
subject’s ranking. In the disagreement group the 
partner expressed opinions which were the reverse 
of the subject’s ratings. The partner avoided answer- 
ing direct questions and never stated agreement or 
disagreement directly but only by giving his “own” 
opinions. The partner continued talking until the 
subject turned his light off. While 


the subject’s pre- to postratings was subjected to 
an analysis of variance for the partners under both 
treatments. No significant F ratios 
were obtained, indicating that all Partners were 
equally effective in the extent to which they af- 
fected the subject's change in judgments, Secondly, 
an analysis of variance was performed on the 
duration of time which the subjects allotted to the 
three partners. The resultant F was not significant, 
indicating that the partners did not differ in the 
extent to which they influenced the subject’s dura- 
tion of pressing the Listen key. 


Postexperimental Questionnaire 


The subjects were asked to check scaled answers to 
the following four questions: How satisfied were 
you with the group decisions reached? Answers 
ranged from 1, completely dissatisfied, to 9, com- 
pletely satisfied. If you were to be re-tested in a 
team to what extent would you prefer the same 
partner as you had today? Answers ranged from 1, 
completely prefer different partner, to 9, completely 
prefer same partner. To what extent did you par- 
ticipate in discussions in comparison to your 
partner? Answers ranged from 1, my partner did 
all of the talking, to 9, I did all of the talking. 
In comparison to your partner to what extent did 
you feel influential? Answers ranged from 1, my 
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r " yas 
partner was completely influential, to 9, I w 
completely influential. 


RESULTS 
Opinion Change 


The effects of the experimental treatments 
and personality variables were examined by 
an analysis of variance on the rho corre 
tions between each subject’s first and secon 
ranking on each trial. A high value of pi 
correlation indicates little shifting oF Ne 
subject’s opinion while a low rho indicates G 
great degree of change. The analysis yielde 
a significant F of 30.81 (1 and 40 df; H 
< .001) only for agreement versus disagio 
ment. Neither orientation nor the pado 
affected the extent to which the pubes 
changed her opinion. The mean rho for a 
subjects under agreement was .30 while t : 
mean rho for subjects under disagreeme' 
was .19. The size of the correlations sugges i 
that agreement did not result in very me” 
Stability in the subjects? judgments. os 
sidering the ambiguity of the photo i 
task, a rho of .30 does indicate that respons 
to the pictures were not random. 


Time Allotted to Partner 


The mean duration of Listening and Talk 
ing in all groups is presented in Table on 
Table 2 presents an analysis of variance 6 
the total time which the subject allotted 
the partner. The subject’s willingness ted 
listen to her partner was significantly rela n 
to orientation rather than to the reinforceme s 
conditions. Interaction-oriented subjects net 
most willing to listen, regardless of whe 
the partner agreed or disagreed (Table 42 
An F of 2.39 for trials (at p = .05, F = aie 
revealed that the subjects tended to decre’ s 
the amount of time given to the partner 


TABLE 1 


ENT 
DISTRIBUTION OF Time (IN SeconDs) Seme 
IN TALKING AND LISTENING FOR SUBJECTS 

VARYING ORIENTATIONS 


Orienta- Listen key Talk key yg 


tion (second) a (second) 
„46 
Interaction 175.98 73.32 63.32 40.98 
Task 105.84 6910 7101 346 
Self 155.50 64.79 83.22 


ining u 
ain! 
* Percentages do not add to 100, since the remat! 
Was spent in silence. 
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TABLE 2 


ANALysIs OF VARIANCE ON TIME 
ALLOTTED TO PARTNER 


Source af its g 
Bety 
po subjects 47 1,918.75 
va (A) 2 8,384.78  493* 
p Bree versus i 
if sissaree (B) 1 4421 = 
oe 2 997.80 — 
42 1,699.33 
Within Subjects 
aK 
ex a 4 902.75  2.39* 
CxR 8 328.39 = 
CXB% 4 261.81 = 
Eno SA 8 69647 185 
168 376.96 
” b <10, 
b S025, 


e A 
Over pa criment progressed. A trend analysis 
6, Jas (Edwards, 1960) yielded an F of 
31 Ê < .10) for Orientation and an F of 
o RU -10) for the statistical interaction 
the in Orcement and Orientation. In testing 
‘t Was ĉar slopes of each group against zero 
Were ‘Ound that task-oriented subjects who 
. i agreement significantly decreased 
o< * allotted to their partner over trials 
- Interaction-oriented subjects when 
(p< ,*8teement also tended to decrease 
Ment while self-oriented subjects under 
Nt tended to increase (p < -10). All 
Under disagreement showed slopes 
a Best 'd not differ from zero. These results 
A ttn that the change in time allotted to 
ae tin Was a function of the subject’s 
“tions lon as well as reinforcement CON- 
Pe 
Que 
k When of Interactions 
ay Was greauency of presses for the Listen 
ti BEL rein eed to an analysis of variance, 
ple ect: Orcement conditions nor Orienta- 
ie 2.20 were found to be significant. An 
Wad fo With 4 and 168 df (p < -10) was 
tatin, . trials, suggesting a decrease of 
ere actions over trials. These re- 
dig Vis, TI examined further by 4 trend 
the not diffe, linear components in each group 
eh quadrati, significantly over trials. W hen 
(è fo tement components were examined, 
53.25) conditions differed at p <10 
and the Reinforcement by Orien- 


a 
Zs 
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tation interaction was significant at p < .025 
(F= 4.01). With agreement, the groups 
tended to decrease rapidly over the first three 
trials and then increased until they reached 
the same frequency on the fifth trial as on 
the first trial. On the other hand, the groups 
receiving disagreement tended to listen with 
approximately the same frequency over all 
trials. In this regard, the interaction-oriented 
subjects also showed the least change over 
trials, while the task-oriented subjects dif- 
fered most, yielding a pattern of initial de- 
crease followed by recovery. Self-oriented 
subjects showed this pattern to a lesser 


degree. 


Subject Talk-Time 


Since the subjects were requested to de- 
press either of the two keys throughout each 
trial, there is an inverse relationship between 

ime allotted to the partner 


the amount of ti 
and the amount of time taken by the sub- 


ject. However, since some time (actually less 
than 1% of total time available) could be 
taken up by silence (failure to depress either 
button), an additional analysis was performed 
on the amount of time during which the 
subject depressed his Talk key (see Table 1). 
An analysis of variance yielded an F of 4.81 
(p < 025) for Orientation. The analysis 
further indicated a significant F of 3.27 (with 
4 and 168 df, p< .025) for trials. All sub- 
jects tended to talk more as the experiment 
progressed. ‘A trend analysis substantiated the 
findings from the analysis of time allotted 
to the partner. Interaction and task-oriented 
subjects under agreement significantly in- 
creased in time spent talking on successive 
trials (p < -05). ; 

The analysis of the subject's own speech 
patterns indicated again that orientation was 
relatively more important in determining the 
subject’s dyadic speech pattern. Self-oriented 
subjects took the most time, interaction- 
oriented subjects spent the least time talking. 
All subjects invited the partner to talk more 
than they talked themselves (see Table 1). 
The mean percentage for the aema 
groups was 30.9%; for the disagreemen 
groups it was 30.0%- ; 

An analysis of variance on the frequency 
with which the subject pressed the Talk key 


yielded no significant differences with regard 
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to the main variables. An F of 2.23 with 4 
and 168 df (p < .10) was obtained for trials. 


Questionnaire Data 
Each of the four postexperimental ques- 
tions was analyzed by an F test for differ- 
ences between reinforcement conditions and 
Orientation scores. The subjects who were 
given agreement yielded a mean score of 
7.67 in response to question one, “How satis- 
fied were you with the group decisions 
reached?” while subjects given disagreement 
yielded a mean score of 4.29 (F = 79.94, p < 
.001). On question two, the subjects given 
agreement had a mean score of 7.17 in how 
much they preferred their partner for a retest, 
while the subjects given disagreement had a 
mean score of 4.79 (F = 26.33, p< .001). 
On question three, about how much par- 
ticipation was shared by the subject and the 
partner, significant differences were found for 
orientation but not for reinforcement con- 
ditions (F = 8.33, p< .001). Self-oriented 
subjects stated that they talked as much as 
their partner (M = 4.69), task-oriented sub- 
jects felt their partner talked more (M = 
4.06), and interaction-oriented subjects felt 
their partner talked much more (M = 3.69). 
As can be seen from Table 1, these ratings 
accurately reflected the talk-listen differences 
between the subject and the partner. How- 
ever, all subjects grossly overestimated their 
own contributions in the interaction. On ques- 
tion four, about how influence was shared, 
both reinforcement conditions and orientation 
yielded significant differences, An F of 8.75 
(Ż < .005) indicated that the subjects under 
agreement felt they had equal influence in 
making the decision (M = 5.33) while the 
subjects with whom the partner disagreed 
thought the partner was more influential (M 
= 4.38). The analysis yielded an F of 3.59 
(p < .05) for orientation. The interaction- 
oriented subjects felt that their partner 
was more influential (M = 4.25), and the 
task-oriented subjects felt that they were 
of about equal influence (M = 5.06). Self- 
oriented subjects thought that they were 
somewhat more influential than their partner 

DISCUSSION 

The present findings yield evidence that 
the content of the partner’s speech affected 
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oa. ime 
both the subject’s opinions and the time 


which she allotted to herself and to m 
partner. The subjects shifted opinions A 
when interacting with a disagreeing par 7 
than an agreeing partner. In this regard, m 
subjects? orientation was not a per 
factor. The correlation coefficients ing 
that the difference in opinion change nif 
not very great, even though it was nate 
cantly different for the two ee, y 
treatments. However, in the present S ong 
the variables which usually result in st 

modification of group opinion have 

minimized. For example, the subject a 
saw her partner. She considered him 


SSE try A 


u 
: A n 

peer at best, since “the subject had, bes Pal 
in charge of the team” by instruction oided 


partner played a subservient role and av' 
expression of strong opinions. The 
indicate that even under these con 
disagreement enhanced the subject’s 
bility. B. 
The subjects’? tolerance for listen g 
their partners decreased and their W tend 
ness to talk increased with time. This wi 
ency may have been due to boredom e 
the partner, or a gradual lowering ce. ON 
estimation of the partner’s importam ‘conv 
the subjects may have become mor 
fortable in the interaction, . 
toward a more equal distributio: i 
However, even during the last trial is il 
jects talked for much less than half irs jc” 
able time. These results have some elot in 
tion for the behavior of the mee fa 
therapeutic interactions. If tolerera vit 
listening to another person decreta K uld 
increased exposure, then a therap S perea” 
have to guard against a tendency tO E 
talking as a counseling session progres by 
The personality traits, measured |” tend 
ORI, strongly influenced the ae ee 
ency to approach, listen to, or t@ erac jor 
partner. Given control over the int ofl ee 
subjects allotted over two-thirds rs- T 
total available time to their parte iat e 
self-oriented subjects behaved as A j 
ment theory would suggest al a fro 
should. As they experienced age allo 
their partners they increased the ee LEA 
to listening, thereby increasing the opii” 
positive reinforcements for their 


dition’ 
flex!” 


results | 
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ian es hand, interaction and task- 
nee Ae jects behaved more like a dy- 
should rane would suggest all subjects 
the part © they experienced agreement from 
Arena SAP their attempts to influence him 
Apparen Page the time allotted him decreased. 
to ae self-oriented subjects attempted 
ene magnify the partner’s approving 
Periods me inviting him to talk for longer 
subjects his involved no effort since the 
Order to mr had to press the Listen key in 
interactio, tain agreement. On the other hand, 
creased a and task-oriented subjects in- 
rewarded Ce for which they had been 
own talk hese subjects focused on their 
focused 6 while the self-oriented subjects 
Ment pr T partner’s agreements. Disagree- 
Subject’s with no significant change in the 
Pinions. illingness to listen to the partner's 
it Tatbiaing both experimental conditions 
Jects a that interaction-oriented sub- 
thers i most willing to listen to their 
ir willi resumably these subjects may see 
aintaini ingness to listen as a means of 
Tienteg ing good relations with him. Self- 
about ee appeared least concerned 
denen, ening, possibly showing their greater 
Oriented ence in this situation. When the self- 
Îscussi subject is in a free small group 
lends his behavior is quite different. He 
influence talk the least and to successfully 
ings? the group very little (Bass, 1961). 
Partner all subjects were female and the 
the g.> Were male the sex composition of 
atie adic team may have been a factor 
‘pane the distribution of time among par- 
(1963) > In a recent study by Elwood 
ang ne Using a similar procedure, both male 
Baers T subjects were used with male 
Os for lwood found higher mean dura- 
ects 4 male subjects than for female sub- 
er some of his conditions, while sex 
Win, JS: an effective variable in other Con- 
th Eby, comparison of the present findings 
of lag Con results further suggests that 
talk gs tiable also affects the distribution 
bie time. Giy ific task, the 
Con tS were si en no speci c , th 
Ver ation simply instructed to engage 10 
tales thes to “get better acquainted. 
The d less e conditions the subjects also 
€ final at first but increased over trials. 
Percentage of time allotted to the 
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partner closely approximated 50%. From the 
results of both studies it is evident that the 
status of the partner, the subjects com- 
petence in solving a problem, and the sex 
relationship in the team all affect dyadic 
speech patterns. On the questionnaire the sub- 
jects felt more satisfied with the group dis- 
cussion when the partner agreed than when 
he disagreed. The agreeing partner was also 
much preferred and the subjects felt that he 
had more influence under agreement than dis- 
agreement. These results suggest the im- 
portance of the personal and emotional by- 
products affecting the quality of the dyadic 
relationship, quite apart from the real value 
of a partner’s contribution. These data lead 
one to suspect that conformity of a partner 
may become a more important criterion for 
his inclusion in a dyad or group than his 
actual contribution to a task. Social orienta- 
tion also affected the subject’s impression 
about his dyadic role. Self-oriented subjects 
tended to overrate their own importance in 
the interaction more, perceived themselves as 
more influential and more actively participat- 
ing (as indeed they were) than subjects who 
were task or interaction-oriented. 
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MacKinnon ( 1944) has credited Janet 
with the first attempt to distinguish between 
symptom and personality trait constellations 
in the neuroses. A revival of interest in this 
problem has taken place in recent years as 
attempts have been made to improve the 
validity and reliability of Psychiatric diag- 
noses and to devise relevant measuring instru- 
ments (Chodoff & Lyons, 1958; Foulds, 
1961b; Foulds & Caine, 1958, 1959; Sandler 
& Hazari, 1960). The distinction is of rele- 
vance to therapists since there is evidence 
to suggest that psychological tests related to 
symptomatology change more readily with 
treatment than do tests related to personal- 
ity trait constellations, with the personality 
trait measure here to be described remaining 
relatively more stable than symptom and 
attitude tests (Foulds, 1959; Foulds & 
Adams, 1962; Martin & Caine, 1963). 

The recent empirical evidence thus sup- 
ports Janet in his contention that the 
hysteroid and obsessoid personalities can be 
usefully dichotomized along a single dimen- 
sion as an examination of the traits making 
up the relevant constellations suggests. Until 
the development of the Hysteroid/Obsessoid 


1 Thanks are due to Keith Hope for his 


help 
with some of the statistics. 


mental hospital who were rated by the staff on the 
An analysis of tł i 


rect 

Questionnaire (HOQ), no test hased drains! 
upon the traits involved and validate ed. 
an outside criterion has been deven of the 

The present study is a et ô thi 
construction, validity and reliabiin ir 
HOQ together with its relation foun t 
relevant factors. If the HOQ 13 obsess” 
relate more closely to hysteroid/ ria/DY*, 
trait constellations than to Hyster stil 
thymia symptom complexes, further be from 
tion for the separation of sympto 
traits will have been provided. 


METHOD 
Construction of the Questionnaire 


int 
pratio 
Tt was found in previous work that es Me 
by hysteroid and obsessoid pera py Foro 
hysteroid-obsessoid rating scale describe te the j c 
and Caine (1958) failed to differen? ach Daas 
groups sufficiently accurately to be or pring at 
tical value. It was felt that rating sca pjeci ye as 
were difficult to grasp for untrained S jcation 
some of the points had mo Tis 
worded, and that the 9-poin M: 
was too short for high reliability. To cian mini 
difficulties the 
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HYSTEROID/OBSESSOID COMPONENT 


ae to Wear eye-catching clothes 
It He quiet at parties or meetings 
like ce me to be the center of a lively group 
iscussing myself with other people 


Forty-j 
Y-eight statements were thus compiled each of 


Which 
c s F k 
leco a be scored in a hysteroid or obsessoid 


Administration of Tests 


Thi á 
enter o estionnaire was administered to all patients 
Claybury a community wards for neurotics at 
Bee ae Deed In addition 
from, the ic patients referred to the psychologist 
as liysteroid a admission wards during the period 
minimum or obsessoid personalities were tested. 
on the Mill of Grade IV above the tenth centile 
three pati Hill Vocabulary Scale was set. Ninety- 
Of these ma in all were tested on the HOQ and 
*tsonality Tat given the Minnesota Multiphasic 
'Stinguish ree ecu (MMPI) scales found to 
oulds etween Hysterics and Dysthymics in 
Were combi and Caine (1958, 1959) studies. These 
the s aon of the Hs, D, and Pt scales with 
for $ score for women and the D + Pt score 
K was also included as were the Extra- 


Punitive” 

v 

by Pa Intropunitive scales reported in detail 
of the S, Caine, and Creasey (1960). Fifty-three 


Sonali a ponents also completed the Maudsley Per- 
treat nventory (MPI) but at different stages 
Werg Reiner together with the HOQ. Sixty-two 
ested on the HOQ 6 weeks after admission. 


aö 
lishment of the Criterion 


atin N 
medica on all patients were obtained from 
Ate, |? Nursing, and occupational therapy staff, 

e rating scale 


Sescribe we using a form of th 
58), In the previous study (Foulds & Caine, 
fto 3 he number of raters per patient varied 
Usua ly to 11 with a mean of 7.5. Ratings were 
Week a completed towards the end of the second 
“onfi bee admission, when raters felt reasonably 
tag an pf making an accurate assessment. 
atin sa es of the internal consistency of the 
fo. Ma] ae items showed that only 6 of the 11 
be the raits were being successfully differentiated 
oki g, groups by the raters, namely, attention 
N alie “Motional display, speed of decision, lability 
he si Conscientiousness, and shallowness of affect. 
erid ota formed the subsequent basis of the 
anal: sessoid rating allocation. 
Dd Oi ake of the mean rating was computed 
th, Dite th this confidence limits were calculated. 
hè calc © safeguards taken in the rating scheme 
© rati ulations showed that the dependability of 
Com, ttings was in question. Although this is a 
faetono a and recognized weakness of ratings, a 
to i ial experiment. was carried out in an effort 
"atone Prove the validation criterion. Five of the 
twice each rated 10 males and 10 female patients 
maki? with a 10-day interval between ratings, 
Vatign® a total of 200 ratings. An analysis of 
Nee was carried out with raters, replica- 
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tions of ratings, and patients as the sources of 
variance. The between-raters, between-patients, and 
rater/patient interaction variances were significant 
but the between replications were not. The between 
raters variance was too small to be of practical 
significance and since the rater/patient interaction 
could not, in practice, be allowed for, it was 
included in the residual variance. The experiment 
was collapsed into a 5 (raters) X 20 (patients) 
factorial design. Estimates of the variance of the 
factors were made on the assumption of Model 11 
of the analysis of variance (Snedecor, 1946). The 
variance of the raters was only 35, while that of 
the patients was 4.37. The error variance was 2.69. 
One may conclude from the factorial experiment 
that raters tended to show something of a bias in 
one direction or another but that this bias was 
too small to be worth correction, and that the 
raters differed frequently from each other in their 
individual classifications. The reliability of the mean 
rating of each patient (averaged over the five 
raters and measured by an intraclass correlation 
coefficient) was .92. 

The distribution of the patients’ ratings was not 
significantly different from a normal distribution 
having the same mean and standard deviation. 
Degree of hysteroidness or obsessoidness as observed 
by trained observers was thus not related to hospital 
admission. 

To establish the hysteroi 
classification the ratings for eac! 
taking the total hysteroid score less the total obses- 
soid score as the allocation. This allocation yielded 
65 obsessoids, 27 hysteroids, and 1 indeterminate. 

A preliminary analysis of the HOQ was con- 
ducted to determine the score association of ques- 
tionnaire items among themselves. Items most 
closely related were placed together with agreements 
below the .05 level of significance being ignored. 
A factor analysis of the questionnaire is contem- 
when a larger number of cases has been 


d or obsessoid rating 
h patient were pooled, 


plated 


collected. 
The score on the HOQ (total hysteroid score 


since each item could be scored hysteroid or 
obsessoid) and other clinical and test data were 


related to the personality rating classification and 


to themselves. 
RESULTS 
Abilities, Sex, and Age Composition of the 


Sample 


Of th 
completed the 
test. The mean 


e 93 cases (42 men, 51 women) 91 
Mill Hill Vocabulary synonym 
synonym score of these was 
27, giving an expected total of 51, which is 
well within the “average” range. Of the 
two remaining referred cases, one had a 
Wechsler IQ of 105, and the other fell at 
the seventy-fifth centile on the Progressive 
Matrices test. The mean age of the 28 rated 
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hysteroids was 33.72 (SD 8.78), that of the 
65 rated obsessoids was 33.95 (SD 9.50) with 
a directly comparable age range of 19-54, 


Predictive Power of the Questionnaire 


The power of the questionnaire in excess 
of chance in predicting the rating scale 
allocation was calculated taking a score of 
24 and above as the questionnaire hysteroid 
classification. To call a person hysteroid or 
obsessoid is similar to calling him long or 
short—the dividing line is arbitrary. A 
priori, since only hysteroid responses are 
scored on the HOQ, those scoring on more 
than half the items may be legitimately as- 
signed to the questionnaire hysteroid clas- 
sification. The rating scale classification was 
thus found to be correctly predicted by the 
questionnaire in 80% of the cases, an im- 
provement over chance of 26%, 


Correlations with the HOQ 


Biserial, product-moment, and Pearson r’s 
were calculated between the HOQ and other 
clinical data and tests and the results are 
shown in Table 1. 

No significant relationship was found 
between any of the clinical scales of the 
MMPI and the rating scale classification. 
There was no significant sex difference in 
HOQ score of those rated hysteroid or 
obsessoid although significantly more women 


were rated as hysteroid. Scores on the HOQ 
were normally distributed. 


Item Agreement within the HOQ 


Items showing the highest score agreement 
within the questionnaire could be identified 
under the following traits: (a) imaginative 
thinking; (4) meticulous concern with de- 
tails and standards; (c) self-effacing; (d) 
self-display, fashionable dress; (e) speed of 
decision; (f) conscientiousness; (g) deep, 
lasting friendships; (/) dependence on 
others; (7) inhibition of emotions; (j) lability 
of effect. All items tended to agree with the 
exception of one to which 83 subjects re- 
sponded “yes” and which was misplaced in 
the item agreement analysis. 


DISCUSSION 


1 shows that with the exception of 
i Š Scale the highest measure of 


T. M. Came anp L. G. HAWKINS 


TABLE 1 


CLINICAL AND Test DATA CORRELATION: a 
HYSTEROID-OBSESSOID QUESTIONNA 


s wiru THE 


Number Correlation 


ient 
Data of cases coefiicien 
HOQ retest (after 6 weeks a7 
community therapy) 62 : ttt 
6 
Rating classification 92 
Clinical diagnosis (Hysterics 28** 
15; Dysthymics 71) 86 s 3 
Age 93 a 
Verbal ability (Mill Hill = 08 
Vocabulary scale) 91 
MMPI scales 08 
K 77 = 1 4 
Hs 77 —15 
D 77 at 
Pt 77 . 
Test diagnosis (Hysteria; 
Dysthymia) 002 
D — Hs (women) 41 "32 
D + Pt (men) 36 
Total punitive score -07 
MMPI Extrapunitive + In- 77 7 
tropunitive 
Direction of hostility 3i"* 
MMPI Extrapunitive — In- 77 
tropunitive 
Maudsley Personality In- 
ventory (MPI) gat?” 
Extraversion 53 _ 40" 
Neuroticism 53 
Correlation between MPI 46 
Extraversion and Th 
Neuroticism 
*p <.05, 
“4 <i. 
> <.001, 
pa 


agreement is between the HOQ and meg b 
sonality rating classification. Tepe ent 
considerably smaller measure of am noe 
between the HOQ and the clinical ap ig 
The test diagnosis for men is alm wor! 
nificant at the .05 level but that for is not 
is negligible. The total punitive score ™ ppe 
related to personality as measured ot? 
HOQ but hysteroids are significantly re 
extrapunitive, a finding supportive fe jti? 
vious work. There is a significant P 


{ 
| 


relationsh; 
on between age and HOQ in this 
a ury sample. 
the HOO a high level of agreement between 
they are and the MPI E scale suggests that 
is is ee, virtually the same thing. 
Places m, p surprising since Eysenck (1959) 
eh reliance: on “attitudes” than on 
naire a in hysterical symptom question- 
e eo rueton and equates hysterical 
ysterical symptoms, “hysterical attitudes,” 
&stablishi personality, and psychopathy in 
eè also ng his extraversion criterion groups. 
the g Eais to the heavy dependence of 
cale on “sociability” (Eysenck & 


Clara 
Some 8e, 1962), a trait which is related to 


o i i 
seekin f the self-effacing versus attention 


sing items of the HOQ. 

tinguish re well be that the failure to dis- 

Dlexes h etween trait and symptom com- 

0 Ro oe lack of confirmation 

dee ck’s initial findings (Eysenck & 
, 1962; McGuire, 1962; Sigal, Star, 


lari 

r 
avare me 1958). As far as the author is 
the MPT attempt has been made to validate 
of extra = Scale against an outside criterion 
Sotiatet son as measured by ratings of 
fal acies personality traits. The logical 
OF this of using Hysterics and Dysthymics 
Foulds purpose have been pointed out by 
Borts ,{1961a), and the present study sup- 
Bot contentions. In view of the theoreti- 
evi roversy and conflicting experimental 
system, one 


in restricting the interpretation 


bn HOQ to the hysteroid-obsessoid di- 

ad which it was devised. The 
toye ction of a third extraneous and con- 
inte. Sial concept is not required in the 


) ter : 
Pretation of the data. Particularly 15 


viden 

ìs j ce concerning Eysenck’s 
Ustified 
of th 


Shot 
q 


SI . 
the lite When there is general agreement in 
i 


ality types 


erature that the two person: YE 
al entities. 


Uuesti, MT 
€stion form recognizable clinic 


| Hysterow/OssEssomi COMPONENT 
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DEVIANT RESPONSE PATTERNS AS A FUNCTION 
OF CHRONOLOGICAL AGE* 


S. O. HESTERLY 2 


Louisiana State University 


Berg's Deviation Hypothesis suggested that: (a) response patterns of children è 
on the Perceptual Reaction Test (PRT) will vary with age to adulthood 
so that PRT age scales can be developed reflecting maturity level; (b) PRT 
age scale scores of elderly adults will reflect a decline in maturity level. PRT 
responses which differentiated 500 children aged 6 from 850 adults aged 
20-59 (p < .0025) were selected for the age scales. PRT age scale scores for 
cross-validation groups (500 children aged 7-16; 203 adults aged 20-59; 


200 adults aged 60-83) 


reflected systematic increases in scores with age 


to adulthood (p < .001) and decreases in scores for elderly adults (p < .001). 


The scores of elderly adults approxim: 


Subjects seldom respond randomly to 
multiple-choice tests even when there is no 
obvious basis for choosing one response above 
another. Instead, certain responses are 
chosen significantly more often than others, 
so that biases occur. Berg, Hunt, and Barnes 
(1949) developed a test which was specifi- 
cally designed to elicit such biases. This test, 
the Perceptual Reaction Test, referred to 
hereafter as the PRT, consists of 60 abstract 
designs to which the subject responds by 
choosing one of the following options: like 
much, like slightly, dislike slightly, or 
dislike much. Of course, the subject can 
make no response, although this is contrary 
to the test instructions. When Berg (1955, 
1957) administered the PRT to different 
groups of subjects, he found that subjects 
who made responses which departed from the 
typical bias patterns, i.e., uncommon or 
atypical responses, also tended to show per- 
sonality deviations. These findings led him to 
formulate the Deviation Hypothesis which 
states (Berg, 1957): 


Deviant response patterns tend to be general; 


e hence 
those deviant behavior patterns which are si 


L) e ignifi- 
cant for abnormality (atypicalness) and thus re- 
garded as symptoms (earmarks or signs) are asso- 


1 This article is based on a dissertation submitted 
to the Graduate School, Louisiana State University 
in partial fulfillment of the requirements for the 
degree of Doctor of Philosophy. The author is 
grateful to his advisor, I. A. Berg, for his aid and 
encouragement. 


? Now at the University of Arkansas, School of 
Medicine, Little Rock, Arkansas, 


ated those of children aged 9-10. 


5 whic 
ciated with other deviant response patterns, wat 
are in noncritical areas of behavior and ghisa 
not regarded as symptoms of personality 
ration (nor as symptoms, signs, earmarks) IP. 


X ness 

A series of studies (Adams, 1959; P58 

1955; Grigg & Thorpe, 1960; Harris, © 9, 
Hesterly & Berg, 1958; Roitasch 


= dies 
1959) were stimulated by the early st ro 


The Hesterly and Berg (1958) study Pi | 


vided the suggestion for the present ie jo! 
The results obtained in this investi" ing 
indicated that the response patterns of y tl 
children to the PRT differed significa gy 
from those of normal adults in such 4 jch 


0 1 whi | 
as to approximate the deviant patterns ‘epic | 


Barnes (1955) had found with schizoP "JE 
adults. In addition, there was clearly anes 
nificant tendency for scores On pa ot 
schizophrenia scale to decrease with 48 jike 
that older children tended to score Mt 
normal adults, pat 
The present study represents an gee 
using these changes with age in the TP’, of 
patterns of normal children as a MC45" ge 
maturity as defined by chronologic’ 97) 
The Deviation Hypothesis (Berg, „yio 
Suggests that the various “critical” phys, 
patterns which are associated with the F cut 
ical and psychological changes whic? re 
as children grow to maturity shoul’ rns on 
flected in “noncritical” response ae ag? 
the PRT, which change with increas"! oveloP 


se 


: e 
Specifically, an attempt was made tO norm? / 


age scales for the PRT, using the 
adult data employed by Barnes 
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ta 


DEVIANT RESPONSE PATTERNS 


(ase measure. Further, the possibility 
ci rmal adults, as they approach senility, 
hee deviant patterns on the PRT, 
gated Ve those of children, was also investi- 
Drè . Accordingly, the hypotheses of the 
mage study were two: 
Mien wes response patterns of normal chil- 
3 a the PRT will vary with age in such 
y <a to reflect maturity level as defined 
e eaen age, making possible the 
tendencies. of age scales based on group 
i e response patterns of normal adults 
age grou T will vary with age, when older 
& releet d are considered, in such a way as 
aS the decreases in scores on the age scales 
4 A approach senility. 
tiline ee a systematic development and 
was h in patterns of response on the PRT 
sons, ge for children and aged per- 
Would pormal children and elderly adults 
erns Po aas to manifest similar pat- 
represent leviant responses to the PRT. This 
erg s a test of the Deviation Hypothesis 
normal 1957). The behaviors shared by 
Meta aren and elderly adults may be 
adults as critically different from normal 
critica] aged 20 through 59 years. Such 
„cal differences should be reflected in non- 


(i 

ritica] ar : T tion 

Select: eas of behavior as PRT optio! 
ections, 


METHOD 
Subjects 
A 
stug tat of 2,253 subjects were used in the present 
by sex) ive hundred school children (divided equally 
two y? from the public and parochial schools of 
Sery, Ouisiana parishes aged 6-0 through 6- 
aa pte Subjects in the construction of the 
femal 08) with 850 normal adults (500 males; 350 
Adult oy Barnes (1955) constituting the normal 
ie e. 

Louisia inated public school children, from four 
Were y a parishes, divided equally by age and sex, 
aa ince in developing norms for older children 
at eg ch testing the validity of the PRT age scales 
ast 39 Of the following age levels: 7-0 through 
through” through 10-11; 11-0 through 12-11; 13-0 
9 Two pri 15-0 through 16-11. 
5 females dred three normal adults (105 
pales) drawn from Berg’s file 0 
sepo 28h a sample of normal adults aged 2 
rho 59 years, This sample included policemen, 
rlasses €achers, housewives, adults in evening college 
(y ai the like, Two hundred elderly adults 

€s; 150 females) ranging in age from 6 
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to 83 years, formed the sample used in testing the 
second hypothesis of this study and in developing 
norms for elderly people. These subjects were all 
members of the Golden Agers Organization of 
Orleans Parish, Louisiana and included individuals 
from various socioeconomic backgrounds. 


Procedure 


The PRT was administered to the subjects in 
groups of 12 to 70. Standard instructions were used 
with adults; however, these instructions were modi- 
fied somewhat, as in a previous study (Hesterly & 
Berg, 1958), for children, to insure their under- 
standing the nature of the task required of them. 
The responses for each subject were transferred 
to IBM answer sheets for tabulation and scoring. 
For each option, the percentage of the criterion 
(or normal adult) group was compared with the 
percentage of the experimental (or normal 6-year- 
old) group and tested for statistical significance by 
use of a contingency table (Mainland & Murray, 


1952). Age scales for males and females, taken 
a cross- 


separately, were then constructed by 
validation technique proposed by Katzell (1951). 
In this technique, the samples are divided into 
random halves with independent statistical analyses 
performed on each half and the resulting scales 
validated on the remaining half. Only those options 
which differentiated the groups beyond the .05 level 
of confidence on both scales were included in the 
final scales. Each option preferred by normal adults 
was given a weight of 1; each option preferred by 
normal 6-year-old children was given a weight of 
—1. Each subject’s PRT age scale score was com- 
puted by algebraic summation of the weights for 
the options he chose. Measures of central tendency, 
variability, and reliability were then computed for 
the various age groups. The variances for the groups 
appeared to be rather heterogeneous; therefore 
Bartlett’s test for homogeneity of variance was 
applied to the five age groups of children and 
Fisher’s test for homogeneity of variance between 
two groups was applied to children versus adults, 
children versus elderly adults, and adults versus 
elderly adults. Since all of these tests yielded prob- 
abilities of less than .05, nonparametric techniques 
were used to compare the group PRT age scale 
scores. Specifically, the nonparametric median test 
was deemed appropriate for data of the type ob- 
, as this test is not dependent upon homo- 


tained. 
geneity of variance. The hypothesis tested is that 
the groups compared are random samples from a 


population with a common median. 


RESULTS AND DISCUSSION 


The male PRT age scale constructed in 
this study included 176 of the 300 possible 
response options. Sixty-four options were sig- 
nificantly more frequent among 6-year-olds, 
and 112 options among adult males; accord- 
ingly, weights of +1 and —1, respectively, 


212 S. O. HESTERLY 
TABLE 1 
SUMMARY OF MEASURES OF CENTRAL TENDENCY, RELIABILITY, AND VARIABILITY OF 
PRT AGE SCORES BY AGE GROUPINGS 
— 
Males Females 
Age group N M SD re SEs N M SD ru SEs 
6to 6-11 250 —21.0 23.80 97 404 230 15.6 18.30 96 a 
7to 8-11 50 —12.5 30.64 98 4.29 50 —10.3 19.52 98 164 
9 to 10-11 50 —0.3 23.39 .95 5.23 50 0.4 16.45 .99 157 
11 to 12-11 50 5.5 13.09 99 1.31 50 5.3 15.68 99 72 
13 to 14-11 50 13.9 18.55 .99 1.86 50 11.0 15.71 98 337 
15 to 16-11 50 23.1 1685 97 2.92 50 18.2 10.68 9045 
7 to 16-11 250 5.9 23.96 99 2.40 250 5.6 18.02 99 340 
20 to 59 105 29.1 15.29 95 3.42 98 224 12.86 .93 ay 
60 to 83 50 2.1 27.15 99 2.72 150 —4.1 19,93 .91 s 
rly 


were assigned to each option. Thirty-seven 
other options were eliminated in the cross- 
validation. 

Similarly, the female PRT age scale in- 
cluded 135 options (55 significantly more 
frequent among 6-year-olds, 80 among adult 
females). Forty-three other options were 
eliminated in the cross-validation.? 

A summary of measures of central tend- 
ency, variability, and reliability for various 
age groups on the PRT age scales is pre- 
sented in Table 1. A systematic negatively 
accelerated function in mean score as a 
function of age from age 7 through age 16 
can be observed. 

The results shown in Table 2 indicate that 
a statistically significant increase in median 
PRT age scale score occurs among normal 
children aged 7-0 through 16-11. With in- 
creasing age, these scores increasingly re- 
semble those of adults aged 20 through 59. 
This suggests that the scales would be cor- 
related with the critical behavioral changes 
which occur during the process of growing 
to adulthood, e.g., those response modifica- 
tions which are related to physical and 
intellectual maturation. 

The results shown in Table 3 indicate that 
the PRT age scales significantly differenti- 
ated children from adults aged 20 through 
59. Further, the female PRT age scale sig- 


3 The complete dissertation containing keys and 
T scale conversion tables for the PRT age scales 
may be obtained by ordering Publication No. 
60-5916 from University Microfilms, Ann Arbor, 
Michigan. 


nificantly differentiated children from eds 3 
adults, with the latter obtaining 4 ai 
cantly lower median score. A similar the 
nonsignificant trend was present with a 
male PRT age scale. Inspection of the ei: 
and median scores reveals that the a 
adults in the present study responded “Idren 
PRT in a manner most like that of chil tion 
aged 9-0 through 10-11. The Devia 

Hypothesis (Berg, 1957) would lead US 


I lts 
infer from these results that elderly sa 
and children aged 9-0 through oe aren 


common behavior patterns in a critica! 4 oft 
Table 3 also indicates significant di of 
ences between the PRT age scale ates 
adults aged 20 through 59 and elderly a 
with the latter obtaining significantly e 
median scores. From this it may be i: 
that a decline in PRT age scale SC 


. . i ical 
exists which is associated with the or tet 


Thus it would seem that the PR 
scales measure a function which res 


the general pattern of growth and d 
which has previously been observ ng 
manual skills (Miles, 1931), sige 
(Cathcart, Hughes, & Chalmers, 1935; jlv® 
& Stoessiger, 1927), reaction time (De & 
1936; Pearson, 1925), intelligence ( ning 
Conrad, 1933; Miles, 1931), and 
ability (Miles, 1933; Ruch, 1934). er 
the task of responding to the designs t 
PRT seems to be qualitatively differe” 


DEVIANT RESPONSE PATTERNS 
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r TABLE 2 
ONPAR. 
ARAMETRIC MEDIAN Test FOR PRT AGE Score DIFFERENCES BETWEEN Acre Groups OF CHILDREN 


is Below Above Subgroup Group 
ge groups median median median median df x 
Males 
7-0 to 8-11 38 12 —8.0 
9-0 to 10-11 28 22 4.0 
11-0 to 12-11 25 25 75 7.9 4 43.63** 
13-0 to 14-11 18 32 11.6 
15-0 to 16-11 7 43 22.7 
Females 
7-0 to 8-11 38 12 —7.6 
9-0 to 10-11 32 18 1.6 
11-0 to 12-11 22 22 3.6 5.7 4 227.85** 
13-0 to 14-11 21 29 10.5 
1 5 45 17.4 


5-0 to 16-11 


k 
b <.001. 


those + 
rar i ie in the studies above in two 
absence E PRT instructions suggest the 
ence po ‘correct” or desired responses. 
Without e subject is asked to make a choice 
empt t the implication that he should at- 
Skil] tha choose correctly or to display a 
tego Eth, etc. In addition, the response 
imply ae (like much, like slightly, etc.) 
at the subject is expected to give an 


affective response. These aspects of the PRT 
task suggest that performance with the PRT 
is more likely to reflect differences in affective 
reactivity than differences in skill, intelli- 
gence, strength, and the like. 

With these considerations in mind, it is 
possible to infer that the PRT age scales 
tap some kind of affective maturity which 
has a systematic development and decline 


NCES BETWEEN PAIRED Ace Groups ` 


No: 
NPARAMETRIC MepraN Test ror PRT A 
ji gi Group 
Below Above Subgroups 3 i 
Age groups median median median median df x 
Males 
7-0 to 16-11 165 85 7.9 15.40 1 78.95** 
20 to 59 15 90 34.2 
7-0 to 16-11 121 129 79 7.10 1 2.21 
60 to 83 30 20 45 
20 to 59 36 69 34.2 27.50 1 25.89** 
60 to 83 30 11 4.5 
Females 
7-0 to 16-11 150 100 5.7 11.98 1 42.02** 
20 to 59 1 77 24.4 
7-0 to 16 5.7 5 ~ 9,76" 
-11 08 142 3.50 1 16 
60 to 83 : . 61 —0.9 
20 to 59 e 24.4 1 83.62** 
; 115 83 9.00 3. 
60 to 83 112 38 —0.9 
aos a 
$ Soi, 
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with age in normal individuals. The data 
tend to support this inference in that the 
normal adults aged 20 through 59 generally 
chose the “like slightly” or “dislike slightly” 
options for the vast majority of their re- 
sponses. This suggests that their affective 
reactions to the designs are rarely extreme. 
On the other hand, children and elderly 
adults chose the “like much” option quite 
frequently, suggesting relatively more extreme 
affective reactions to the designs. Perhaps 


these test performances reflect the character- 


istics which children and elderly people tend 
to share, viz., immaturity, childishness, 
atypical affect. The results of Barnes (1955) 
indicate that schizophrenic patients also tend 
to choose the “like much” option for many 
of their responses. This finding also lends 
support to the inference that performance 
with the PRT reflects affective maturity, 
since schizophrenics are typically character- 
ized as immature, childish, and atypical in 
affect. The author is currently engaged in 
further research with the PRT age scales 
in an attempt at assessing the inference. 
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CONSTRUCTION AND VALIDATION OF THE FOOD 
ATTITUDE SCALE * 


DONN BYRNE, CAROLE GOLIGHTLY, axD E. J. CAPALDI? 


University of Texas 


asure a personality dimension involving attitudes 
; toward food was constructed by means of internal-consistency item-analysis 
procedures, A pool of 221 true-false items was administered to 400 under- 
graduates. The analysis yielded 62 cross-validated items scored for males, 
63 for females. With new samples, split-half reliability was found to be .73 


A scale designed to me: 


for males, 


the food scale were found to 
viscerotonia and negatively correlate 


a cay stigations of individual differences in 
Cues hay responses to food or food-related 
reaction, è been confined primarily to aversive 
Presented Typically, a list of various foods is 
Ose iten and subjects are asked to indicate 
ey w, ems which they dislike, those which 
t Me ee refuse to eat, etc. With aversion 
able, į conceptualized as a personality vari- 
of E has been found that the number 
dumb s disliked is positively related to a 
ceding of relatively negative characteristics: 
toticige, Problems (McCarthy, 1935), neu- 
og (Gough, 1946; Smith, Powell, 
indice 1955a; Wallen, 1945), various other 
avids of maladjustment (Altus, 1949; 
Anxiet Lawton, 1961; Wallen, 1948); 
matas Smith, Powell, & Ross, 1955b), im- 
dislike y and a general tendency to indicate 
ten (Lindgren, 1962). 
ne though psychological research deal- 
With food attitudes has largely been 
€d to food aversions, it seems obvious 
he Considerably broader array of attitudes 
thing food and eating should constitute 


1 

qT 
ang rae Food Attitude Scale, 
the iene keys for each have 
Ment ee 
Toje - 7506 from ADI 
Bresse Photoduplication Service, 
$2.59 Vashington 25, D. C- Rem 
Make 1° microfilm or $1.75 for photo 
Service cks payable to: Chief, Pho 
zm” Library of Congress: P 
to The authors wish to express "MON q Labowitz 
Carl Mes Barry, John Brelsfords etl ial avid 
Jer McGraw Gladene Michel, ohn She" e O 
this immons who served as research assista 

roject, 


in 
limi 
that 


Co; 


Con- 


appreciation 


74 for females. Test-retest relia! 
.82 for females. In a validational study w 
be positive! 
d with cerebrotonia, 


bility (6 weeks) was .86 for males, 
ith 204 undergraduates, scores on 
ly correlated with self-ratings of 
as hypothesized. 


a personality dimension of possible interest 
and usefulness. Wallen (1945) suggests that 
food habits reflect many family practices 
and attitudes. It would be expected, then, 
that the recurring events associated with 
eating throughout infancy and childhood are 
related not only to adult attitudes toward 
food, but also to a number of other attitudes. 
For example, there is some evidence that the 
greater the variety of foods which children 
like, the more positive is their self-concept 
and their mother-concept (Davids & Lawton, 
1961). In addition, early food experiences 
have long been discussed by personality 
theorists as important determiners of a 
series of characteristics related to “oral 
dependency” and similar formulations. 

In view of these considerations, a scale 
which measures general food attitudes would 
seem to be a potentially useful tool in per- 
sonality research. The construction of such 
a scale and the determination of its reliabil- 
ity are described in the following section, and 
an initial validational study is reported in 
a later section. 


CONSTRUCTION OF THE SCALE 


Initial Item Selection 


The intent was to construct a series of True-False 
items which would cover an array of eating experi- 
ences. A series of 221 items were written, and these 
were grouped into three categories. The first, Past 
Attitudes and Habits, contained items dealing prima- 
rily with childhood events. Examples are: 

15. Discipline was usually enforced shortly before 
or after the evening meal. 

17. My father enjoyed eating. 


215 


216 


25. Less than an average amount of conversation 
occurred at mealtime in my family. 

28. Business matters were often discussed at meals 
(chores, etc.). : 

31. Sometimes my mother would give me my 
favorite food when I was sick or unhappy. 


The second section, Food Preferences, more or less 
followed the form of the various food aversion tests. 
Examples are: 

65. I like apricots. 

92. I like chili. 

110. I like green salad. 

120. I like liver. 

145. I like sardines. 
The third section, Present Attitudes and Habits, 
dealt with current behavior. Examples are: 

171. In general, I prefer a slow leisurely meal to 
a quick, hurried one. 

177. I do not care much for desserts. 

186. A good wife must be a good cook. 

206. Watching people eat makes me hungry. 

208. I often buy refreshments at movies, ball 
games, etc. 


The items were arranged in a booklet entitled Survey 
of Eating Habits; responses were recorded on IBM 
answer sheets. 


Item Analysis 


Because there is no obviously acceptable external 
criterion of food attitudes, an internal consistency 
item analysis was employed. A number of investi- 
gators have reported significant sex differences in 
the food aversion studies (Smith et al, 1955a; 
Smith et al, 1955b; Wallen, 1943), and it was 
decided to perform separate item analyses for male 
and female subjects, 

An initial scoring system was devised in the 
following manner, Two of the authors (DB and EJC) 
independently designated which of the 221 items 
appeared to be the most definitive measures of a 
dimension of food attitudes, There was agreement 
on 50 such items, and these constituted the initial 
scoring key for the test. 

The 221-item questionnaire was administered to 
400 students (200 males, 200 females) enrolled in 
the introductory psychology course at the University 
of Texas. Sample I consisted of 100 males and 100 
females. For each sex, a frequency distribution was 
plotted, and those subjects falling in the upper and 
lower 27% of the distribution were the criterion 
groups in each item analysis (Kelley, 1939). Utilizing 
Flanagan’s Table (Thorndike, 1949, pp. 348-351), 
product-moment correlations between responses to 
each item and total score were determined. This 
procedure was repeated for Sample II (100 of each 
sex) as a cross-validation, 

For each sex, those items which yielded cor- 
relations significant beyond the 5% level in both 
samples were designated as scorable items in the 
final scale. For males, 62 of the 221 items met this 
criterion. For females, 26 of the same items reached 
statistical significance in both samples along with 
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in 
an additional 39 items. Thus, the final scoring 
system is based on 101 items. 


Reliability 


A new sample of 73 male and 86 female sud 
was drawn. The 221-item Food Attitude rnal 
(FAS) was administered, and coefficients of nt 
consistency were computed for each sex. For ae 
scores on odd and even items correlated .73) ales, 
rected by the Brown-Spearman formula. For fem: 


ee s T4 
it- iability coefficient Was «z; 

the corrected split-half reliability males, 55 
erio 


the instrument is suffiicently reliable for T ility 
use; however, the magnitude of these reha? 
coefficients indicates that further work ER 
selection is warranted in order to increase cons! 
of measurement. 


ncy 


Discussion i 
perso? 
trume 
Jationa 


Any meaningful discussion of the 
ality dimension measured by this ins 
must await the accumulation of corre 
and experimental data. However, it nA 
of value to examine briefly the conte ale 
the items which comprise the final 5 jn 
Fully realizing the dangers involve ects” 
naively assuming that item content rectal 
even the phenomenological world of su oa ed 
the following generalizations are pre eril 
tentatively as a summary of the MaF 4 
contained in the scored items an 
possible source of hypotheses. 

Despite differences in specific ay 
males and females are quite similar ™ pj- 
kinds of items which constitute ee 
tudinal dimension, Individuals who 2” op- 
the positive (food liking) end of the ative 
tinuum, in contrast to those on the we pat 
(food disliking) end, tend to indica asi 
mealtime during their childhood Š all 
pleasant social experience in yi in 
members of the family participate SE of 
unhurried and often elaborate (f0W® "on 
candles on the table) ceremony. L!8 eatin’ 
versation took place during and afte men 
without unpleasantness such as BER Y 
disciplining, scolding, or assignon ed E 
chores. All members of the family a god 
eat, and the mother was perceived rousen! 
cook who enjoyed this particular utili, | 
task. In addition, food seemed to Lan ee 
by parents as a reward in that favor 


conte! 


Foop ATTITUDE SCALE 


ae prepared when the child was sick or 
sane or on special occasions such as 

days; going to a restaurant was a 
Special treat. 

With respect to present attitudes, those on 
vee end of the dimension still regard 
is atime as a pleasant activity, and food 
Re Pparently utilized by these individuals 
A self-administered reward (celebrating 
W pat events by going to a restaurant, eating 
bes depressed, etc.). Many kinds of activi- 
indi are made to include eating in that these 

Widuals say that they buy refreshments 
on aed and ball games, eat peanuts at 
watchin like to smell food cooking, eat while 
agree te TV, and both males and females 

k at a good wife must be a good cook. 
o indicate that they like desserts, 
and tke crave sweets, like foreign dishes, 
Say th e to try new foods. In addition, they 
ive at they tend to gain weight, are talka- 

» that children should not be forced to 
tare ce that indigestion and heartburn are 
food- hus, favorable present attitudes toward 
o H ited activities appear to be related 
Ciateq istory of positive reinforcement asso- 

with food and eating. 
i peerilic food preferences are diffic 
tes Dret meaningfully. In this section of the 
ang items which differentiated those high 

ow in food liking exhibited the least 
jects nt of overlap for male and female sub- 
iffere ily three of the food preference items 
Sexes, tiated high and low scorers of both 
it chicken, hot dogs, and lettuce. 
Other , Tespect to sex differences per sê 
have Vestigators have reported that females 
Maley Shificantly more food aversions than 
(19q,(Smith et al., 1955a, 195Sb). Wallen 
Socioeg found that among subjects of low 
fooq a nomic status females disliked more 
Ce an males, but he found no sex dif- 
Staty S for subjects of high socioeconomic 
log ¢ On the present test for a sample of 
Aversion ness the mean number of food 

San ns (out of 104 foods) was 21.02; for 
This Die of 98 males the mean was 19.01. 
cant Cp rence was not statistically signifi- 
50 may. 16, df = 202). Another sample of 
sex sles and 50 f cred ase 

“differ, $ emales was se ecte 3 
As g ow, ence item analysis of the food list. 

n in Table 1, 15 of the foods yielded 


ult to 
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TABLE 1 


ITEMS YIELDING SIGNIFICANT SEX DIFFERENCES 
Ix Foop AVERSION 


Percent- Percent- 
age of age of 
males females 
indicating indicating 

Food dislike dislike x? 
Calves’ brains 70 90 6.25** 
Celery 32 8 9.00*** 
Clam dip 46 74 8.174% 
Eggs 4 18 5.01* 
Hominy 28 52 6.00** 
Kidney stew 80 94 4,33* 
Lamb 20 38 3.93* 
Left-overs 24 44 4.46* 
Nuts 2 14 4.89* 
Oysters 26 64 14.59**** 
Tripe 64 96 16.00**** 
Turtle soup 60 88 10.19*** 
Waffles 4 18 5.01* 
Watermelon 2 12 3,84* 
Whole wheat bread 6 22 5,325 

* 
=p $o 
++p <01. 
p < 001. 


significant sex differences. In only one in- 
stance (celery) did males indicate greater 
dislike than females. 

The relatively low reliability of the FAS 
suggests that the test is not a homogeneous 
measuring instrument. It was found that 
when the three sections of the test are scored 
separately, the highest relationships for each 
sex were between Past Attitudes and Habits 
and Present Attitudes and Habits. As shown 
in Table 2, Food Preference scores were 
unrelated to either of the other sections in 
a group of 104 females and related only to 
Present Attitudes and Habits in a group of 


98 males. 


TEMPERAMENT TYPES AND ATTITUDE 
Toward Foop 


Background 

Many personality theorists have described 
personality traits or types in which food- 
related behavior, or more generally orality, 
plays a prominent part. For example, Freud 
and Abraham describe the oral-passive type, 
and there is Fromm’s receptive orientation, 
Adler’s pampered child, and Horney’s com- 
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TABLE 2 


INTERCORRELATIONS OF THE THREE SECTIONS 
OF THE FOOD ATTITUDE SCALE 


Males Females 
Past Present Past Present 
atti- atti- atti- atti- 
tudes tudes tudes tudes 
and and and and 
habits habits habits habits 
Past attitudes .10 35" —.16 223* 
and habits 
Food 320 —.06 
preferences 
.05 
=b S21 


pliant character. The behavior proposed by 
these various analysts for such individuals 
includes a tendency to gain support from 
parents, friends, authorities, and God; a 
tendency to be optimistic, friendly, and 
loving when things go well and anxious and 
distraught when rebuffed; the seeking of 
consolation in eating and drinking because 
being fed is equated with being loved; con- 
forming behavior; and a desire to please 
and succeed without effort (Munroe, 1955), 
These clinically derived observations suggest 
a series of studies relating the FAS to these 
various oral characteristics, 

A first step, however, would seem to be 
that of determining whether or not scores on 
this scale are, in fact, related to a more 
general personality variable of orality. One 
personality theorist who has described an 
oral dimension is Sheldon (1942). It is pos- 
sible to utilize Sheldon’s definitions of the 
“three primary components” of temperament 
for research purposes without any as- 
sumptions about possible interrelationships 
between somatotype and temperament. Obvi- 
ously, his viscerotonic characteristic con- 
stitutes the sort of personality dimension 
with which the food scale would be expected 
to correlate. He has defined 20 rather specific 
traits which make up each of his tempera- 
ment components, thus facilitating the con- 
struction of an appropriate measuring device. 

The viscerotonic component is character- 
ized by relaxation, love of comfort, an oral 
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orientation, sociability, affectionate behaviot, 
etc. If these characteristics do form a na 
eral personality trait, the more Ta d 
dimension of food attitudes should be me. 
to it. Therefore, it is hypothesized that 
FAS is positively related to viscerotonia. 

The somatotonic component is chara i: 
ized by assertiveness, energy, a mose a 
orientation, dominance, courage, car 
ness, competitive behavior, etc. There fot 
not appear to be any a priori ee 
predicting either a positive or a a 50 
relationship between these traits an 
attitudes. 7 

The cerebrotonic component is characte 
ized by restraint, overintensity, & “re ity; 
orientation, need for privacy, sens! i to 
introverted behavior, etc. With repr r 
food, Sheldon (1942) suggests tha 
viscerotonics, the food-taking time 15 
high spot of the day while cerebrotone p 
to eat quickly and have it over with; Ce mony 
tonics care little for service and cere at 
or for fancy dishes. It is hypothesize tonia- 
the FAS is negatively related to cerebro 


cter- 


wan 


Constructing a Temperament Scale 


Attempts to measure Sheldon’s temperam 


procedure involved ratings on a i 
made by an examiner who observed | seat situ? 
closely for at least a year in many aia analyt 
tions and conducted not less than 2 r 
interviews. roach E 

For most research purposes, such an Mee tiga” 
impossibly forbidding. In a subsequent multiP’ 
tion, Child (1950) constructed a 66-item pe ame? 
choice questionnaire to measure these tem 2 
components, two ° m 

For the present validational study, item seli 
authors (CG and DB) constructed a 60-40" igon 


e 
rating scale which was designed to follow F pie’ 
trait descriptions as closely as possible. 
are: 

Viscerotonia «porate (pot? 
1. My- responses are slow and deliberé by 

verbal and motor reactions). orted ng 
28. I like to be surrounded and suPP°" mo 


p i ne in 
others (I love company, enjoy being ° eople 


many, feel a genuine fondness for P! do™ 
general). ae (sel atle 
37. I freely communicate my eelinfo, j 
hold things back, little emotional inhi riod 
repression of emotion). pest Pot I 
47. I think childhood is perhaps the b°, gult 


of life (more fun to be a child ien ti 
enjoy children and get along with the 
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TABLE 3 


SpLrtT-HALF RELIABILITY COEFFICIENTS 
FOR THE TEMPERAMENT SCORES 


Males Females 
Yiscerotonia 55 68 
ematotonia .76 ‘80 
Cerebrotonia 20 56 
Somatotonia 
10, I 


seek and enjoy physical adventure for its 


ow: i k 

toug sake (like speed, danger, horses, large dogs, 
Sh sports), 

toms love being out in the open (dislike small 

fte ‘Md Places in which it is difficult to move 
Uses ike large rooms, life on a grand scale, 
33 0P hills). 

Openly net the influence of alcohol I become more 


and are uninhibitedly aggressive, more expansive, 
ull of a sense of power. 

regard the period of early manhood or 

C Ood as the best time of life. 

“rebrotonia 

later feel that the greatest happiness lies in the 

freg (4CC2des of life when an individual is relatively 

Om tenseness and emotional insecurity. 

Prefer to be alone when faced by 


Womanh, 


Be Brief trouble 
am physiologically overresponsive (easily 


» Poor digestion, frequently constipated) . 


Haye T 
picate 
d me and 


Otten hae very aware of everything aroun 
Ite Ve a feeling of apprehensiveness. 


ms 
tangeg Presenting the three components 
Song ìn random order, For each item, 


were ar- 
subjects 
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Procedure 


Both the Food Attitude Scale and the tempera- 
ment scale (labeled “Behavior Rating Scale”) were 
administered on separate occasions to a sample of 
204 students (98 males, 106 females) enrolled in the 
introductory psychology course at the University 
of Texas. 


Results 


First, the reliability of the three tempera- 
ment scores was investigated. As may be 
seen in Table 3, the corrected odd-even cor- 
relation coefficients reveal that the internal 
consistency of these temperament scales is 
relatively low; only for somatotonia do they 
approach adequacy. 

Despite the magnitude of these reliability 
coefficients, scores on the FAS were cor- 
related with each temperament variable, 
separately for males and females. In Table 4 
these coefficients are given. As hypothesized, 
scores on the FAS are positively related 
viscerotonia and negatively related to cere- 
brotonia, Somatotonia and FAS were un- 
related in males, as predicted, but an un- 
expected positive relationship was found for 
females. 

In addition, separate correlations with the 
temperament variables were obtained for 
each of the three sections of the FAS as 
is also shown in Table 4. With only one 
exception (females for the cerebrotonia 
component), the relationships are accounted 


N ed o i : Almost j h 
Cver, thiieanena bod pace. Ba Usually. for by the Past Attitudes and Habits and by 
3 ' 3 
TABLE 4 
RELATIONSHIPS BETWEEN [OOD ATTITUDES AND TEMPERAMENT COMPONENTS 
i Females 
Males 
n Viscero- Somato- Cerebro- 
Viscero-  Somato- i tonia tonia tonia 
X tonia tonia ton — = 
atti mn k = 
Sale tide 21* 09 — 290" es 
‘St a, * 
ttitu + Ra 22 — 18 
ey habite .22* 08 —.26 
pa —.06 — 26" 
P ferences 02 03 =i sit : / : 
and t Attitudes —.20* 23* 20 =i 
habits 22 08 
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CORRELATIONS BETWEEN TEMPERAMENT SCALE ITEMS AND FOOD ATTITUDE SCALE SCORES 
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TABLE 5 


r for males 


r for females 


Item 


34ee* 


—.270* 


-20* 


.23* 


41*** 


—.32*** 


35te* 


30. 


45. 


wn 


16. 


26. 


36. 


41. 


48. 


57. 


54. 


58. 


5 S ith eating 
- I take pleasure in eating and the ceremony connected with | 


11 but 
- I have a strong need for privacy (I would like to be cut off from 4! 


: r ; it 
. Iam relatively complacent (see little hurry or urgency in any $! 


. I turn to action when confronted with trouble or tragedy. 
+ I bear injury and pain lightly (relatively oblivious to phys! 


P nes . ivi o! 
. I have an intent and sensitive-appearing face (all activity 


- People say that I look younger than I am. 

. Tam relaxed and without tension. 

- My voice is unrestrained and carries well, n uf 
. I have few mood changes (feelings are steady, dependable, 7 


i 

. I am sensitive to pain (dislike doctors and dentists intensely» 
z i 
- I feel alert and active (sit, stand, and walk erectly and ¥ 


* . tro! 
. Tam not very intense in mental and moral outlook (lack of § ited 


. People say that I look older than I am. 


(well-set table, company, candles). 


A smoulty in 
I do not sleep at night and feel fatigued during the day (difficulty 
going to sleep, light sleeper, wake easily in the morning). 


My sleep is deep, easy, and undisturbed (go to sleep quickly, 


sleep 
through the night). ! 


one or two intimate associates), 


Pe s intens? 
I prefer to remain silent about my feelings even when they are | 
(dislike outward expression of joy, sorrow, etc.). 


; st 
T enjoy competition (put myself forward, not afraid to ask qu 
not easily embarrassed), 


ions, | 


: dislik® | 
I dislike being socially involved (avoid social gatherings, 
comradeship beyond a few intimates). 


+9 politer 
I like to do what is accepted and approved (prescribed routine Teal 
manner of approach to what I seek, the ceremony related tosu | 
as births, marriages, and holidays is important to me). 


e fa 
I feel the need for people when troubled (seek people in att 
stress, fee] better after telling others a problem, seek comp’ 
during times of sorrow). 


r ing, sper” 
I do things heartily, thereby making noise (in walking, moving» 


ing, laughing, coughing). uatio™ | 


cross bridges when I come to them, let sleeping dogs lie). 


ical trail 


find cold showers stimulating, do not flinch when I get a shot): cial 


musculature is rapid and alert). 


changing, harmony and evenness of outlook). slik? | 


cold showers, apprehensive of anticipated pain). goron 


without effort, use vigorous gestures), gave 
ne pon 


except for the elementary biological needs, do not get a 
most issues). bj 
I lack uniformity in behavior and outlook (attitudes a ; eds 
sudden and disconcerting change, abrupt changes of min¢/- one 


E u 
Iam not particularly sensitive to the less obvious or more $ 
of others, 


ect te. 4 


Foop ATTITUDE SCALE 


ne Present Attitudes and Habits rather than 
e Food Preferences section. 
aa further analysis was undertaken. Each 
SA he 60 items of the temperament scale 
epee with the FAS for males and 
‘males separately. The items yielding the 
ighest coefficients are shown in Table 5. 
again sex differences are apparent, 
a etme that this oral dimension has a 
gs different meaning in males and 
fit ang It would appear that male subjects 
a e description of an “oral character” 
Mewhat more closely than females. 


Discussion 


ia hypotheses were confirmed, but the 
ing! i of the correlations is disappoint- 
a M ow. Even discounting the low reli- 
EES. of these research instruments and 
Coena for attenuation, the resulting 
Possible ts are still relatively small. It 1s 
ie, that the various elements proposed as 
re ‘ache of an oral personality dimension 
Cal ae highly related than various theoreti- 
th t pmulations would suggest. It may be 

food attitudes are highly related to a 


ew « 
Others. characteristics” and unrelated to 


I 

MPLICATIONS FoR FURTHER RESEARCH 
the Food 
While it is 
research, 


{etitional work in refining 
sufjest® Scale would be desirable 
Steater Uy reliable for exploratory a 
Pugh consistency of measurement should be 
Th in further refinements of the scale. 
tiong Ow interrelationships of the three sec- 
is Of the scale suggest that this dimension 
Shou a unitary one; 4 factor | analysis 
atti Yield more homogeneous dimensions 

oaks toward food. : 
imensi, investigations with this personality 
in gOn are suggested by previous research 
“tan, i. area, First, there has been 4 long- 
"lation® interest in child-rearing practices 1 

n to eating problems. Several items 0D 
Suggest that the families of food 

a ave a permissive, rewarding, an 
family orientation toward mealtime 
suq} Y- And studies of parental behavior 
(losy) that of Sears, Maccoby, and Levin 
indicate that the mothers of children 
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with feeding problems tend to be relatively 
punitive, cold, and rejecting. Thus, the study 
of the antecedents of food attitudes in 
parent-child interactions would seem to hold 
promise. 

A second area of potential interest is that 
of the oral character and the personality 
variables which various theorists suggest 
should be interrelated. In the present work, 
modest relationships were found between 
FAS scores and viscerotonia and cerebro- 
tonia. Other aspects of orality may also be 
found related to food attitudes. For example, 
Blum and Miller (1952) measured orality 
in children by counting nonpurposive mouth 
movements in a series of time sampling pro- 
cedures. They found that orality was posi- 
tively related to consumption of ice cream 
over a 3-week period, ratings by teachers 
of eagerness at lunch time, ratings of eager- 
ness to have other children like them, de- 
pendency on adults and other children, and 
taste suggestibility, and negatively related to 
popularity. It is hypothesized, then, that food 
attitudes should be related to the sorts of 
behaviors investigated by Blum and Miller 
as well as to affiliation need, religious beliefs, 
alcohol consumption, conformity, predispo- 
lopment, etc. Because of 


sition to ulcer dev 
the consistency with which sex differences 


were found in the present research, it may 
be well to investigate these relationships with 
orality separately for males and females. 
Not only are those high on the oral 
dimension supposed simply to enjoy food and 
drink, but it has often been hypothesized that 
they tend to respond to anxiety and depres- 
sion by eating and drinking (Fromm, 1947). 
everal items on the FAS lend sup- 


Again, Si J 
ot to this general proposition. Thus, the 


chological components of obesity may 
ai ly positive food attitudes and 


combine strong! ‘ 
anxiety. Conversely, extreme underweight 


could involve strongly negative food atti- 
tudes plus anxiety. In a situation in which 
anxiety was aroused, differential predictions 
would be made about the food intake of those 
scoring at opposite extremes of the FAS. 
Any situation in which hunger is aroused 
ad to different response 


ld conceivably le i 
patterns on the part of those at opposite 


) 
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extremes on the food attitude dimension. For 
example, under deprivation conditions food 
likers, in contrast to food dislikers, might 
be expected to produce more food-related 
imagery in thematic apperception productions 
(Atkinson & McClelland, 1948), to be more 
sensitive to food stimuli presented sublimi- 
nally (Byrne, 1959), to learn and to recall 
food-related words more readily (Epstein & 
Levitt, 1962), and to have greater difficulty 
in adhering to restrictive diets (Summerskill 
& Darling, 1955). 
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VERBAL CONDITIONING OF SCHIZOPHRENICS AND 
NORMALS IN A SITUATION RESEMBLING 
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10 normal Ss and matched Ss diagnosed “schizophrenic 
as characters themselves, the experimenter, and 


ry E was as neutral as possible, but in 
he experimenter (the criterion response) 
1 cues (“mmm-hm,” nods of the head, etc.). 


as tested and rejected. Normals not only emitted a 
iterion responses under reinforcement 


but also conditioned more thoroughly (p = .01). 
discussed. 


to tell two stories including 


two other people. During the first sto’ 
the second, E reinforced references to t 


using both verbal and gestural 
The null hypothesis w: 


significantly greater percentage of cr 


than did the hospitalized Ss, 
Implications for psychotherapy were 


ie © Present study was undertaken to in- 
bita differences existing between hos- 
with eed (NP) subjects and normal subjects 
i egard to a type of verbal conditioning 
Situation resembling psychotherapy. 
Rages in the literature, one can find 
any us studies of verbal conditioning, 
tas dying rather ingenious devices to 
Btoups the “conditionability” of various 
Teview of subjects (see, for example, the 
OWey, of Salzinger, 1959). Few studies, 
8oals er, can be found which satisfy both 
£ tipa the present undertaking: (a) to 
a Verbal hospitalized and normal subjects on 
this to , Cnditioning task; and (b) to relate 
f a psychotherapeutic type of situation. 
Honin the numerous reports on verbal condi- 
“Yes „8 in which the familiar “mmm-hm” or 
etc. are employed, those which ap- 
Psycho be most readily identifed with a 
Ration herapeutic situation are the investi- 
Storyts Which have utilized conversation, 
(195 “lling, and/or interviews. Verplanck 
© on for example, found it to be possible 
thro, otrol the content of a conversation 
o ent 0 i 
reinforcement. McNair (1957) m- 


Elsewh 


Dear 


1 
th, Thi 
re first report is based on a study submitted by 
put eee as a master’s thesis to the Graduate 
Ra 1962 the Fort Hays Kansas State College, 
Cis z 

vit po ters would like to thank Steve Pratt, 
sty pital *chologist, and others at Larned State 
v. or their aid and cooperation in this 


” were asked individually 


creased the rate of verbalization while sub- 
jects were talking about photographic slides. 
Mock (1957) had schizophrenic subjects tell 
20 10-minute stories and found in general 
an increase in positively reinforced responses 
and a decrease in negatively reinforced re- 
sponses. Krasner (1958), employing a similar 
method, found that the schizophrenic sub- 
jects’ references to a preselected class of 
words increased or decreased in relation to 
the experimenter’s reinforcement. Salzinger 
and Pisoni (1958) showed that it was pos- 
sible to condition self-referred affect state- 
ments in both normal and schizophrenic sub- 
jects during an otherwise standard clinical 
interview. 

Several writers (Dollard & Miller, 1950; 
Pascal, 1959; Wolpe, 1958) have reported 
behavioral changes during psychotherapy 
through the application of such learning 
es as counter conditioning, generali- 
zation, discrimination, extinction, and various 
methods of reward or reinforcement. The 
problem which seems to exist for most 
learning-oriented psychotherapists is what to 
reinforce and how to reinforce it. That most 
all psychotherapists employ some of the 
above principles in therapy, perhaps unwit- 
tingly, is obvious. Dinoff, Rickard, Salzberg, 
and Sipprelle (1960) pointed out that in a 
psychotherapeutic situation, a person’s re- 
marks could be classified into three general 
groups: references to his environment, refer- 
ences to himself, and references to the thera- 


principl 
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224 J. Stecuta, W. Gwynn, AND C. PEOPLES 
TABLE 1 
SUBJECT VARIABLES 
Experimental subjects Control subjects 
Marital Years in Marital xeni 

Subject Age Sex status school Subject Age Sex status si 
1 29 F M 14 1 29 F M m 
2 29 F M 14} 2 29 F M 12 
3 27 F D 12 3 26 F M 12 
4 27 F D 12 4 27 F M 12 
5 27 F M 12 5 27 F M 12 
6 27 F M 12 6 27 F M 12 
7 27 F M 1 7 27 F D 12 
8 19 M g 12 8 19 M S 14 
9 28 M S 143 9 28 M S 15 
10 30 M S 15 10 30 M S 

pist. Various types of therapy emphasize PROCEDURE 


the importance of one or another of these 
areas, but regardless of the focus, any 
psychotherapeutic relationship is dependent 
upon the client’s ability to respond to the 
therapist and the extent to which he can use 
the therapist’s reactions to modify his own 
behavior. 

Dinoff et al. hypothesized that the success 
a therapist has in establishing a relationship 
with a patient might be determined by the 
ease with which the patient progresses from 
references to the environment to himself to 
the therapist. Enlarging on this hypothesis, 
it is possible that the extent to which an 
individual is capable of entering productively 
into therapy might be determined by his 
ability to be conditioned to include the thera- 
pist in his verbalizations. This might be 
characterized by a rise in the frequency of 
references to the therapist. 


In an attempt to duplicate the conditions 
of a psychotherapeutic situation, a group of 
subjects diagnosed “schizophrenic” and a 
group of matched normals were seen indi- 
vidually by the same experimenter and asked 
to tell two stories. An effort was made to 
increase the frequency of occurrence of one 
class of verbal behavior in the second 
story. The null hypothesis of no difference 
between groups with regard to conditioning 
was tested. 


re? 

Subjects. The subjects for this study wet? ee 
male and seven female patients at Larn agnosci 
Hospital, Larned, Kansas, who had been d seve” 
as chronic schizophrenic and three male “il sub- | 
female normal control subjects. The pF the 
jects were matched as closely as possib n edu” 
schizophrenics with regard to age, sex, i subject 
cation. Table 1 gives a short resumé of t or bette 
variables. All had a high school guest en a , 
and the median age of both groups was ‘to the, 

Method. The method used is similar norm? 
which Dinoff et al. (1960) employed wit F: iyen 
subjects. Before any formal instructions we n yer 
to the subject, the experimenter entered ma hav? 
sation with him. Since lengthy instructions mr some 
been confusing to the experimental subjec o pa 
instructions were included in the gate) iom 
conversation to both groups. In this con ske x 
each subject was told that he would op T settint 
tell two stories employing any plot o*ecorder 
desired and that the stories would be tape to th 
The following instructions were then rea 
subject: 


sto! 

I would like to have you tell Wae a 
including you, me, and two other Ps lor 
characters. Make them about 5 minu ‘ou, a 
Any questions? Remember to include Y 
and two other people in the story. the 


: story? ing 
While the subject was telling the ee Duy 
experimenter remained silent and mohoa ced bY 
the second story the subject was reinforce" me 


i f 
experimenter each time he included the er a 
in his story. Reinforcement consisted of cb soile. 
ding, leaning forward toward the supligood, an 
and/or remarks such as “mmm-hm, order” jen 


These were administered in a variable 


sotis 
f alist! 
effort to render the situation more T° 
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aoe psychotherapy session. In case a subject 
ment either story short of 2 minutes, the experi- 
menter said, “Please make the story longer,” or 
a me more,” 2 
<a story was presented in typed form, with 
ivo S numbered and divided by red slashes, to 
or Fation. who were naive regarding the purpose 
were ae e of the study. The judges (two students) 
Criteria ruta aa terms of concise, unambiguous 
Story SA consider each separate phrase in each 
Categories _ determine to which of the following 
Expetime, it pertained: (a) Environment—E; (b) 
or (d) P (for therapist); (c) Subject—S; 
or each mbiguous—A, Scoring sheets were provided 
nated e are and the judges independently desig- 
Were ae phrase as E, T, P, or A. The stories 
ascertain ma So the judges would be unable to 
Presented Which group they were from and were 
lons whi is the judges with the following instruc- 
et al, (1960) follow closely those developed by Dinoff 


The i. 
stories you are to score should include at 


Cast ; 
th four characters, They are: the person telling 
the 


e 
amina” whom we shall designate as (P); t 
te as or person listening, whom we shall desig- 
We shall T); and at least two other people whom 
Other th, designate as (E). In addition, all people 
vitron an (P) and (T) and all things in the 
including nt and situations other than those 
ou 8 (P) or (T) shall be designated as (E). 
m e to score these stories, statement by state- 
about p determine which numbered statements are 
i Js (T), or (E). The following four scoring 
oe are to be used as the score which you 
ovided d on the scoring sheet which will be 


1, 
s Si When a statement includes any reference 
N Xaminer (or listener) including him alone 


> 0; e 
E Y, him in connection with the story teller 
° the Or (T, E)—score “T”, ie, Ony reference 

2, (plistener. 


ence 


When a statement includes any refer! : 
an 


e 
Phrase (P) alone or to (P, E)—score “p” ies 
3. (gy luding the listener. 
$ when a statement does not include (P) 
the story. 
o just who 
scored “A”. 
y reference 


it te eae “E”, ie, others in 

eludes ;2 „Statement is unclear as ti 

to emempe is ambiguous and is to be 

e Ji r to score (T) if there is an 

c en and this includes any ee m 
ìs which a personal pronoun referring 

Phrase, nen ds tnitieslond on oe context of the 


OF th Resutts AND DISCUSSION 
3 sup; total of 2,039 responses given by 
agg 4° two judges agreed on 2,033, Or 
h aiqu Indicating a highly reliable scoring 
tio d ina, For the remaining 36 responses @ 
‘Ong | C8 was asked to perform classifica- 
the judgment of the two judges 
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who agreed on these 36 responses was used 
in computation. 

The data were analyzed in terms of fre- 
quency of occurrence of the criterion re- 
sponse in each story, with the criterion re- 
sponse being reference to the experimenter. 
Because the stories varied somewhat in 
length, the percentage of references to the 
experimenter in relation to total number of 
phrases was used in analysis. The binomial 
expansion and the Wilcoxon T test were 
employed. 

Table 2, Columns A and B, shows the num- 
ber of criterion responses given by each sub- 
ject at the operant level. It is seen that all 
normal subjects made at least one reference 
to the experimenter while three schizophrenic 
subjects made no such reference. The greater 
variability of the schizophrenic group is ap- 
parent from an inspection of the ranges 
which were 1-15 criterion responses for nor- 
mals and 0-24 for schizophrenics. It might 
be argued that there is justification for 
disregarding the data of Experimental Sub- 
jects 3, 5, and 8 because they made no refer- 
ence to the experimenter. If this is done 
and the corresponding control subjects are 
discarded, there is even less difference be- 
tween the two groups. 

Table 2 compares the frequency and per- 
centage of criterion responses emitted by 
normals and schizophrenics under conditions 
of nonreinforcement and reinforcement. It 
is apparent that schizophrenic subjects, al- 
though increasing the percentage of their 
criterion responses slightly, did not change 
significantly from nonreinforcement to rein- 
forcement, while normals changed signifi- 
cantly in both direction and magnitude. In 
other words, normals not only emitted a 
significantly greater 


percentage of criterion 
responses under reinforcement than did 
schizophrenics, but also in comparison with 
the hospitalize 


d group, normals conditioned 
much more thoroughly. 


terpretations of the observed 
bal conditioning could be 
i i ested 

anced. An appropriate one is sugg 
2 Hartman (1955) who mentioned the 
immunity tO social stimulation which is fre- 
quently seen in schizophrenics. This is con- 


A number of in 
difference in ver 


226 J. SLecuta, W. Gwynn, AND C. PEOPLES 
TABLE 2 
FREQUENCY AND PERCENTAGE oF T RESPONSES 
Nonreinforced Reinforced 

= : z izophrenics 

A. Normals B. Schizophrenics C. Normals pas 

Pair 7 % f % F; % £ : 
33 

1 1 2.9 24 82.8 22 44.9 2 38.3 

2 2 4.2 10 29.4 47 47.0 23 46.1 
3 7 11.7 0 0.0 25 39.7 14 11.6 
4 15 39.5 2 4.4 21 35.6 5 0.0 
5 1 3.6 0 0.0 45 75.0 0 11.7 
6 12 20.0 4 10.5 17 28.9 7 3.3 

7 8 20.0 17 73.9 27 41.5 3 0.0 

8 2 2.4 0 0.0 24 54.5 0 31.4 
9 7 20.6 4 9.3 17 60.7 16 12.5 
10 4 14.8 1 5.3 20 54.0 2 13.5 

M 5.9 13.0 6.2 10.9 26.5 46.2 7.2 


Note.—Comparison of Colu 
binomial P = .01; 
D yields binomial 


mns A and B yields binomial P 
Wilcoxon T = 1.0, p < 0l. B and D yields bi 
P = 01; Wilcoxon T = 1.0, p < 01. 


sistent with the traditional view of schizo- 
phrenics who are said to withdraw from ex- 
ternal realities. Immunity to external stimuli 
implies that overt behavior of schizophrenics 
is more determined by internal stimuli than is 
overt behavior of normals. This suggests 
that the observed difference in condition- 
ability might be explained in terms of a 
difference in susceptibility of the two groups 
to positive external stimuli. If positive stim- 
ulation from the outside is not important to 
schizophrenics, it could hardly be expected 
to alter their behavior to a significant degree. 
It has been reported by various writers 
(Bleke, 1955; Buss & Buss, 1956; Pascal & 
Swensen, 1952) that schizophrenics may be 
more responsive to negative reinforcement 
than to positive reinforcement. 

There appear to be direct implications 
from this study to psychotherapy with 
schizophrenics. Since most traditional psy- 
chotherapy depends heavily on verbal ex- 
change and changes in verbal behavior are 
generally felt to be the sine qua non of gross 
behavioral change, it is readily apparent that 
inability to condition verbally would greatly 
na individual’s chances of responding 
lessan ‘ary 20 Of course it cannot be 
to: psychotherapy. TT en occurred in the 
claimed that no conditioning : 

: since 7 of the 10 sub 
schizophrenic group, 


igl: Wilcoxon T = 26.0 (not significant). jfcant). 
inomial P = .17; Wilcoxon T = 20.5 (not signi 


ge 
jects did show an increase in percents w 
criterion responses under reinforcemen a 
ever, the increase was arbitrarily 1a i 
not of significant magnitude to demo 
an acceptable level of conditioning. 
schizophrenic subjects who showed be o 
change similar to that of the norne r 
would probably make better candi = d 
psychotherapy since it has been dernie j 
that their verbal behavior is modifiable. 
conceivable that a standard verbal co” 
ing technique might be developed for 
a screening device in selecting schizoP 
subjects for psychotherapy. 


itio™” 


preni? 
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Factor scales were constructed to measure the 2 principle factors isolated in a 
factor analysis of 18 scales of the CPI. Correlations of these scales with a 
variety of test and behavioral variables suggests that they are similar to 
factors of neuroticism and extraversion found in previous studies. The scales 
were called Person Orientation and Value Orientation. The implications of 
the findings for test theory were discussed. 


The California Psychological Inventory 
(CPI) consists of 18 scales, most of which 
have been derived by item analysis against 
such socially relevant criteria as social status, 
sex, intelligence, ratings of dominance, etc. 
Many of the scales are correlated substan- 
tially with each other, and have been grouped 
into four general classes by Gough (1957) 
partially on the basis of their intercorrela- 
tions. A factor analysis of the 18 standard 
CPI scales (Mitchell & Pierce-Jones, 1960) 
found four factors which accounted for 
appreciable portions of the variance of each 
of the scales, and some scales had little 
reliable variance other than that accounted 
for by the factors. 

The finding of a few factors to account for 
a substantial portion of the reliable variance 
among the CPI scales is of particular interest, 
since most of these scales were constructed 
by criterion keying. Since each item, by virtue 
of being included in a scale, has been shown 
to be related to some important non-test- 
behavior, it is likely that the factors will 
also be related to important criterion be- 
haviors. This is not guaranteed, however, for 
there are three possible sources of correlation 
between two criterion keyed scales and thus 
of the factors derived from these correlations: 
(a) Common variance between scales may 
represent common variance in the traits the 
scales purport to measure. One would cer- 
tainly expect the “folk concepts” which the 
CPI scales attempt to measure to be cor- 
related in most populations, and this correla- 
tion among psychological traits may be re- 
sponsible for the intercorrelation among the 
scales. To the extent that this is the case, the 


CPI factors would be identical with the 
trait factors. (b) Common variance betwee? 
scales may be due to some aspect of a 
response that is related to the traits wA 
both scales purport to measure, but which e 
not dependent on any correlation among k. 
measured traits. It is possible that such T 
sponse characteristics as tendency tO 
scribe oneself in favorable terms or tendency 
to report interpersonal conflicts with paste 
may be related to a variety of sarge 
independent behaviors, To the extent t o 
the correlations among CPI scales are due 
response tendencies which are common <n 
variety of independent behaviors, the fac nse 
would represent the aspects of item respo e3 
most likely to be related to any new be- 
havioral criterion. (c) Common variance re- 
tween scales may be due to common hi 
sponse tendencies which are not related aul 
traits which the scales purport to mermi 
It is possible that such response charac 
istics as tendency to answer true maya y 
unrelated to the behavioral criteria, yet 
contribute to error in a number of peale ar 
Correlations among CPI scales are three 
likely due to some combination of all a 
of these common response tendencies. 
ever, to the extent that any one predom truc” 
factor analysis and rotation to simple 5 its 
ture should identify that tendency ™ gf 
various dimensions as a factor or factors w 
it should be the case that there are ^ ted 
aspects of item response which are 1° (the 
to a great variety of criterion behaviors ‘spe 
second possibility mentioned above g new 
factors may be quite useful in predicti? pe 
criteria. Whether or not this is the C% 


inate 
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ee of the factors in the CPI and their 

correlates should contribute to the theory of 

ventory tests. 

fe Dhiposes of this study were to factor 

a T, to construct scales to measure the 

a ors, and to obtain preliminary data con- 
rning the correlates of the factor scales. 


PROCEDURE 


ae correlation matrices of the 18 CPI scales 
ite, ed in the CPI Manual (Gough, 1957) were 
mon Separate factor analyses were done for 
matri (N = 4,098) and Females (N = 3,572). These 
of eo pre reported by Gough to be composed 
Rg mean correlations from five separate 
and jons of high school and college students 
matic m inmates, and in the case of the male 
1X, also military officers. 
Separat mal and female matrices were factored 
‘with con by the principal components method 
a ates estimated as the squared multiple 
eb tion of each scale with all other scales. On 
highe asis of the size of the latent roots and the 
Tetai tor, it was decided to 


a P 

“cording to the normalized varimax criterion. 

Mai ae factors obtained from the male and female 
es were very similar. The 
igh loadings of several scale 


only therefore, decided to 9° 

r sine the first two factors, since 
Citetion k of the common vani 
Which eyed scales. In order to 0 
Scales AE relatively independent, ane 
Positive ere selected to represent each factor so tha 
tende and negative loadings ON other factors 
eDres, to cancel each other. The scales chosen to 
i ole sack factor and their loadings are shown 


Bradt for CPIs from a sample © 
for he students by summing the standard scores 
300 supi presentative scales for each f These 
Broup, ECs of both sexes had taken 
ctor a part of several previous 

Scales were obtained for both 


ntainin groups on the basis of 
Cases, phe 2%, 20%, 54%, 20%, 
rue po ber of subjects in each g 

S obt sor each item was tallied, and a D score 
Weightp aned for each of the 480 CPI items by 
anq "8 the group total, respectively, +2, 41,0,—1, 
anq aah his item analysis was done for each factor, 
by ential scales for the two factors Were obtained 


Scting all items which were related to @ 
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given factor at the .01 level and which were not 
related to the other factor at the .05 level. This 
was done in order to keep the correlation between 
factor scales low. This procedure resulted in a 121- 
item scale for Factor I and a 55-item scale for 
Factor II. The procedure described by Webster 
(1956) for discarding items to maximize the 
correlation of the scales with the criterion was 
applied which resulted in the discarding of eight 
items from Factor Scale I and four items from 
Factor Scale II. 

Reliability and factor estimation coefficients of 
the final scales were determined on several cross- 
validation groups which were also used to study 
the relationship of the new scales to other variables. 
These groups and other data available are as 


follows: 

1. A sample of 37 male and 18 female under- 
graduate students, who received five or more 
psychotherapy interviews at the Purdue Psycholog- 
ical Clinic, took the CPI at the termination of 
therapy. At the same time, their therapist rated 
them using an Adjective Check List (Gough, 1960). 

2, A sample of 64 high school counselors were 
tested with the CPI at a counseling training in- 
stitute. Also available on these subjects were the 
Strong VIB, the MMPI, the EPPS, the Guilford 
Zimmerman Temperament Survey, the Minnesota 
Teacher Attitude Inventory, the Ohio State Psy- 
chological Examination, peer ratings on five traits, 
and @Q sorts from which  self-ideal correlations 
were calculated. 

3. A sample of 250 male college freshmen who 
were tested with the CPI as part of a research 
project at their dormitory. Also available on these 
subjects were the Rotter Incomplete Sentences 
Blank and dormitory counselors’ ratings of ad- 
justment. 

4, A sample of 88 female and 58 male psychiatric 
patients in an acute intensive treatment hospital 
were tested with the MMPI which was scored for 
a 46-item scale for Factor I and a 15-item scale 
for Factor JI, Also available on these subjects 
were independent interviewer ratings of the patient’s 
using the Multidimensional Scale for 
Rating Psychiatric Patients (MSRPP) (Lorr, 1953). 

Relationships between the two CPI factor scales 
and the other variables listed above were in- 
vestigated using either point biserial or product- 


t correlations. 


symptoms 


momen 
RESULTS 


The rotated factor loadings for males and 
females are shown in Table 1. As can be seen 
able, the factor loadings based on 


from this t or 
male subjects are very similar to those based 


on female subjects. 


1The writers are indebted to James Linden for 


oviding these data. 
Prg The writers are indebted to John McBrearty for 


providing these data. 
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TABLE 1 
ROTATED Factor Matrix 
2 
Factor I Factor II Factor III l 
1 F 
CPI scales M F M F M F N 
Class I 64 
Dominance 28 23 63 74 s = i j = 66 
Capacity for status = Er 4 95 E ele 68 a 
ae ‘00 10 71 73 27 36 a 0 
Self acceptance —.08 —.06 74 -17 — ie E Be ‘67 
Sense of well-being .70 78 -23 25 7 2 
Class II 50 
48: 
ibilit .68 .67 10 18 —10 -.14 . MM 
oan 58.55 —.04 02 —=25 —.33 E “gd 
Self-control -83 87 —.26 —.24 05 —.15 66 70 
Tolerance At .74 20 .32 .34 322. so 58 

Good impression 17 15 —.05 05 06 —.13 A 21 

Communality .20 .22 16 17 —.38 —.36 
Class III o & 

Achievement via conformance 79 75 19 .23 —.09 —.25 w2 -38 

Achievement via independence -48 .55 09 19 .62 49 ga 

Intellectual efficiency .58 66 51 48 .20 20 
Class IV m F 

Psychological mindedness .45 .49 14 BF 35 .28 0 45 

Flexibility —.01 01 .02 .05 aA 67 ‘og 

Femininity 14 .09 —.24 —18 —.02 —.27 $ 

nia 
; 5 ‘ origi” 

Table 2 shows the loadings on all three as an index of the extent to which oe acto! 
factors of the scales selected as measures of factor score is approximated by t 
the two factors for which scales were derived. scale. waly in- 
As can be seen, the scales selected to rep- The two factor scales were elan $ f 
resent a given factor tend to have loadings on dependent, correlating .27 in the sa” P 
the other factors which cancel each other. 250 male freshman students. 

Table 3 shows the results of applying — 
Webster’s procedure for maximizing the cor- MEANING oF THE SC which 
relation of a test with a criterion. It is There are three lines of evidence holo% 
obvious from this table that this procedure can be pursued for clarification of psy® facto! 
brings little change in cross-validation reli- ical constructs measured by these two inst 
ability, and factorial validity statistics. In scales, namely, the scales or criteria a 
this instance, it did not prove to be worth the which the items were selected, the eos scale 
considerable labor involved in carrying out the items, and the correlations of t fiables: 
the analysis. score with other test and behavioral Te ussed 

The data presented in Table 3 show ac- Each of the two factor scales will be d! 
ceptable reliability and factor estimation in terms of these lines of evidence. 
indices for both scales. It should be pointed , , | 
out, however, that the factor estimation Tector J, Value Orientation tor 

? 


indices are correlations between the scales 
and the factor score criteria which have a 
number of common items and are useful only 


so fac 
The scales which load highly on ne ure 
are reported by Gough (1957) to rity) ad 
responsibility, self-control, and matu 
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TABLE 2 


Factor LOADINGS OF SALIENT SCALES 


USED To OBTAIN FACTOR SCORES 


Factor loadings 


Factor I Factor II Factor III 
CPI scales M F M F M F 
Factor I 
Sense of well-bei 
vell-being .70 .78 -23 25 .07 - 
Responsibility ‘CT to 48 10 x 
Socialization ‘58 55 —.04 02 =25 83 
aT eonia 83 87 —.26 —.24 05 =45 
È erance 71 74 -20 32 34 .22 
= impression out 75 —.05 .05 .06 —.13 
chievement via conformance 79 75 -19 23 —.09 —.25 
Factor Ir 
Dominance 28 23 63 74 —18  —.11 
Sociability 2 31 78 79 —14 —.06 
re presence 00 10 Bye} 73 N 36 
elf-acceptance —os  —.06 74 77 —.14 04 
TABLE 3 


mize complaints 


well 
as a tendency to mini 
tions of others. 


n 

The w concern over the reac t 

the t in the scale can be grouped into 

abilit owing content categories: Emotional 
y and control, denial of impulsivity, 


Eve 
osae Perament and absence of anger an 
ity, absence of anxiety and tension, 
g, dependability 


a health and well-bein 
aii se of duty, happy home life, respect 
ustom, ers, and respect for rules and social 
relates S. This content and the scale cor- 
suggest a general psychological ma- 

3 

rent’ items and scoring direction for the Value 
2g tation scale are as follows: 20 F, 22 p 26 F, 
49 F, 55 F, 


3 

p e F, 32 F, 42 F, 44 F, 46 T, 48 F, 
Ws P, i F, 77 F, 80 F, 81 F, 93 F, 101 F, 103 T, 
120 p 106 F, 109 F, 114 F, 115 P i17 F, 119 F 
Mo T ae T, 132 F, 135 T, 138 T, 141 F, 142 F, 
168 T, 153 F, 155 F, 157 F, 161 F, 164 F, 165 T, 
a F, ts F, 174 T, 175 F, 178 F, 181 T, 183 F, 
SEn ee tos T, 203 F, 206 F, 
a T, ae F, 237 F, 247 F, 248 F, 250 F, 253 F, 
ae T, Bo F, 266 F, 268 F, 270 F, 275 F, 276 T, 
348 F, 299 F, 282 F, 289 F, 291 F, 294 F, 297 F, 
ae F, 351 F, 300 F, 302 F, 307 F, 313 T 336 F, 
aon Fs 380 F, 353 F. 356 T, 362 T, 367 T, 371 T, 
R T4 T, 392 T, 396 F, 398 F, 399 F 406 F, 
ae 43 F, 413 T, 419 F, 420 F, 428 F 431 F, 

Q p’ 439 T, 451 T, 453 F, 454 F, 463 F, 468 F, 


RELIABILITY AND VALIDITY ĪNDICES 
FOR THE Two FACTOR SCALES 


Factor Factor 
Statistic I Ir 
Original scale (before discarding 
items) 
Number of items 121 55 
Reliability (KR-21), original 
derivation sample 87 82. 
Reliability (KR-21), cross- 
validation sample* .89 81 
Factor estimation coefficient, cross- 
96 88 


validation sample® 


Final scale 
Number of items discarded by 
Webster’s procedure 8 4 


Number of items 113 ot 
Reliability (KR-21), cross- 
validation sample* 88 81 
Factor estimation coefficient, cross- 
96 89 


validation sample* 


a The correlation between the factor, score and the factor 
is here called the “factor estimation coefficient." These 
were calculated on the cross-validation sample of 250 male 


college freshmen. 
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turity reflected in a concern for values and 
conformity to conventional standards for be- 
havior. It is suggested that this scale be 
called Value Orientation or V.* 

The Value Orientation scale correlated 
positively with a variety of other scales and 
behavior ratings indicating psychological 
stability, control, and good interpersonal rela- 
tions. It correlated negatively with indices of 
maladjustment and emotionality. Among 
mental patients, it seemed to be related to 
an expansiveness—depressive hostility con- 
tinuum. 

With the three exceptions which are noted, 
all of the following variables were signifi- 
cantly related (p< .01) to V. With the 
MMPI, it correlated positively with K (.53) 
and L (.49) and negatively with Pd (—.34) 
and Sc (—.38). With the Guilford Zimmer- 
man, it correlated positively with Emotional 
stability (.44), Objectivity (.70), Friendli- 
ness (.58), and Personal Relations (.59). 
With the Edwards PPS, it correlated posi- 
tively with Deference (.32) and Endurance 
(.40) and negatively with Aggression (—.35). 
It correlated negatively with a maladjustment 
score on the Rotter ISB (—.31), 

Among mental patients, the shorter version 
of the scale contained in the MMPI cor- 
related positively with psychologist ratings 
of Expansiveness (.36) and Conceit (.36) and 
negatively with ratings of Melancholy agita- 
tion (—.38), Depressive tension (—.29), 

Feeling of sinfulness (—.36), Mood swings 
(—.32), Suicidal preoccupation (—.38), Self- 
Preoccupation (—.31), Lack of faith in self 

—.31), Hostile impulses (—.41), and Per- 
ception of the world as hostile (—.42). 

Among counselor trainees it correlated at 
the .05 level with peer ratings of Professional 
attitudes (.31), Improvement of professional 


4 The scale titles “Value Orientation” and “Person 
Orientation” were suggested by Harrison Gough. 

5A shorter version of V can be scored from the 
46 scale items which appear in the MMPI. The 
MMPI booklet numbers and scoring direction for 
these items are as follows: 3 T, 28 F, 29 F, 36 T, 
45 F, 63 T, 68 T, 80 F, 93 F, 9 T, 97 F, 104 F 
117 F, 118 F, 124 F, 139 F, 145 F, 146 F, 157 F, 
175 T, 181 F, 198 T, 225 F, 226 F, 251 F, 277 F 
290 F, 306 T, 323 F, 333 F, 348 F, 381 F, 386 F. 
395 F, 406 F, 438 F, 459 F, 464 T, 469 F, 472 F, 
481 F, 485 F, 516 F, 525 F, 545 F, 558 F. 
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competence (.31), and Extent of peer inter- 
action (.30). High scorers among the clinic 
clients tended to be rated by their therapi 
as Conservative, Curious, Dependable, Ideal- 
istic, Industrious, Responsible, and Serious 
while low scorers were described as Irrespo™ 
sible, Conceited, and Forgetful. 


Factor II, Person Orientation 


The scales which load on the second ll 
are reported by Gough (1957) to mersi 
sociability and favorable interaction Tm 
others. The items *® can be grouped accor a 
to content into the following categories: CO n 
fortableness with others, joy in interperson ; 
interaction, dominance and leadership, 9” 
sence of fear and embarrassment, and u 
ness of response. This content and the sc ng 
correlates suggest that this scale is measur! > 
the familiar Extraversion-Introversion 
mension as reflected primarily in interpet 
interaction. It is suggested that this scale 
called Person Orientation or P. Jes 

The P scale correlated with other s 
and behavior ratings which suggested we f 
and outgoingness. The following are P nich 
the correlations found with this scale W 
were significant at the .01 level. vith 

In the MMPI, it correlated negatively w, 
D (—.36) and Si (—.66), With the nage 
Zimmerman, it correlated positively 61)3 
General activity (.35), Ascendance (3 
and Sociability (.59) and negatively PPS, 
Restraint (—.44), With the Edwards 37) 
it correlated positively with Exhibition G j 
and Dominance (.39) and negatively 3 p 
Abasement (—.49). With the Strong, it ©, 
telated positively with Personnel oe 
(.33) and Public Administrator (.26) and 
negatively with Mathematician (—-35) 
Physicist (—.33), 

High scores among the clinic clients t 
to be described by their therapists as 


sonal 


ended 
ctiv® 


perso? 

€ The items and scoring direction for the 25 F, 
Orientation scale are as follows: 7 F, 13F, 79 F 
38 F, 45 T, 52 T, 53 T, 64 F, 68 F, 74 Fes Ti 
85 F, 97 T, 108 T, 131 T, 146 T, 154 T, 216 2 
179 T, 182 F, 186 F, 188 F, 200 T, 202 T, 272 È 
218 T, 223 F, 227 F, 239 T, 242 T, 256 T, 346 D 
287 T, 292 T, 314 F, 319 T, 320 T, 325 F ig F 


359 T, 383 F, 385 F, 391 T, 403 T, 
429 F, 448 T, 452 F, 460 F, 475 T. 


St 
: pa that “Studies with the 
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mergeti, Enthusiastic, and Outgoing, and 
TA scorers were described as Inhibited, In- 
erests narrow, Pessimistic, Dissatisfied, Shy, 
and Withdrawn. 
P niong mental patients high scores seem 
r dicate excitement while low scores in- 
me feelings of inadequacy. The shorter 
ee of the scale scored on the MMPI” was 
her positively with psychologists’ 
lo ings of Expansiveness (.37), Drive toward 
ng-term goals (.41), Talkativeness (27); 
ood swings (.30), Conceit (38), Elation 
aos Emotional responsiveness (44); and 
ae for the future (.38). It was nega- 
sind T correlated with ratings of Feelings of 
ne equacy (—.38), Gastro-intestinal dis- 
i s (—.31), Weakness of interests (—.40), 
ck of adaptiveness to the situation (—.38), 
om over symptoms (—.30), Lack of 
( in self (—.41), and Self-consciousness 
~.36), 
Discussion 
ee discussing the significance of these 
me Ings for test theory, We should first 
aa aoe two technical problems of the factor 
a alysis and then discuss the similarity of 
ur findings with those of other factor studies. 
ite he first technical problem concerns the 
span overlap among scales which will produce 
: riously high correlations. The item over- 
ie among CPI scales is not great, but it 15 
is ge enough to be a matter of concern. ‘There 
i evidence that item overlap in 1m- 
tory tests does not markedly affect the 


ae structure, Welsh (1956, pp- 246-281) 
With done factor analyses of MMPI scales 
simi item overlap eliminated and found very 
ilar patterns to those obtained wi the 
scales, Schutz and Baker (1962, P- 98) 
MMPI have 


tt a close correspondence in overall ae 
With T™ of analyses based on scores score 
and ae without item overlap.” Also, Nichols 
Chan eck (1960), in a factor analysis of CEI 
Ue Scores where item overlap may not be 
'Shificant, found factors identical to the 


kd 
15 À shorter version of P can be scored from the 
Mupr® items which appear in the M . The 
these į booklet numbers and scoring direction for 
3y items are as follows: 2T, 57 T, 343 Fs 350 T, 
455 p’ 391 T, 392 F, 409 T, 415 T, 427 F, 441 T, 


» 521 T, 529 T, 547 T. 


first two factors of this study. It thus ap- 
pears likely that the factor pattern is not 
seriously distorted by item overlap, since item 
overlap is most likely to occur in scales 
which would be correlated without the in- 
fluence of common items. Nevertheless, it is 
likely that the communalities are spuriously 
high because of the common items. 

The second technical problem concerns the 
method of estimating communalities and the 
number of factors retained. In this study com- 
munalities were estimated as the squared 
multiple correlation of each scale with all 
others. Principal axis analysis was used and 
only three factors were retained for rotation. 
This was a conservative procedure which 
produced a minimum number of factors, each 
accounting for a maximum proportion of 
variance. Mitchell and Pierce-Jones (1960) 
lysis and retained four 


used centroid ana 
factors which served to split our third factor 
ublished studies 


into two. Gough® in unp 

factored the two matrices in the manual with 
unities for communalities. This tends to 
result in larger factor loadings, which lead 
him to retain six factors for rotation, further 
splitting the factors found in this study. All of 
these procedures can yield useful information, 
and the decision among the various alterna- 
tives is often arbitrary. For the purpose of 
constructing factor scales, however, it is de- 
sirable for practical reasons to represent only 
the major clusters by factor scales. The sub- 
clusters can perhaps be better represented by 
the individual scales themselves. 

The factors of Value Orientation and Per- 
son Orientation found in the present analysis 
are identical to the first two factors in the 
CPI found by Mitchell and Pierce-Jones 
(1960) called Adjustment by Social Con- 
formity and Social Poise or Extraversion, 
respectively. Our two factors also correspond 
closely to the first two classes of CPI scales 
formed by Gough on the basis of early 
factor and cluster analyses of the CPI scales. 


They are also very similar to the first two 


factors found in an analysis of MMPI scales 


by Kassebaum, Couch, and Slater (1959) 
called Ego Weakness versus Ego Strength 


sH. G. Gough, personal communication, May 8, 


1962. 
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and Jntroversion-Extraversion, respectively. 
Eysenck (1953) repeatedly found in his re- 
view of factorial studies of personality that 
the bulk of the common factor variance in 
these studies was accounted for by a general 
factor of neuroticism and a bipolar factor 
of introversion-extraversion. Thus, the two 
factors found in this study seem to be the 
same ones indentified in many other analyses. 
We have preferred the titles Value Orienta- 
tion and Person Orientation, however, to 
avoid the confusion which would arise from 
such commonly used terms as neuroticism, 
ego-strength, and introversion-extraversion. 

We can now return to the problem pre- 
sented in the introduction of the interpreta- 
tion of the factors in terms of three possible 
sources of covariation among scales: ( a) com- 
mon factors among the criteria, (b) common 
criterion relevant variance among the items 
not subsumable under (a), and (c) common 
error variance among the items. The former 
possibility, common factors among the 
criteria, is suggested by the similarity of V 
to the factor of neuroticism or ego weakness 
and P to the factor of extraversion-intro- 
version found in factor analyses of diverse 
behavioral measures, However, as Becker 
(1960) has pointed out, it is dangerous to 
conclude that inventory factors are identical 
to behavioral factors solely on the basis of 
apparent similarity of content. Factor anal- 
yses of scales P and V in a matrix of þe- 
havioral measures would be necessary to settle 
this question. Until such evidence becomes 
available, the latter two possibilities men- 
tioned above must be considered, 

It has become common practice for many 
writers to interpret factors in inventory 
scales in terms of some response set or re- 
sponse style which is a characteristic of 
response to a particular item form, rather 
than representative of factors also found in 
non-test-behavior. Jackson and Messick 
(1958) for example, stated that “In light 
of accumulating evidence, it seems likely 
that the major common factors in personality 
inventories of the true-false or agree- 
disagree type, such as the MMPI and the 
California Psychological Inventory, are inter- 
pretable primarily in terms of style rather 
than specific item content” (italics are theirs) 


Rogpert C. NicHots AND RICHARD R. SCHNELL 


(p. 247). As evidence that acquiescence 
response style is an important factor, they 
note that Welsh’s MMPI Factor Scale A i 
composed of items predominantly ra 
“true” and Factor Scale R of items all keye 
“false.” Our Factor Scales P and V do not 
offer such clear evidence of en 
response style. Seventy-eight percent of t t 
items of V are keyed false, but P has dies 
equal numbers of true and false a) nage 
In spite of the predominance of false ys 
sponses in V, it is presumptuous to Caen 
that it is correspondingly weighted oa 
acquiescence. Responding direction is rela 3 
to content in the CPI, and one of the wa 
one shows positively valued tendencies a 
this test is by responding “false” to ait 
dealing with impulsivity, hostility, & 
tension. felt 
Another response style which has been a 
to be an important determinant of a 
factors is social desirability. For ene 
Edwards and Heathers (1962) have poin les 
out that the factor loadings of MMPI a 
on the first factor found by Kassebau™ 
Couch, and Slater (1959), called sale 
strength, correspond closely to the erent 
tions of these same scales with the So me 
Desirability scale. We suspect that the aoe 
is true of our first factor, The shorter vers 
of V scored from the MMPI correlated tal 
with the SD scale in the sample of menta 
patients. As was the case with acquisto 
the interpretation of this finding is sg 
because the item content is related A be 
social desirability of the item. It woul v 
interesting to see correlations of P a” 4 
with some of the pure measures of “peen 
escence and social desirability that have t 
developed where the response style 15 
confounded with item content. ales: 
Since it is possible that our factor 5° tyle 
particularly V, have a large response Siger 
component, it is important to conni 
whether the response style is predomin? inet 
error which should be discarded or yoe 
it is an important aspect of personality pja. 
possibly highly predictive of external ct the 
The former possibility appears to ae 
implication of Edwards and Heathers (1 r the 
while Jackson and Messick (1958) evi- 
latter. McGee (1962) has reviewed t 


ui- 
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aace relating response styles to personality 
clusiy, €s and has found it largely incon- 
shown The fact that P and V have been 
ehavi to be related to a number of non-test 
me ors Suggests that they are not com- 
style Predominantly of irrelevant response 

S which should be discarded as errors 
the neasurement. However, on the basis of 
tiie en evidence it is not possible to 
ties, Th etween the two remaining possibili- 
simi] e factor scales could be either (a) 

ar to factors in the criteria and depend- 

A on the item content or (b) factors 
re diffe: response to the item form which 
Corre] mit from the criterion factors, but 

ated with them. 
two m actor scales P and V represent the 
Not, site clusters of CPI scales. They do 
in ever, account for all of the variance 
a i standard scales nor do they represent 
titio ae scale interactions. Thus, the prac- 
Canin who has found the CPI profile 
Scales ful should not consider the factor 
He > 3S a substitute for the standard set. 
be ae however, find the factor scales to 
anq Pful additions to the standard profile, 
ulati °y may prove to be valuable in tlie 5 
“Wations empirically derived prediction 
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SIBLING PATTERNS AND SOCIAL ADJUSTMENT AMONG 


NORMAL AND PSYCHIATRICALLY 
DISTURBED SOLDIERS 


WILLIAM VOGEL? anp CARL G. LAUTERBACH °? 
Walter Reed General Hospital 


Birth order, and number, age, and sex of siblings were related to social 
adjustment in a group of 79 psychiatric patient soldiers and a group of 117 
normal soldiers, with age of Ss and socioeconomic status held constant. It 
was found that: (a) Psychiatric patients were more often “only” children 
than were normal Ss. Character and behavior disorders tended to be “only’ 
children more often than schizophrenics or neurotics. (b) In families of 5 
and more siblings, normals more often than patients came from the first and 
last birth positions. (c) Patients who were judged to have a good prognosis, 
and married patient and normal Ss, more often than patients with a poor 
prognosis, and unmarried patient and normal Ss, tended to come from large 
families. (d) Married patients, and patients who were returned to duty, more 
often had younger brothers than patients who remained single, or patients who 
had to be discharged from the service. 


There has been considerable interest over 
the years in the problem of how experience 
with siblings in childhood affects the psycho- 
logical and social adjustment which one 
establishes in adulthood. The purpose of this 
study was (a) to investigate relationships 
between certain factors influencing the 
nature of experience with siblings (birth 
order, family size, and age and sex of 
siblings) and incidence and prognosis of 
psychopathological conditions, and (b) to in- 
vestigate relationships between these same 
sibling patterns and marital status. 

Spiegel and Bell (1959), in an exhaustive 
review of the literature on relationships 
between family size, birth order, and psychi- 
atric diagnosis, report no definitive evidence 
on how birth order or family size is related 
to neurosis, psychosis, or character and þe- 
havior disorder. They attribute the contra- 
dictory nature of the research literature 
mainly to (a) the absence of control groups 
or else the use of improperly constituted 
control groups, (b) the employment of widely 
differing subject populations in the various 
studies without consideration of how the 
special characteristics of the particular popu- 
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lation used might affect the results k 
study, and (c) the use of different 2 
of psychopathology in the various erat 
Schachter (1959), reviewing the PY 
independently of Spiegel and Bell, °° tio? 
that “the many studies on ordina 

and various indicators of behav 
present a pattern of negative resul iter 
weak and inconsistent trends.” The jews we 
reported in this area since these ne pat’ 
been relatively sparse.. However, th 
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siblings tend to be born in 
the birth order. It is unclear wha a 
this finding might have, since it har 
been replicated on other diagnos gs 
nor has it been related to other ig 
relevant factors such as socioeco?? p 
within the schizophrenic populatio®”: oni 
Despite the general lack of secta 

empirical evidence in support of €P ; 
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0 z 
Ho TN is taken of variables which are 
in which affect directly the basic conditions 
reason, j the subjects were raised. For this 
aie the present study sibling patterns 
groups ee in normal and pathological 
e festion ich had been equated for sex, age; 
class), me (as an index of socioeconomic 
of both a number of siblings, and presence 
Until gen parents in the home at least 
This se subject’s sixteenth year of life. 

igate the ond purpose of this paper, to inves- 

abit relationship between marital status 
Pathological patterns in normal and psycho- 
Variables al populations (controlling for the 
area ee ee leads into an 
Ported. Te, few relationships have been re- 
Ships in a (1959) has reported relation- 
Opulatio oth normal and psychopathological 
Sibling ns between marital discord and 
h ae in childhood, although Toman 
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BIOGRAPHICAL DATA OF NORMAL AND 
PSYCHIATRIC GROUP 


Type of information 


Normal Psychiatric 


subjects 


subjects 


(N = 117) (N = 79) 


Mean age 
Mean number of siblings 


Education: 
Mean school grade completed 
% Grammar school or less 
% Some high school 
% Some college 
% College graduate 


Occupation : 
% Unskilled 
% Semiskilled 
% Clerical, white-collar, or 
skilled 
% Professional 
% Career military enlisted 
personnel 


% Married subjects 


26.6 
4.85 


37 
53 


28.7 
4.41 


11.6 
9 
77 
10 
4 


52 


Note,—No statistically 
categories exis 
was CO! 


personnel, o 
careerists. The two groups dii 


on any of these variables, 
mean number of siblin 
tion, in addition to 
siblings with whom t! 


significant difference: 
ts between the two groups. No oci 
ected from the normal subjects. 


s in any of the 
cupational data 


with about 40% of each group being 
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nor did they differ on 
gs. The nature of this informa- 
the large average number of 
he members of each group were 
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the lower ha 


If of the population socioeco 
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birth order be reported so that first born, 
as opposed to second born, or third born, 
etc. be grouped regardless of the size of the 
family from which the person came: for 
example, the second of 2 children and the 
second of 10 children may both be put in 
the same category of second-born child. It 
is also possible to take account of family 
size; for example, to compare first-born sons 
from families of three, only to other first- 
born sons from families of three. Alterna- 
tively, one might focus on ordinal position 
so that first-, intermediate-, and last-born 
children are the objects of study, regardless 
of family size. All three methods were em- 
ployed in this study. 

In comparing the 79 psychiatric subjects 
and 117 normal control subjects on distribu- 
tion of birth order without regard to family 
size, the data were arranged in a 2 X6 
contingency table. Five cells represented the 
first through fifth birth positions and the sixth 
cell represented the sixth and later birth 
positions, which were grouped in the sixth 
cell in order to avoid small expected fre- 
quencies. The normal and psychiatric popu- 
lations did not differ, the percentages in 
the comparable cells being very similar 
(x° = 5.74, p<.40). The schizophrenic 
(N = 24), character and behavior disorder 
(N = 36), and neurotic groups (N = 19) 
were compared in a 3 X 4 contingency table 
(the fourth and later birth positions were 
grouped in the fourth cell because of small 
expected frequencies in some cells). The 
results did not approach statistical signifi- 
cance. These data do not allow for the 
rejection of the null hypothesis of no dif- 
ference in birth order between psychiatric 
and normal subjects in this study, nor do 
they allow the rejection of the null hypothesis 
of no difference in birth order among the 
three main psychiatric diagnostic groups, 
when analyses are done without regard to 
family size. ae 

When the normal and psychiatric samples 
were compared for differences in birth order 
with family size controlled, so that separate 
analyses were done for families of 2; 3, 4, 
5, 6, 7, or 8 members, no significant dif- 
ferences were obtained. Similar results were 
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found when comparisons were made vii 
the psychiatric population, comparing bt, 
three diagnostic groups. Admittedly hart 
not an adequate test of the null hypothes 


mily 
of no difference between groups yten P 


a fam! y 
te small 


size is held constant, because of the 
the number of subjects coming from 
of any given size was at times qui ‘ch the 
For example, in the x? test in whic 


pare! 
largest number of people were compa 


13 
(psychiatric patient versus normal soldier | 
from families of three siblings), 
people were involved. As family 
creased and the effects of birth © 
investigated in families of four or more, 
number of persons involved became less, é 
of course, the number of contingency <,- 
to fill became greater, so that expect 
quencies in particular cells fell below ‘a e 

When the normal and psychiatric yE, if 
were compared on ordinal position to umbe! 
the samples differed in comparative water me 
of subjects who were first-born, ae 
diate-, or last-born siblings, no significa”? jjs 


: ar, © 
ferences were obtained. When, howeV©’’ ies 


only ' 
size 
re | 

der we 
i the 
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analysis was repeated, distinguishing s 
with four or less siblings from fam! "dies of 
five or more siblings as in the stu and 
Schooler (1961) and Farina, Barty’, pe 
Garmezy (1962), it was found tha 

57 normal subjects from families © 
more, 11 normals were first born, 3 
were intermediate-, and 7 normals W° 5 
born. Of 31 patients from families 

or more, 1 subject was first-born, 

intermediate- and 1 subject was 

(x? = 7.21, p< .05). There was "° ipint 
effect in families with four or 1655 1 awe, 
This analysis could not be repeate® —tjet” 
few F wer 
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from families of five or more S! 
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LA 


SIBLING PATTERNS AND SOCIAL ADJUSTMENT 


TABLE 2 


Jir 
NUMBER OF SIBLINGS OF NORMAL AND 
PSYCHIATRIC SUBJECTS 


Percentage of subjects having a 
iven number of siblings 
Number of Gi iz 


siblings per Normal group Psychiatric group 
subject (N = 117) (N = 79) 
i 2 15 
2 12 9 
3 23 18 
4 14 18 
5 14 16 
Giot 15 09 
More 20 15 
Total % 100 100 


Not, 

e.—Chi 

Dychiatri Chi square comparing distribution of normal and 
ie groups, 15.73, p < .02. 


sh 
cigs a significant difference (x° = 15.73, 
that Oe? the greatest single difference being 
Were % of the normals (2 of 117 subjects) 
tients only children, while 15% of the pa- 
(see ae of 79 subjects) were only children 
Strike able 2). This difference is all the more 
Conn ng if one considers these data m their 
ine with the data on birth order. 
at ie Mi is true that the normal and psychi- 
tions 8roups did not differ in their propor- 
Trstbey eldest children (34% of patients were 
alg) as compared with 27% of nor- 
dest it is also a fact that 12 of the 27 
only children in the patient group were 
dest dren (44%) while just 2 of the 32 
on „Children in the normal group were 
T children (6%; x? = 9.24, p< Ol). 
Dhreni three diagnostic groups (schizo- 
2 ma neurotics, character disorders) were 
S red regarding number of siblings in 4 
famij mtingency table. (All subjects from 
Roy, > Of four or more siblings were 
in the fourth cell to avoid smal 
°F Sta E equencies.) The findings fell short 
ite pea significance. What difference 
Metig as between these three groups Was 3 
SARON of the fact that only 1 of 4 


op Cteq 


5 of qo nics was an only child, and only 
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an’ On] character and behavior disorders 
‘tang... Children, When these data ms 


in a 2x3 contingency table, 


239 


findings fell short of statistical significance 
(2 = 51, P< 10 but p> .05). 

Age and sex of siblings. The normal and 
psychiatric groups were next compared to 
see whether they differed in the number of 
members within each group who had at least 
one older sister, one younger sister, one 
older brother, or one younger brother. The 
comparisons were carried out in a series of 
2%2 contingency tables (normal versus 
psychiatric group, number of subjects with 
one or more older brothers versus subjects 
with no older brothers, etc.). No significant 
differences were found, nor were any sta- 
tistically significant differences found when 
character disorders, schizophrenics, and 
neurotics were similarly compared. 


Prognosis in Psychiatric Patients 


The patients were divided into two groups, 
a good prognosis group (N = 32) whose 
members the psychiatric staff judged recov- 
ered enough to be returned to active military 
duty (duty group), and a poor prognosis 
group (N = 40) whose members the psychi- 
atric staff caused to be discharged from the 
army and returned to civilian life as unfit 
for further military service (discharge 
group). The duty and discharge groups did 
not differ in regard to education or occupa- 
tion; they were of equivalent socioeconomic 
status. The two patient groups were com- 
pared on three aspects of sibling relation- 
ships: birth order, the number of siblings 
each patient had, and the age and sex of 
their siblings. 

Birth order. The duty and discharge 
groups were compared on the birth order 
variable in a 2 X 4 contingency table. No 
tendency was seen for the duty group to 
have a distribution of birth orders different 
from the discharge group, whether or not 
size was taken into account. There 
was no tendency for the groups to differ in 
the subjects who were the first-, intermediate-, 
or last-born among their siblings. 

Family size. When the duty and discharge 
groups were compared on the number of 
siblings each subject had in a 2X4 con- 
tingency table (the fourth cell representing 
all those with families containing four or 


family 
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more siblings), the x? value was 7.09, 
p< .10 but p> .05. What difference there 
was between the two groups was contributed 
by a tendency of the duty group to come 
from families in which there were three or 
more siblings (including the patient) while 
the discharge group showed a larger number 
of “only” children and patients with only 
one sibling: only 13% of duty patients had 
one or no siblings, while 35% of discharge 
patients had one or no siblings. 

Age and sex of siblings. The duty and dis- 
charge groups were compared, in four 2 x 2 
contingency tables, to see whether there was 
any difference between the two groups in the 
age and sex of their siblings. The members 
of the duty group more frequently had one 
or more younger brothers (72% of the duty 
group had younger brothers while only 
37% of the discharge group had younger 
brothers), x? = 7.10, p < .01. There was no 
difference between the two groups in regard 
to other sibling relationships (older brothers, 
or older or younger sisters). 


Marital Status and Sibling Patterns 


The normal control group contained 54 
married and 61 single subjects; the marital 
status of the two additional normal control 
subjects was unknown. The psychiatric group 
contained 30 married subjects, 8 subjects who 
were divorced or separated from their wives, 
and 41 single subjects. Examination of the 
data of the divorced patients showed that 
regarding the dependent variables with which 
this study was concerned (birth order, number 
of siblings with which the subject was raised, 
age and sex of siblings), the data of the 
divorced patients were indistinguishable from 
that of the married patients. Consequently, 
the patient population was divided into a 
combined married or once married group 
(hereafter simply called the “patient mar- 
ried group,” N = 38), and a single group 

In neither the patient nor the normal group 
did the married and single samples differ 
on the variable of education. Further, married 
patients did not differ from the single pa- 
tients on the occupation variable. It would 
seem, therefore, that the married normals as 
compared to the single normals, and the 
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married patients as compared to the single 
patients, were of approximately equal socl0 
economic status. 

Birth order. The birth order data for the 
married and single normal subjects v 
examined in a 2 X 6 contingency table W! 
the first cell containing single children, aa 
the sixth cell all those families with six b- 
more brothers and sisters (including the F : 
ject). The x? was 10.15, p < .10 but $ > -4 
What difference existed between the hee 
and single normal subjects was contribu 
by a tendency of single subjects to be A 
oldest or next oldest among their siblings. 5 
larger proportion of oldest and next ol A 
children (66%) were likely to be anen o 
compared with children who were thir ilat 
later in birth order (37%). When the sina 
data for married and once married patie ar 
were compared with the data of single ier 
tients, however, no differences in birth © 
were found. and 

No differences were found when single f r 
married subjects within the patient and rdet 
mal groups were compared on the birth Mie 
variable when family size was taken oe ef- 
count by separately examining birth order 
fects within each family size (2, 3, 4 sa dif 
etc.). However, the null hypothesis © p ize 38 
ferences between groups when family are? 
held constant is regarded as not having pup 
adequately tested, because of the sma siz? 
ber of people from a family of any given tbe 

Married and single persons in neith® or 
patient nor the normal group tended i jast 
on having been first, intermediate, °" 
born in their respective families. inl? 

Family size. When the married and and 
normals, and married (or once married? > of 
single patients were compared on the ron 
the families from which they had cas 
striking similarities were noted. In the 


cluding the patient) yielded values just op 
of statistical significance (normal * atien 
X’ = 10.86, p< .10 but p> -05; a. 
sample, x? = 10.17, p < .10 but $ 7 ncy s 
both cases there was a marked tende 
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marri 
a O to come from larger families. 
| tree or z and single normal persons with 
Married an ee siblings were compared with 
oF fewer ae normal persons with two 
those oa | ings, it was found that 76% of 
Dated wit} arger families were married com- 
families Th 51% of those from smaller 
| Patient Dò e comparable figures from the 
| larger fa ie showed 71% of those from 
| Mattieg ea were married or were once 
"Tiitieg et, CMY 449% of those from smaller 
ge ea married or were once married. 
Marrieq Sex of siblings. When normal 
Were © Subjects and normal single subjects 


7 Cr 
tingen Pared 


| iu A Younger brothers, older sisters, OY 
| Be es (as opposed to their having 
| p thers ia no older brothers, younget 
| aTe AN der sisters, or younger sisters), 
Pa ant se significant differences. The only 
| ne Arison: ifference found when the same 
a? of m, S were made for patients was that 
of le nd or once married patients had 
(2 gle € younger brother while only 34% 
X Re Patients had a younger brother 
| 61, <0 1). 
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population were only children, as contrasted 
with only 6% of first borns in the normal 
population, puts the data in bolder relief. 
These data suggest that the only child is 
subjected to stresses to which children who 
have siblings are not subjected. It is also 
possible that the parents of the future psy- 
chiatric case had such difficulty raising him 
that they were discouraged from having ad- 
ditional children. In any event it is interest- 
ing to contrast the finding that only children 
are greatly overrepresented in the psychiatric 
sample, with the finding that in large families 
of five or more children, the members of the 
psychiatric sample are more likely than mem- 
bers of the normal sample to be intermediate 
children. While intermediate children in large 
families may be more likely to be neglected 
than are their first- and last-born siblings, it 
would also seem true that only children are 
more likely than other children to be con- 
stantly the center of their parents’ attention. 
Either situation apparently is more likely to 
lead to injury than other sibling patterns. 
The additional finding that character dis- 
orders tend to be only children more often 
than neurotics Or schizophrenics supports 
several studies cited by Spiegel and Bell 
(1959) to the effect that only children are 
overrepresented in the delinquent population, 
and also indicates that it is the character and 
behavior group which is primarily responsible 
for the greater occurrence of only children in 
the psychiatric as compared with the normal 
population. These data render some credence 
to Adler’s (1939) theories about the “spoiled 
only child,” which previously have had little 
firm support in the empirical literature 
(Spiegel & Bell, 1959). 
Prognosis in Psychiatric Patients 

The finding that those patients who were 
returned to duty had younger brothers more 
frequently than those who did not has im- 
plications about the important role that a 
younger sibling might play in the develop- 
ment of a man’s adequacy. It may be that 
the masculine leadership role in which older 
brothers who have younger brothers may be 
cast, is a positive force in the older brothers 
development. It is possible, of course, that 
this latter finding is an artifact due to the 
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fact that there is a tendency for the duty 
group to come from larger families, and thus 
the members of this group would have a 
greater chance statistically to have a younger 
oo family units from which the 
duty patients as opposed to the discharged 
patients tended to come may have rendered 
the members of the duty group more capable 
of adjusting to difficult social situations. 


Marital Status and Sibling Patterns 


It was found that in both the normal and 
psychiatric samples persons who were mar- 
ried tended more than single persons to 
come from large families. It was also found 
that the members of the psychiatric sample 
who were married were more likely to have 
younger brothers than were patients who were 
unmarried. Having a younger brother may 
incline one to assume a paternal role, and 
so predispose one to early marriage. Coming 
from a large family may provide experiences 
with peers that incline one toward easier 
acceptance of complex interpersonal situa- 
tions, such as marriage. 

The findings are interesting that both normal 
and psychiatric subjects who married as op- 
posed to those subjects who remained single, 
anl patients who were judged able to return 
from duty as opposed to patients who had to 
be discharged from the service, tended to 
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come from large rather than small oo 
Zigler and Phillips (1961) and Lane o na 
found that marriage is a competent inde be 
social effectiveness. Certainly those who ee 
able to return to active military duty ee 
be regarded as more effective persons abled 
those subjects who were so obviously a 
that immediate discharge from the mi r 
was necessitated. In each case it, yet 
appear that, in this lower socioeconomit. 
sample, more effective social pig vee 
sociated with coming from a large fami y- 
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SCORES OF PSYCHOTIC PATIENTS ON THE 
MAUDSLEY PERSONALITY INVENTORY 


T. G. CROOKES 
St. Johws Hospital, Aylesbury, England + 
ano S. J. HUTT 


Park Hospital for Children, 


The hypothesis that psychotics 
scales of the MPI was tested by 
20 manics and 20 depressives. The 
standardization group on the E sca 
N scale, but was considerably nearer 
of 60 neurotics. The schizop! 
half way between normals anı 
normal on N and a 
above normal on N an 
mg symptoms rather than a 
in relation to other findings. 
discrimination between psyc 
high E score was found to cons 
manics well from all other groups. 


Th 

(npr: Maudsley Personality Inventory 

19593) described by Eysenck (1959a, 
designed to 


» İs a short questionnaire 


Measur z 
d Extraversion 


(E à Neuroticism (N) an 
the oe of 48 questions, 24 for each 
or 9, hi scales; each answer is scored 2, 1, 
“ac that the range of possible scores on 
Scale is 0 to 48. 
x Psyche oy was administered to a group 
the retical c patients for two purposes, One 
OY is’ one practical. If Eysenck’s (1952) 
tog osion correct, that Psychoticism is & 
a cism th igo is orthogonal to both Neu- 
she Present Extraversion-Introversion, then 
aD, l mal ative group of psychotic patients 
"Oximate Scores on both MPI scales whic 
e, in mean and distribution, to the 
im w he theoret- 
ae to test this hypothesis. The 
Dro ilities was to explore the diagnostic 
Aen Ure pi: the inventory. It is a short 
ttj Se ia nd one which most patients find 
a tion T that if it showed good dis- 
between any psychiatric groups 


1 


Di Our 
C. Watt, Medical 


for permission to 
urses R. 
d the 


lish „2f 
tha this St. John’s Hospital, 
en and y and to Deputy Charge N 


would give norm 
examining the scores of 40 acute schizophrenics, 
total group did not 
le; it was significantly higher on the 
to the normal than to a control group 


hrenics were close 
d neurotics on N. 
bove normal on E, while depressives were significantly 
d below on E. This finding, 
permanent feature of 
The N scale was no 
hotics and neurotics, 
titute a manic pattern 


Oxford, England 
al scores on the N and E 


differ from the normal 


to the normal on E, and about 
Manics were significantly below 


that the scale was measur- 
the person, was discussed 
t found to make a useful 
but a low N score plus 
which distinguished 


ould be a valuable diagnostic aid. For the 
d here, scores were also 
up of neurotic and psy- 
The two aims of the 
Josely related, since the 
success of the main discrimination, that be- 
tween neurotics and psychotics, depends on 
the extent to which the hypothesis about 


psychotic scores is verified. 


it wi 
comparisons require 
obtained from a gro 
chopathic patients. 
investigation are © 


METHOD 
sence of definite knowledge of the 
the different psychoses in the general 
it seemed reasonable to construct our 
bers of the two main “func- 
ps, the schizophrenias and the 
d in the affective group to 
essed and manic patients. 


In the ab 
incidence of 
population, 
group of equal num 
tional” psychotic grou 
affective psychoses, an 
have equal numbers of depr 

The records were collected by giving the in- 
ventory to all cooperative patients in St. John’s 
Hospital admission wards who fell into the required 
diagnostic categories until there were 40 schizo- 
ics (27 males, 13 females), and 40 affective 
20 of them manic (12 males, 8 females), 

(9 males, 11 females). The schizo- 
e all acute cases who had been in 
hospital less than a year, mostly just a few weeks. 
At the same time, a group of 60 mixed neurotic 
and psychopathic cases (35 males, 25 females) was 
collected, f ison The diagnoses were sup- 
plied by t i charge of each case. 


phrenics Wer 


he psychiatrist in 
psychotics was 16-72 (mean 39.5), 


17-57 (mean 32.9). 
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TABLE 1 
MEAN NEUROTICISM AND EXTRAVERSION SCORES 


N Score E Score 
Group N M SD M SD 
Normals (Eysenck) 1,800 19.89 11.02 24.91 9.71 
Schizophrenics 40 2635 9.15 22.88 8.36 
Affectives 40 19.80 11.23 24.25 9.84 
Manics 20 14.60 9,21 28.80 7.66 
Depressives 20 25.00 10.82 19.70 9,83 
Total psychotics 80 23.08 10.70 23.56 9.10 
Neurotics 60 31.35 10.11 — 5 
RESULTS 


Table 1 gives the mean scores of each 
group on each of the two scales. The scores 
of Eysenck’s (1959b) normal standardization 
group are included for comparison. 

The E score of the neurotics is not given, 
since this depends on the type of neurotics 
included. Our group was collected without 
regard to its composition by subgroups, the 
aim being simply to obtain a group homo- 
geneous with respect to neuroticism. 

Table 2 summarizes the significance levels 
of comparisons between various pairs of 
groups. All probabilities are two-tailed based 
on £ tests. Where a group is compared with 
Eysenck’s normal mean, the latter is treated 
as the “true” mean of the normal population, 
and ¢ is the comparison of our group mean’s 
standard error with its deviation from the 
“true” mean. Where two of our groups are 
compared, ¢ is the comparison of the differ- 
ence between the group means with the 
standard error of the difference, Comparison 
of the variances of all the pairs in the Table 
gives no significant difference. Neurotic com- 
parisons on the E scale are not given, for 
the reasons mentioned above. 

Considered as a whole, the psychotic group 
does not differ significantly from the normal 
in E, but is significantly higher in N; it is, 
however, significantly lower in N than the 
neurotic group, and considerably nearer to 
the normals than to the neurotics. The schizo- 

phrenics are about midway between normals 
and neurotics in N, but do not differ sig- 
nificantly from the normal in E. The total 
affective group does not differ from the 
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normal on either scale, but taken separate 
the subgroups differ significantly from a 
other, and each from the normal on bo f 
scales, manics deviating in the direction 0 
low neuroticism and extraversion, gepresst 
in the direction of high neuroticism an 
introversion. These findings are quite iS 
accord with common sense, but they raise th 
question of what the scales are meašürne 
Is it a relatively permanent characteristi 
of the individual or a temporary one? T 
other words, are they measuring personality 
or symptoms? If the conditions are considere 
as phases in a “cyclic” psychosis, occurrink 
in the “cyclothymic” personality, they a 
be thought of as occurring in the same tyP 
of person, so the scales must be measuring 
symptoms. But it would also be possible 
think of a “manic personality” prone ne 
mania and a “depressive personality” Pt 
to depression. In this case the scales coti 
be reflecting the personality. Two addict 
operations were carried out to help The 
decision between these alternatives. a) - A 
patients had been assigned to their onin a 
groups on the basis of the diagnosis of F 
current admission. It is well known a 
some manics show quite rapid fluctuations E 
mood, and the clinical notes of all the maA 
were examined retrospectively to see if T 
of them had been actually depressed at 


TABLE 2 


SIGNIFICANCE OF DIFFERENCE (p) BETWEEN 
PAIRS OF Group MEANS 


m 
1D 
Groups compared N 
od 
Normals versus psychotics <.01 >A 
Normals versus schizophrenics <.001 >A 
Normals versus affectives >t 2.05 
Normals versus manics <.02 <05 
Normals versus depressives <.05 aad 
Neurotics versus all psychotics <.001 a 
Neurotics versus schizophrenics <.02 ri 
Neurotics versus affectives <.001 a 
Neurotics versus manics <.001 pE 
Neurotics versus depressives <.02 > 
Schizophrenics versus affectives a0 2,07 
Schizophrenics versus manics <.00 zA 
Schizophrenics versus depressives A <0 
Manics versus depressives <0 
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TABLE 3 


DISCRIMINATION OF PSYCHOTIC AND NEUROTIC GROUPS BY N Scare 


Cutoff point 


Neurotic tenth Psychotic ninetieth 


Group Median percentile percentile 
= >28 <29 >16 <17 >37 <38 
paratip (60) 41 19 54 6 (10%) 20 (33%) 40 
Sychotic (80) 24 56 57 23 (29%) 8 (10%) 72 
tim n 
e of testing; three definitely, and one tenth percentile of the neurotic group and 


Drob; 
ably, were found to have been in a short 


e 

Sa phase following their manic attack 
The oe when the inventory was given. 
— means of these 4 were N = 23.8, E 
temaini as compared with the means of the 
Patina 16, N = 12.3, E = 306. (b) Those 
tiy nts Were selected from the whole affec- 
ieee who were definite manic-depres- 
actual 4 the strict criterion of having had 
tack poa admissions for both types of 
epr here were 9 of these, 3 manic and 
the > “Ssed at the time of testing; means of 
the > Manics were N = 11.0, E = 31.7, of 
woo depressed, N = 25.3, E = 19.7. These 
Perations both support the hypothesis 


th 

at 

“onditi © scales are measuring a temporary 
nent 2N or symptoms, rather than perma- 


x eg sonality characteristics. The best way 
hhdiviaa’ the hypothesis would be to test 
‘Ye ual manic-depressives during both 
of attack, This we intend to do, but 
carly a long-term study. 


eps 
tminat; 
ation of Groups 


© the Joos 
tadig logical employment of the 


ta E case would be to use the N scale 
neu i © between neurotic and psychotic (or 
“evant and normal, but this is hardly 
ae 0 to clinical practice), then use the E 
inet e Place the individual in the various 
nite — of neuroses or psychoses, if 
Sh ere is a useful relationship between 

Ows pY of the subcategories. Table 3 


test in an 


9 s+ 

a th, OW the two main groups are divided 

agile Scale by taking a score as near as 
d hy © the median of the whole group, 


t 
Wo extreme scores based 0” the 


the ninetieth percentile of the psychotic. 
When the whole group is divided at the 
median, 43 patients (31%) are misclassified. 
Scores below 17 and above 37 have about a 
3 to 1 chance of belonging to the appropriate 
group on the assumption that the population 
from which they are drawn contains equal 
numbers of each group, ie. taking the per- 
centage shown in the table, not the absolute 
numbers. (For a discussion of the problem 
of using proportions of this kind when the 


proportions of the types in the population 
re not known, see Crookes, 1961.) 


drawn on ai D 
The discrimination of neurotic subgroups 
by the E scale is discussed elsewhere 


59a; Foulds, 1961; Sigal, Star, 
and our experiment was 
not designed to examine this. Thus no at- 
tempt was made to get equal numbers of 
the subgroups. They actually turned out to 
be 23 dysthymics (mean E= 19.1), 10 
hysterics (mean E = 27.6), and 27 psycho- 
paths (mean E = 26.0). These means are 
closer together than those of Eysenck’s 
(1959b) corresponding groups, 50 that dis- 
crimination of the three groups will be even 
more unsuccessful. If the hysterics and psy- 
chopaths are combined, and this group com- 
pared with the dysthymics, division at the 
median gives a misclassification of 37% and 
t the upper end of the scale there 1s hardly 
3 t the lower end, 13 


iscrimination. A 
oa the dysthymics have scores below 
16, whereas only 2 (5%) of the hysteric- 
, 


have E scores as low as 
chopath group @*" 
he a a neurotic giving a very low E score 
nasi tf good chance of being diagnosed as 
o 
dysthymic. 


(Eysenck, 19 
& Franks, 1958), 
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It can be seen from Table 1 that the main 
psychotic groups are not distinguished at all 
by the E score. The manic subgroup is quite 
high on E, and this combined with its low 
N score gives a manic score pattern which 
is perhaps the only discriminator with any 
practical value that this study suggests. A 
score of E > 25, N < 11, is given by 8 out 
of 20 manics, or 8 out of 16 if the 4 who 
were actually depressed are excluded, and by 
only 3 of the remaining 120 patients, 1l 
depressive, 1 hysteric, and 1 psychopath. This 
score should be of value in distinguishing 
mania from schizophrenia, a distinction which 
is sometimes required in practice. In a recent 
study by McGuire (1962), in which the 
MPI was administered to 151 unselected psy- 
chiatric in-patients, it was concluded that 
“the test had no part to play in differential 
diagnosis, although the N scale might be 
useful as a screening test.” With the ex- 
ception of its possible value in distinguishing 
mania and schizophrenia, our own results 


using the MPI must largely support this 
conclusion. 


DıscussIon 


The hypothesis that psychotics would give 
scores similar to the normal was very nearly 
confirmed. The psychotic mean on the N scale 
was significantly higher than the normal, but 
the difference was only about 1/3 SD. It is 
worth noting that the 213 normal subjects 
of Sigal, Star, and Franks (1958) had a 
mean N score of 23.2, almost exactly the 
same as our psychotic group. In fact our 
psychotic group apparently gave less neurotic 
scores than psychotics in the studies quoted 
by Eysenck (1960), in which other measures 
of neuroticism were used. This is probably 
because we included a considerable group of 
manics. 

The most interesting feature of the study 
is the scores of the manic-depressives, in 
which both the scales seem to be indicators 
of temporary attitudes, or symptoms, rather 
than permanent characteristics, or person- 

ality, although the inventory appears on the 
face of it to be a measure of personality ; 
subjects are asked to give their : usual way 
of acting or feeling.” This finding ties up 
with the experiment of Franks, Holden, and 
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Phillips (1961). They selected those items 
of the MPI which were the most likely t0 
be objectively observable. They had wa 
groups, one of normal subjects, one of r 
tients (alcoholics), and they asked each perso. 
to fill in the scale for himself and for every 
other member of his group. In each gro "y 
the members agreed closely in their raine 
of the others, but whereas in the ar 
group the self-ratings also agreed with up 
ratings by the others, in the patient gr? 
the self-ratings showed hardly any F 
agreement. It seems possible that a seale A 
this kind measures two different things W 
used by normals and patients. In the n 
it perhaps measures something relative 
permanent, and observable, while in nt 
types of patient at least, it seems to mees ds 
a temporary subjective attitude. ae. 
(1955, 1959) has repeatedly stressed sa 
need to distinguish between “symp de 
clusters” and “trait clusters,” and has are 
veloped measures which, in neurotics, dif- 
differentially sensitive either to symptom nck 
ferences or to personality differences. EY ae 
himself does not find it necessary to ont 
the distinction, In the Handbook of an 
Psychology (Eysenck, 1960), for el 
neuroticism is said to be based on gh the 
stitutional factor, autonomic balance, ÞU 
actual behavior depends on the stress 
which the person is subjected; o a 
=f (Predisposition and Stress) [p- . h Ë 
Neuroticism is presumably that whic pe 
measured by tests of neuroticism, am 
concept was derived from the intercon eh 
tions of tests, but what do the tests Meme 
the Outcome or the Predisposition? sag? 
answer is suggested by the following pa ple 
where Eysenck (1960) is giving a P° wer? 
reason for the fact that some psychotics © 
found to give high scores on neuroticis™ 


ntiall 
- We might regard it (psychoticism) as i a 
a powerful stress which, when multiples a out 
average predisposition, produces as the peha vi?" 
come a strongly emotional and neurotic 50 by 
pattern. In other words, although the ae 
on psychoticism is not necessarily high O; 
roticism, the strong stress produced Ma A 
ordered thought processes results in k r re 
sequence leading to strongly emotiona 
[p. 28]. 


| 
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ates that he has here abandoned the 
ae, to use the word “neuroticism” in a 
high agin The psychotics in question are 
tests, į neuroticism as measured by the 
originally oe of which the word was 
aros Pay a they also show neurotic 
sarily i m they are said to be not neces- 
must a on another neuroticism, which 
iy is M underlying constitutional factor. 
are ave it fairly clear that the tests 
condition” t of as measuring the temporary 
nent en symptoms rather than a perma- 
epressiy ition, and our findings with manic- 
NY ‘ives would support this as far as the 
Working j Concerned. For the psychologist 
is im in a practical, therapeutic situation, 
Portant to know which kind of neurot- 
eee test is measuring, if he wants 
Wishes changes due to treatment, or if he 
likely © know what aspects of behavior are 
the he be amenable to treatment. Hence 
It role of the work done by Foulds. 
istinctig also be useful to make a similar 
y Oticism. in the case of measures of psy- 
Patticn - Manic-depressives would seem a 
itọ p &tly suitable group for an enquiry 
tents 'S; and the authors hope to use pa- 
the . K this type for an investigation into 
Of ye POrary and permanent characteristics 
Ychotics, 


i 
ici 
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A study investigating the role of similarity between the perceiver and the 
stimulus in determining the projection of experimentally aroused hostile affect 
was initially carried out with a group of 57 college Ss, and then, due to the 
ambiguity of the data, was replicated on a sample of 65 college Ss, Prior to 
and immediately after administration of the anger-arousing or control treat- 
ment, Ss judged a series of stick figures and a series of photographs of two- 


person situations in 


which the age of a male figure was systematically 


varied. Both male and female experimental Ss in comparison to control Ss 


manifested reliable increments in 
stimulus persons only. 


The present study is part of a more gen- 
eral research effort aimed at investigating the 
effects of emotional arousal on the process 
of interpersonal judgment. The focal point 
of these investigations is an assessment of 
the variables which facilitate the attribution 
of the perceiver’s affective state to some 
stimulus person. In this sense, these studies 
can be said to fall within the rubric of 
“projection.” However, as has been stated 
elsewhere (Feshbach & Singer, 1957; Mur- 
stein & Pryer, 1959), the concept of projec- 
tion encompasses a number of diverse proc- 
esses and the use of a single label may 
obscure important functional differences. In 
accounting for the influence of emotional 
arousal on the perception of other people, 
the authors have stressed the role of stimulus 
generalization and the egocentric orientation 
which affective involvement is likely to 

ê, 
apn previous experiments (Feshbach & 
Singer, 1957; Singer & Feshbach, 1962) were 
concerned with the attribution of the emotion 
of fear or anxiety. In these studies, evidence 
was obtained that under conditions of fear 


i as supported by Research Grant 
ene ee National Institutes of Health, 


United States Public Health Service. 


attribution of hostility to college-age male 


sects 
arousal, both normal and psychotic subir 
tend to attribute a greater degree 0 ondi“ 
to others than under more relaxed i 
tions. Another group of experimen "ioe 
Feshbach and Feshbach (1963) based ati 
Murray’s (1933) well-known invest tit 
of the influence of fear upon the attri uent? 
of maliciousness also dealt with the infu” ; 
of fear upon social judgment but W! 
primary interest in the type rather 
degree of distortion resulting bere: ight 
arousal. These studies indicated that I" ys | 
ened boys tend to attribute fear tO a oh 
their own age, but, at the same timM® eif 
tribute maliciousness to older men. ` age 
judgments of girls similar to their OW? gn 
and of older women were relatively ' 
affected by their emotional state. roce 

It is apparent, then, that the Pi be 

mediating affective attribution bere tbe 
fully understood by an analysis only as ¢2 


s 
perceiver’s internal state. One also peiné 
consider characteristics of the perso” pe 
judged and his relationship to ese 
ceiver. The principal thesis of the P eive 


. . . . ere fi i 
study is that similarity between the Prttrib” 


and the stimulus person facilita ed P 
tion of affect. Similarity, as is r mul" 
the literature on substitution ani 
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ATTRIBUTION OF HOSTILITY To PICTORIAL STIMULI 


ee, is deceptive in its apparent 
laity icity as an independent variable. Simi- 
it ee vary along many dimensions and 
ee nial, where human beings are con- 
a a to sample more than one dimension 
able i the significance of this vari- 
emplo a particular experimental design 
the e ed in this study includes variation of 
‘pease object with respect to age and 
Was. s oe to the perceiver. Hostility 
ised i ted as the affective state to be 
n in order to extend the generality of 
Previous findings in the area of fear 
anxiety, 
A function of this design has been con- 
the rol within the context of an analysis of 
Ceiver € of the emotional state of the per- 
Xperim, in interpersonal perception. The 
the ual data are also relevant to 
especi oe of projective techniques, 
the œY Such measures as the TAT and 
Whic Osenzweig Picture Frustration Test, 
3 the employ human interaction situations 
Wi Mirror to reflect other characteristics 
Studies respondent. There are a number of 
by yy Which have recently been reviewed 
Simi es Stein (1961) in which some aspect of 
timy Y such as race and sex between the 
Varie Us object and the respondent has been 
‘teh j ‘he use in many of these studies of 
Bte ndices of projection as story length 
n yliits the inferences that can be 
Matei With respect to the relationship of the 
sident elicited to attributes of the Te 
kh in | An advantage of the present de- 
this regard lies in the ability to 
ity. e attribute—namely, anger and 


jects METHOD 
R ings 
erin sample consisted of 25 control 
Olog, Mal subjects drawn from introductory 
Classes at the University of Pennsyl- 

to sizable initial differences between 

e and control subjects which 
sin; results ambiguous, the experiment oe 
we 20 control and 45 experimental sub- 
pe taking summer session courses in 

z Were the City College of New York. 
d,Por a or randomly assigned to either the e- 
Une aon Rae control condition, with a greater 
“ey IStribution assigned to the former treatment. 
a Th . of males and females was 
ennsylvania control group ©? 


and 


also 
sisted 
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of 8 males and 17 females and the experimental 
group of 16 males and 16 females. The New York 
control group was comprised of 12 males and 
8 females and the experimental group of 23 males 
and 22 females. In both groups, the average age 
was about"19.5 and the mean number of years of 
education was approximately 14. 


Materials 


Two classes of stimuli were used. The first type 
consisted of slides of animated stick figure drawings 
and the second of slides of TAT type photographs. 
There were 26 stick figures, chosen from the Sarbin 
(1954) series. These are simple stick drawings, 
neutral in sex and suggesting a person in various 
poses. Those stick figures were selected which could 
be perceived as engaged in some sort of hostile 
activity. Information regarding the perceptual 
properties of these figures was available from pre- 
viously obtained norms. In addition, prior to the 
experiment proper, the full series was administered, 
with five alternative adjective choices provided for 
each stick figure, to a group of 65 students at the 
University of Pennsylvania who were asked to select 
the adjective that best described the figure. The 
stick figures were then divided into two matched 
series of 13 slides on the basis of the number of 
hostile attributions elicited by each stimulus. 

These same 65 subjects were also presented 28 
TAT-like photographs of two-person interaction 
situations. These pictorial situations were somewhat 
similar to the Rosenzweig Picture Frustration Test 
jn that they were so posed that, for each stimulus, 
one person, labeled A, could be perceived as frus- 
trating a second person, labeled B. The source of 
frustration varied and included a college-age male, 
a college age female, an older man, and an older 
woman. The object of frustration, Person B, was 
either a college age male or a 12-year-old boy. The 
task of the subjects was simply to indicate for each 
interaction how Person B felt. In contrast to the 
stick figure series, the question was open-ended 
in that no alternatives were provided. 

Two matched series of pictorial situations were 
then constructed on the basis of the number | of 
hostile attributions elicited by each situation. 
Series 1 consisted of nine pictures. In five, Person B, 
the object of frustration, was a college age male 
and in four he was a eee boy SA 

i of pictures similar in sex-age C 
ma in P Series 1 with respect to both me 
object and source of frustration, except that the 
ber of same age and younger age stimulus 
ee reversed. Ideally, it would have been 
Canna exactly corresponding situations for 
desirable © 3 -college age male variation, How- 
Hie youney, 4 oblem of whether the mean- 


i from the pr y u 
rer nen situation would remain the same with 
BES f the object of frustration, the 


i in age 0 3 
4 difference tical Sits of situations introduces pos- 


i kes trans- 

ts and, more important, ma) 
jose of the variation. In order to 
ae ble confounding of situational 
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TABLE 1 


MEAN CHANGE SCORES IN AGGRESSIVE ÅTTRIBUTION AS A FUNCTION OF ĪNITIAL LEVEL 
OF JUDGMENT AND EXPERIMENTAL TREATMENT: EXPERIMENT I 


Stick figures 


Pictorial situation 


Initial score 


Initial score 


0,1 2 
Experi tal M +21 +.4 
Experimenta! OE 3m) UE 5m) 
trol M +7 +8 
Contro (3f, 0m) (6f, 3m) 


3,4 0,1 2 3,4 
= +18 +8 e 
(3f, 8m) (4f, 7m) (8f, 4m) Gf, 5m) 

aM 
—.8 +14 +.2 : 
(8f, 5m) (5f, Om) (9f, 2m) (3f, 6m) 


Note.—The figures in parentheses represent the distribution of men (m) and women (f) in the respective cell. 


differences with age variation, the content of the 
situations was varied, a wide range of frequencies 
of aggressive attributions to both younger and 
college age males was employed, and the sources of 
frustration for each age group were quite similar 
in age and sex.? 


Procedure 


Series 1 of the stick figures and photographs were 
presented by means of slides to experimental and 
control subjects who were seen in groups of ap- 
proximately 10. In order to judge the stick figures, 
the subjects were provided with an answer sheet 
which had five alternative adjective choices for each 
stick figure. In each instance, one of the alterna- 
tives was an adjective denoting some hostile feeling 
or activity. Examples of such hostile adjectives 
include “annoyed,” “angry,” “sullen,” “mad,” and 
“indignant.” For the Pictorial situations the sub- 
jects were provided with a blank sheet on which 
they could indicate, for each Picture, how they 
thought Person B felt. The experimental subjects 
were then subjected to a series of insulting com- 
ments by a 29-year-old male experimenter who 
disparaged their motivation and cooperativeness, 
His accusations that they were not trying and were 
apparently not interested in research, etc. were 
arbitrary and unprovoked. The effectiveness of the 
procedure in arousing hostility has been demon- 
strated in previous studies (Feshbach, 1961). The 
experimenter delivered these comments in a hostile 
and condescending manner. In the control condition, 
the same experimenter made some neutral comments 
concerning the slide projector and the screen, 

In both the experimental and control conditions, 
the subjects then proceeded to judge the second 
series of stick figures and pictorial situations. These 


2A complete set of these slides can be obtained 
from the authors. The following examples are cited 

indicate the types of situations that were uti- 
i i (a) A college age male is holding a tie and 
oe at a second college age male, the caption 
a “You don’t mind if I borrow your tie?” 
age older woman is looking at a younger boy 
ie Ñe is opening a door. The caption reads, “Are 


you going out again?” 


er 
Procedures were carried out in an identical eN 
in the replication of the experiment, the stick | tort 
always being administered before the pic 
situations, 


Scoring 


an] situa” 
For both the stick figures and the pictorial aile 
tions, one point was given for every i 
ascription to the stimulus. The total scores 0? 13s 
stick figure series could then range from 0 ve ories 
while the scores on the pictorial situation jatten 
could range from 0 to 9, In the case of the initia 
a reliable scoring scheme, based upon the , 
Pre-experimental data, was readily established, 
being 95% agreement between two map pi 
scorers. The following words and phrases are y he 
examples of descriptions of the feelings "ads 
stimulus figures that were scored as hostile: to 
angry, irritated, hostile, irritable, sore, Pe p 
lose his temper, about to blow up, and aggress 


REsuULTS 


Je 
As was Previously noted, the first eP e 
ment could not be easily interpreted bec ptal 
of initial differences between the experime 0s" 
and control subjects, Since the range © Py 
sible change was quite restricted, esPe° ov? 
in the case of the pictorial situations: „p 
would expect, on statistical grounds, d 
inverse relationship between initial Jere ident 
degree of change. This relationship is ev us 
in Table 1. An analysis of the stick oni 
data presented in Table 1 reflected a tal 
cantly higher proportion of exp erm tio 
females increasing in aggressive attr weve! 
as compared to the control females. a De 
as Table 1 indicates, this difference al ip 
simply a consequence of the diffe Ta 
distribution of initial scores on this Pon 
between the experimental and 
females. It can be seen from Table 


ATTRIBUTION OF HOSTILITY To PICTORIAL STIMULI 


TABLE 2 


Mea 
N FREQUEN 
FREQUENCY OF AGGRESSIVE ATTRIBUTIONS TO 


ee FIGURES AND PICTORIAL SITUATIONS 
FORE AND AFTER THE EXPERIMENTAL 
AND CONTROL TREATMENT: 
EXPERIMENT II 


251 


TABLE 3 


FREQUENCY OF POSITIVE AND NEGATIVE CHANGES 
IN NuMBER OF AGGRESSIVE ATTRIBUTIONS 
TO STICK FIGURES AS A FUNCTION 
OF SEX OF PERCEIVER: 
EXPERMENT IT 


a_i Pictorial 
Stick figures situations Males Females 
N Before After Before After 
Males A = : ee ae es 
Baresi ; 
coperimental 3 He ig ig z Experimental 9 14 8 14 
Fe 12 a7 | Os 7 20 Control 3 9 5 3 
Males 
a > .10 F 
| Contpamental 22 2.9 2.5 1.8 3.1 e = 
18 2's 3.2 2:0 Us = 


— 


With inis; 
onsistent. level held constant, there is a 
Subjects. t tendency for the experimental 
| ie control attribute more aggression than 
hotogranhe to the stimulus persons 1n the 
mfcme While this effect is statistically 
figures nt only for males judging same age 
pi ama alig higher scores in judgments 
S an gt figures for the experimental 
“imental initially lower scores for the ex- 
females in comparison to their 


Teg 
ectiy, ; 
factors e controls may be confounding 


Co 
Cae uently the experiment was repli- 
Adom a new sample. In this instance, 
a Can Sampling worked as it should, and 
a Menge seen from Table 2, the initial 
wet S E Of the experimental and control 
me Axe ie quite similar. As in the earlier 
fig attrib scores in the amount of hos- 
pite aa to the second series of stick 
toy Pared ¢ Pictorial situation stimuli as 
ch bi initial series were computed 
Subs Wis ject. The distribution of these 
bo Jects dichotomized according to those 
oi b manifested an increment in 
bog ; Bae the second series versus subjects 
tile es fewer or the same number of 
therte first, spins to the second series as 
eis ho It is evident from Table 3 that 
Tay) ð reliable change in attribution of 
KON indi the stick figures. However, as 
fries, both the female as well as 
ee mental subjects manifest a 
Crement in hostility attribution 
et male stimuli. Again, as in 
è periment, there was no sigMmi- 
in response to the younger agè 


stimulus persons for either the male or 
female subjects. 


DISCUSSION 


The results indicate that angered indi- 
viduals will project hostile affect. The results 
also indicate that the stimulus object plays 
a critical role in the attribution process. One 
significant property of a stimulus is its cue 
value with respect to the relative frequency 
with which a particular interpretation is 
given to the stimulus (Birney, 1958; Haber 
& Alpert, 1958). The present experiment has 
been concerned with the effect upon projec- 
tion of another property of the stimulus— 
namely, similarity to the perceiver. Although 
all three categories of stimuli—the stick 
figures, the same age males and younger age 
boys—had been selected on the basis of 


having aggressive value, responses to only 


the same age males discriminated between 


the anger aroused and the control groups. 
A theoretical model which attempts to 


explain this form of projection as a joint 
TABLE 4 
FREQUENCY OF PosiTIVE AND NEGATIVE CHANGES 
ın NUMBER OF AGGRESSIVE ATTRIBUTIONS 
TO THE STIMULUS PERSON AS A FUNCTION 
or HIS AGE AND THE SEX OF THE 
PERCEIV! XPERIMENT IT 
== — 
Same age Younger age 
Males Females Males Females 
roe to- pi fir 
E § 16 6 5 gg 
Control a © 
p#>10 >10 


pr=.005 $ =.0027 


p Taes (one-tailed) determined by Fisher's exact test. 
sp va S 
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function of a lowered threshold for respond- 
ing with motive-related imagery and the 
presence of stimulus cues capable of evoking 
such imagery would not adequately account 
for the obtained data. Similarity between the 
stimulus person and the perceiver appears to 
be a critical variable in determining whether 
the perceiver will attribute his own emotional 
state to the stimulus person. Judgments of 
the stick figure stimuli which are symbolic 
representations of humans but which deviate 
markedly in physical similarity to “real 
people” were unaffected by the experimen- 
tally induced affective state of the perceiver. 
In addition, the experimental subjects failed 
to attribute or “project” their anger onto 
stimulus persons who were considerably 
younger than they. However, it is note- 
worthy that the female subjects in the 
experimental group as well as the male 
subjects projected their irritation and anger 
onto the male stimulus persons. While one 
cannot completely assess the effects of dif- 
ferences between the stimulus and the per- 
ceiver without varying the sex of the 
stimulus, it is evident that the presence of 
similarity with respect to the sex dimension 
did not vitiate projection by the female 
subjects, 
The present theoretical formulation of the 
process mediating affective attribution as- 
signs a significant role to stimulus generaliza- 
tion which in turn is presumed to be de- 
pendent upon stimulus similarity. Although 
“similarity” is a fundamental Psychological 
variable which is postulated to mediate phe- 
nomena ranging from transfer of training 
(Hovland, 1951) to displacement (Miller, 
1944) its effects from a priori standpoint 
are often difficult to assess, The relevance 
of a similarity dimension may vary with the 
situation and the particular response under 
study. Theoretical formulations are required 
which will lead to predictions as to the con- 
ditions under which particular dimensions of 
similarity become pertinent. By exploring 
the identifications and perceived similarities 
of the college-age student, for example, one 
may be able to provide a basis for Predicting 
when variations in sex and variations in age 
will influence generalization of some specific 
Oe a ties complication, if there is con- 
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: e 
siderable anxiety in connection wi kE 
expression of a motive, then one might © x 
that judgments of dissimilar rather The 
similar stimuli would be more revealing. eri- 
authors in a number of unpublished ee 
ments have been unable to demonstrate ea 
ably that the induction of aggression a > 
produces a relative shift in attribu What- 
anger to dissimilar stimulus persons. ticula 
ever the ultimate status of this par ida 
hypothesis, it is evident that the amea aly- 
of similarity require more systematic to 
sis and specification if similarity is 
used as a predictive variable. 
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SIMULATION ON THE CALIFORNIA PSYCHOLOGICAL 
INVENTORY AND THE ADJUSTMENT 
OF THE SIMULATOR 


FRANCIS M. CANTER 


Patton State Hospital 


This is a study of the relationship between 


` 


ability to present a good picture 
d the actual life adjustment ofS. 


on the California Psychological Inventory an 
licants for Ward Aide positions, both 


A group of 50 alcoholics and 50 app! 
highly motivated to appear “normal,” 
of well-functioning Aides, 
group of Poorer Adjusted A 


a good picture, under standar 
ment of S. The results are related 


the personality and not simply 


Mos : 
ates Pinte on the susceptibility of 
oe disto -type personality tests to delib- 
rll e e by the subject have used 
Xctionin ents or other relatively well- 
tally 5 people as subjects. They have 
Ma sanidad “hon to take the test under 
a ae: ‘honest? conditions and then 
e a instructions to distort the picture 
Core Be fon way, with comparison of 
to eee as indication of the 
PB? T ich the test is vulnerable to 
ie ologica the Manual for the California 
videne al Inventory (Gough, 1957) there 
on ee subjects can change their 
this test to a considerable extent, 
i testy S in ch usually resulted in tell-tale 
of «Attempt e Wb, Gi, or Cm scales. Some 
Seay faking» to correct for these possibilities 
ue pò eg, as the addition of validating 
SY forcing’ and Z scales of the MMPI, 
u fies choice between two items 0 
Ry is, 195 - ie equal social desirability 
crate attention has been paid 
ik personal adjustment of the 
t a “natural” inclinations tO 
Study in the best possible light. In 
und $ (Dicken, 1960) a relationship 
oo ability to simulate a 
tory on the California Psycholos- 
Nica] (CPI) and the sophistication 
| a knowledge of the subject. 
Olinger (1957), using psychi- 
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z 
ae 
p 
fel 
2 


a group of Better Adjusted Alcoh 
E Icoholics were compared under 

instructions. Results indicate a positive re 
d or faking conditions, 
to Gough’s concep! 


it is concluded that these questionaire resi 
sets to be corrected for. 


were compared. Additionally, a group 


olics, and a 
“fake good” 


Jationship between ability to present 
and the relative adjust- 
ts of socialization and 


sponses are meaningful aspects of 


cts, found that some of 
“fake good” instruc- 
heir MMPI profiles in 


certain ways and that ability to do this was 
related to favorability of prognosis for dis- 
charge from the hospital. These studies em- 
ployed the technique of giving instructions 
to fake good with subjects who presumably 
had no special motivation to answer other 
than honestly under the standard instructions. 
A point of possible theoretical and practical 
importance is the ability of subjects to pre- 
sent a “good” picture under standard in- 
structions, where there is presumed strong 
motivation to present such a picture and 
the relationship of this ability to the sub- 
ts actual life adjustment. It is specifically 
d that falsification of test scores and 
o so may be a personality 
e importance in its 


atric patients as subje 
their patients under 
tions could change t 


jec 
suggeste 
the capacity to d 
variable of considerabl 
own right rather than merely an undesirable 
and incidental factor to be “corrected for.” 

The present study compared the CPI re- 
sponses of two groups, each group presumably 
strongly motivated to present themselves in 
a favorable light, one group being objectively 
s yell-adjusted” and the other “poorly-ad- 
justed.” Additionally, the performance of the 
„adjusted group under fake good in- 
was examined in relationship to 
of adjustment and was com- 
the well-adjusted group’s per- 
der fake good instructions. 


poorly 
structions 
degree 


formance un 


3 
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METHOD 


Subjects 


The subjects of the study consisted of a group 
of 50 male alcoholic patients in a state hospital and 
a group of 50 male applicants for the position of 
aide at this same hospital. The alcoholic patients 
(poorly adjusted) were all involuntary admissions 
who were committed to the hospital for an in- 
definite period. The applicants, who were accepted 
for employment and worked for periods of at least 
6 months and usually much longer, were rated 
periodically on their effectiveness and personal ad- 
justment. Only those with a satisfactory 
after at least 6 months were included in the 
adjusted group. In other respects, the groups did 
not differ significantly from each other. The ap- 
plicants and the patients came from the same 
general socioeconomic backgrounds with occupations 
in the farming, small tradesman, salesman, factory 
worker, or construction labor areas. Median age 
for the alcoholic group was 41 and was 43 for the 
applicant group. Median years of school was, for 
both groups, 11. Mean Raven Progressive Matrices 
score was at the sixtieth percentile for the alcoholics 
and the sixty-third percentile for the applicants. 


rating 
well- 


Procedure 


The applicants 
Psychological Iny 


e hese patients were involuntary 
and quite anxious to be discharged from the 
hospital as soon as possible. They did not regard 
themselves as “mental patients,” felt insulted or 
put upon by their commitment to a mental hos- 
pital, and emphasized, in individual interviews 
and groups meetings, their basic normality and lack 
of difference from other People except for their 
bad habit of drinking too much. The Psychological 
examination was regarded by these patients as an 
effort to find out whether they were “crazy” or 
not, particularly since the test battery was pref- 
aced with the Rorschach. In general, it may be 
said that these patients regarded themselves, at least 
consciously, as personally “normal” except for the 
isolated area of drinking (which they were inclined 
to attribute to their biochemistry) and had ample 
motivation to convince the hospital staff of their 
eligibility for discharge at the earliest possible date, 

Of this group of 50 patients, 30 who were still 
available were requested to take the CPI again with 
instructions to 
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imagine you are applying for a job you really 
want and your employer will judge from ue 
test whether to hire you or not. Answer the o 
in such a way as to give the best possible im- 
pression of yourself, 


These 30 patients were then rated for general 
adjustment and acceptance of their drinking problem 
according to this scale: " 

1l. Past history: maximum score of 8 pole 
with 1 point each for items relating to ie 
patient’s work stability, maintenance of ane 
relationships, no previous hospitalization for ann 
ing, freedom from legal difficulties, and defini 
efforts to handle drinking by resort to Antabuse: 
Alcoholics Anonymous, or psychiatric help 5 

2. Locally devised questionaire: maximum SE 
of 4 points, based upon 34 items dealing Wit 
present acceptance of drinking problem sèn 

3. Interview: maximum score of 4 points, ipva 
for acceptance of self as personally maladjusr 
with drinking as a symptom, 2 points for consider! 
self as “normal” but with a drinking problem, a 
no points for regarding self as normal and W! 
no drinking problem 


3 r A ; nt 
Scores on this rating of relative adjustme 


ranged from 2 to 16 with a median score of A 


The 15 subjects in the upper half were compar 
with the 15 subjects in the lower half. These ae 
groups did not differ significantly on such variab es 
as age, education, Raven score, or average SCO" 
on the standard administration of the CPI. | ple 

Seventeen of the applicants who were availa ie 
were also asked to take the CPI under the fa "m 
good instructions, Because of circumstances, thes 


n 
tests were obtained after the applicant had bee 
hired. 


RESULTS AND DISCUSSION 


In Table 1 are presented the individual 
scale average raw scores of the alcoholics 4 
a group and the applicants as a group un i 
standard instructions. The differences betwer 
the two groups are highly significant on a 
scales except the Cm scale. In Table 2 “a 
Presented the individual scale average 1" 
scores of the Applicants, the Better Adjuste” 
Alcoholics, and the Poorer Adjusted Alc? 
holics (designated as Groups A, B, and ae 
respectively) under the fake good inst 
tions. All groups are able to improve ma 
scores but the Applicants show more improv 
ment than the Better Adjusted Alcoholics a 
these, in turn, show more improvement me 
the Poorer Adjusted Alcoholics. Althous a 
there are fewer statistically significant gan 
ences between groups under the faking ©° m 


dition, the general trend of the scores 
consistent. 


! 


| 
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SIMULATION AND ADJUSTMENT 


TABLE 1 


CALIFORNIA PSYCHOLOGICAL INVENTORY Raw SCORES 
OF ALCOHOLIC AND APPLICANT GROUPS: 
NORMAL ADMINISTRATION 


Alcoholic Applicant 
(N = 50) (N = 50) 

Scale M SD M SD t 

Do 21.9 5.0 32.1 5.9 9.18** 
Cs 14.8 4.5 20.8 44 6.70** 
Sy 20.8 5.3 26.9 3.7 6.57** 
Sp 31.0 6.1 35.6 5.8 3.83** 
Sa 174 3.9 223 45 5.77** 
Wb 34.9 8.4 40.6 4.2 4.22 
Re 261 41 329 4.6 7.76** 
So 293 6.1 39.2 43 s.50** 
Sc 268 7.9 34.7 6.3 5.48** 
To 169 48 246 5.2 7.62** 
Gi 143 6.0 23.8 6.8 731" 
Cm 26.4 15 26.6 1.7 0.62 

Ac 23.3 4.7 310 48 8.03** 
Ai 145 3.9 19.5 5.0 5.52" 
lẹ 32.7 5.3 41.0 5.9 7.36** 
EY 84 26 116 28 5.86** 
= 6.7 3.6 8.5 48 2.11* 
Fe 163 5.6 172 48 0.85 

Sisa 


eae results suggest that, under standard 
is itions of administration and where there 
jel sumed motivation on the part of sub- 
well, to present themselves as normal or 
ety dusted, there is a positive relationship 
si the ability to present a good picture 

of ike CPI and the actual life adjustment 
instru, subject. In addition, under fake good 
eit Ctions, subjects are able to improve 
ü test pictures to a considerable degree 
life even here there is limitation by actual 
oteg ustment. In this connection, it 1s 
the ~ that the Gi scores of all groups under 
usua ne conditions show the expected and 
dissemp found peak, indicating a conscious 
Pointed ling. However, Gough (1957) has 
Gi sco out that individuals obtaining high 
4 Biase do, in fact, tend to create a better 
iSticatty S on others and show some real- 

Y better adjustment. ; 

on a atral issue here is possibly the question 
Questi. Is meant by dishonesty in answering 
the Maire items. (Probably, the results of 
tring endard administration are more 1n- 
cally meaningful in this study than the 
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faking conditions, because instructions to 
fake something arouse a number of factors 
which are difficult to assess, including the 
subject’s interpretation of the purpose of 
the procédure, etc.) As mentioned above, 
most of the studies on response set, social 
desirability, and conscious faking have used 
subjects who by conventional criteria are 
fairly adequate and “socialized” people who 
at least have a background of personal social- 
ization and experience to help them to fake 
good. Gough (1948, 1960a, 1960b) has 
made a significant and central point of the 
idea that socialized behavior is rooted in the 
role-taking experiences of development and 
the asocial or inadequate person lacks the 
set of interactional expectancies and reaction 
patterns which permit more socially adequate 
behavior (including answering questionaire 
items on the relevant behavior). This is not 
contradictory to the findings of Jackson 
(1960) who concluded that response set and 
social desirability factors account for much 
of the variance in the CPI if the subjects in 
such investigations are individuals from the 
same cultural matrix with the same “folk” 


TABLE 2 
CALIFORNIA PSYCHOLOGICAL INVENTORY SCORES 
OF AIDES AND PATIENTS UNDER INSTRUCTIONS 
to Fake Goop 


Group C 
(N = 15) t 


SD A-B B-C 


Group A Group B 
(N =17) (N=15) 


Scale M SD M SD M 


7 2.0 31.0 9.3 1.25 0.73 


Do 37.2 4.3 35.7 

Cs 26.8 3.1 23.6 3.0 20.5 4.9 2.88** 1.31 

Sy 33.5 4.1 29.4 3.1 26.8 3.9 3.12** 1.46 
35.2 4.2 34.9 39 3.60** 0.13 


Sp 40.5 3.8 
Sa 23.7 3.0 22.9 
Wb 428 21 41.7 1.8 


2.8 21.5 2.3 0.74 1.43 
37.8 4.4 1.54 
28.1 6.3 


Re 37.7 3.7 35.7 2.2 : 
So 43.5 31 42.7 2.7 34.3 71 pee Te 

2 29.0 7.7 ai s 
Se 42.0 4.3 39.1 4 Le im 


26.2 2.9 18.8 4.6 
3.4 23.9 5.2 1.64 
ii 25.9 3.1 0.44 0.38 
26.5 5.6 1.07 


To 28.8 3.2 
Gi 32.5 5.2 29.9 
Cm 26.7 14 26.5 


32.3 4.0 33.5 1.9 J 
Re 0 18.9 3.0 13.9 3.4 2.09%  3.40%* 
Ai 21.2 3. £ F TF a 
4 3.7 42.4 3.0 35.3 K r . 

a iy 12.3 1.6 9.7 2.2 0.90 4,82** 

J 2. .0 a af $ 2.2 i! , 8 
rA Te Ai 4.7 20 31 3:0 362" He 
5 r 3 A9 O41 13: 
Fe 14.7 3.8 15.3 4.1 14.9 3 

+p <.05. 

ot Reber = Psychiatric aides; Group B = Better 


Note.—Group A G Bet 
aed Grotolic patients; Group C = Poorer adjusted 
alcoholic patients. 
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concepts and hence are able readily and 
naturally to fit into the way of thinking which 
is reflected as good adjustment on the CPI. 
Consistent with this is the finding by Sund- 
berg and Bachelis (1956) that their subjects 
were better able to fake bad than fake good 
on the California F Scale and the Gough Pr 
scale. From some recent research, Dicken * 
has indicated that efforts to “correct” for 
response bias on the scales of the CPI re- 
sulted in little improvement or even a reduc- 
tion in validity, suggesting that normal scores, 
response set and all, are meaningful aspects 
of the “real” personality, just as the ability 
to fake good says something meaningful 
about the person’s present or potential ad- 
justment, given the appropriate situation. 
Thus, a person who can and naturally does 
answer items in a more socialized way is 
drawing upon an actual and potential store 
of possibility for good adjustment than the 
person who has to concentrate on answering 
the items “correctly” and this person in 
turn has more possibility for good adjustment 
than the person who naturally does not or 


1C, Dicken, personal communication, September 
1962, 
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cannot answer in a socialized manner, even 
when asked to bend his conscious attention 
to it. 
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EFFECT OF PROFESSIONAL TRAINING AND AMOUNT 
AND ACCURACY OF INFORMATION ON 
BEHAVIORAL PREDICTION * 


JANIS H. WEISS? 


University of Iowa 


Behavioral predicti 


on tests were administered to 60 PhD clinical psychologists 


and 60 PhD physical scientists. Some judges had only minimal descriptive 


material from which to predict w. 


self-interview to utilize. In half the cas 


accurately descriptive of t 
tion was available to the judge, 
accurately than physical scientists. 
accurately than psycho! 


available. Use of the self-interview le 
did not significantly 


physical scientists but 
chologists’ predictions. 


mith the increasing interest in clinical psy- 
RT have come numerous studies investi- 
e variables that influence how clinicians 
Eoo at a valid set of predictions about a 
mad S behavior. The assumption is usually 
T that the formal training and profes- 
Tio experience of a clinician are important 
Predict in increasing the accuracy of his 
ections of client behavior. Although most 
a training programs are firmly rooted in 
Ww Premise, the empirical findings do not 
ays support it. 
Ypically when the predictive performance 
ee students in psychology is com- 
ists With the performance of PhD psycholo- 
are f no significant differences 1M accuracy 
os. a (Cline, 1955; Grigg, 1958; Kessen, 
(1943) Soskin, 1954a, 1959). Only Klebr 
lini 9) was able to report that experience 
in vans fared better than graduate students 
ate Wistive accuracy. In studies which com- 
of j Psychologically trained groups to groups 
ae who are trained in other areas, 
1959.7. Physical scientists (Cline, 1955; Luft, 
> Taft, 1955), it has often been found 


al 


of 


ation presented 
e University 
ents for 


l 
Th; 

pole F study is based upon a dissert 
a Towa epartment of Psychology of the 
he epee Partial fulfillment of the requirem 
* The e of Doctor of Philosophy- on 
the Soe wishes to express her appreciation 6 

S Oireen. Goodstein for his guidance throughou 
Velopment of this study and to the Old Gold De- 

t Fund for its financial aid. 


hile others had this plus a typescript of a 
es, the minimal information was not 
he S to be predicted. When only minimal informa- 
psychologists performed significantly more 
Physical scientists, however, perform more 
ogists when a greater amount of information is 
d to amprovement in the prediction of 


affect the accuracy of psy- 


that the latter groups perform more accurately 
than do psychologists. The results of Cline’s 
study in which engineering trainees were used 
are an exception. Indeed, Taft (1955) sum- 
marized the literature by suggesting that 
physical scientists and possibly other non- 
psychologists may be more capable of judging 
others accurately than are either psychology 
students or clinical psychologists. The fact 
that groups of judges of similar age and 
academic achievement have not been utilized 
in the above studies suggested a need for 
additional research. 

Another variable which is assumed to af- 
fect accuracy of behavioral prediction is the 
amount of information made available to the 
judge. Studies by Kostlan (1954), Sines 
(1959), Cline (1955), Borke and Fiske 
(1957), and Dailey (1952) indicate that 
accuracy of prediction can be enhanced by 
giving the judge more information from which 
to predict. However, several interesting limi- 
tations seem to apply to this general con- 
clusion. One is that prediction appears to 
be enhanced only by some type of interview 
or case history data, not by psychological test 


data Secondly, an increase in accuracy of 
i that made from minimal 


be found only when the 


have rep 
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increased amounts of information do not lead 
to more accurate predictions and in some 
cases to less accurate ones. Other investigators 
(Cline & Richards, 1960; Dymond, 1953; 
Hathaway, 1956; Leventhal, 1957) have 
pointed out that a small amount of data 
(usually class membership information such 
as age, sex, and vocation) accounted for 
nearly all the predictive accuracy of judges 
and that additional information appeared not 
to enhance predictions appreciably. Though 
accuracy of information is certainly presumed 
to affect predictive accuracy, few attempts 
have been made to experimentally manipulate 
the correctness of the information. A prelimi- 
nary study by Harvey (1938) apparently in- 
dicated that incorrect information is limited 
in its effect upon prediction of behavior when 
contradictory evidence is also present. 
The purpose of this study was to determine 
the effect of type of professional training, 
amount of information, and accuracy of 
information upon the prediction of the actual 
responses of individuals in real-life social 
situations. By using a group of males with 
PhD degrees in one of the physical sciences 
and PhD psychologists, it was possible to 
investigate the effect of training on prediction 
of behavior while presumably holding age, 
intelligence, and level of academic achieve- 
ment constant. Another purpose of the study 
was to investigate the effects upon predictive 
accuracy of differing amounts of information. 
To do this some judges had only informa- 
tion on the sex, age, and student status of 
the subject to be predicted while others had 
this information plus a typescript of a self- 
interview given by the subject. Finally, half 
of the judges were given accurate minimal 
information as to the presumed psychological 
health of the subject, ie., that the subject 
was a volunteer, hired to participate in this 
study, while the others were given inaccu- 
rate information, i.e., that the subject was 
receiving psychotherapy in the University 
Counseling Service for his (her) personal 
problems. i 
Because of the equivocal results of the 
research reported above, the following pre- 
dictions were based for the most part on 


common sense expectations: 


Janis H. Weiss 


Hypothesis 1. PhD psychologists should be 
more accurate predictors of the behavior of 
individual college students than PhD physi- 
cal scientists. e 

Hypothesis 2. Within these two professiona 
groups those judges who are given the greatest 
amount of information about the subject to 
be judged should be better predictors than 
judges receiving less information. i 

Hypothesis 3. Judges receiving the ene 
information about the psychological bey 
of the subject should predict the subject 
behavior more accurately than those receiving 
incorrect information. However, the incorrect 
information may produce poorer prediction® 
only for those judges who have the leas 
amount of other information. 


MetHop 


Experimental Material 


ired 
Three subjects, Ray, Ron, and Ruth, were hi 


to participate in an extensive program of pre 
logical testing and interviewing. The subjects pie 
naive volunteers from an undergraduate piya o i 
class. As part of the test battery, these stu! D 
were placed in a room with a tape recorder than 
asked to talk about themselves for no longer was 
30 minutes. The format for this self-interview ere 
developed by Gilbert (1959).8 The subjects were 
given 10 topics about which to talk which their 
intended to cover in brief form such things 45 ncesi 
educational and vocational goals, family expe iti 
emotional relationships, ethical and religious V! Jete 
and so on. Typescripts were made of these comp 
self-interviews in as exact detail as possible. 


Criterion Tests 


The criterion tests are similar to those described PY 
Soskin (1954a). Each of the subjects to be used 3] 
this study was interviewed by an experienced di o 
psychologist in an attempt to get information sence 
their childhood, adolescent, and college expe! 
Further, two close friends of each subject, one nour 
and one female, were also interviewed for 1 ation 
each in an attempt not only to verify informa 2 
given by the subject but also to obtain indepe ects 
assessments of the subject. For each of these 5U ane 
a multiple-choice, four-foil, 28-item test v avior 
structed, the test being based on actual be sibles 
as reported by the subject and, whenever por as 
verified by the friends. A sample question 
follows: 


wher 
There was a lot of talk at the restaurant * The 
Ruth works about something she had do 
incident involved: 


pal com” 
3 W. M. Gilbert and J. McV. Hunt, perso 
munication, 1959, 
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a. Ruth’s necking with a Negro employee at a 
party. 
b. Ruth’s organizing a strike for higher wages. 
5 Ruth’s reporting to work one night drunk. 
. Ruth’s keeping tips left for other waitresses. 


E judges used in the development and item 
ates of the three tests were all undergraduate 
he tee from the Introductory Psychology class. 
and cage two drafts of the tests were given to 30 
e judges, respectively. Items were revised if 
ie ae given by the judges to each one of four 
or as an item were not distributed within plus 
Renee three standard deviations from the expected 
So ae mean. This means the tests were constructed 
mining any response biases for naive judges were 
p ted and so that the information yielded by 
Bice alone did not enhance accuracy. The 
Dredete-, of the answers given by the judges was 
e e on the basis of the information from 
rate jects’ and friends’ interviews. In the final 
on ea pay to 132 judges, the mean correct score 
est k of the three tests was as follows: The Ron 
7937 3024 The Ray Test, 7.51; and The Ruth Test, 
the = Upan retest of 80 judges (1 month interval) 
est. as correct score on each test was: The Ron 
726, ay The Ray Test, 7.30; and the Ruth Test, 
ore None of these changes was significant. For a 
See Tote discussion of the test development 
orson (1962). 


Judges 


Si . . 
P) ei professional clinical psychologists with a 
legree and affiliation with either the Divisions 


of one 
menitical and/or Counseling Psychology of the 
Psycho Psychological Association constituted the 


the Ologist group. Their names 
ade edition of the American Ps; po 
or a Directory. The second group consiste 
Dhysj? sical scientists with a PhD de : 
Sele’? Chemistry, or engineering. Their names were 


Sele: v 
ln from Volume I, tenth edition of American 
all h of Science. As far as could be determined, 
k resently con- 


tf à 
hecteg OUr of the psychologists were p 
Six ed with an academic institution and all but 
an industrial 


ni 
oe 5 All judges were males who W 
Staq nd who had received their under 


u 
btaj Ate degrees in the United States. 
an 120 


4 . 
hy siggy Pe judges, 120 psychologists re be 
Orig; scienti by mail. 
Binal tists were contacted id mar chologists 


and Psy contacted, 49 of t 

the physical scientists com 
these judges did not constitute the necessary 
y of judges needed and did not cision? 
addii nee all cells, it was necessary to contact : 
Scientia Psychologists and 12 additional physica 


pleted the task. 


P. 
roce, dure 


Pri 
Drim, ef to the mailing of the materials, each | 
Professional groups was further subdivi 


on a random basis into 12 subgroups of five judges 
each. The divisions were made on the basis of the 
presence or absence of the self-interview typescript, 
the accuracy of the minimal information, and the 
subject to be predicted. The minimal information 
was included in the main body of the instructions 
given to all judges. Half of the judges were told 
that the subject to be predicted is a (age)-year-old 
(sex) college student who was hired to participate 
jn this study (accurate information). The other 
judges were told their subject is a (age) -year-old 
(sex) college student who is presently being seen 
in the University Counseling Service for help with 
his (her) personal problems (inaccurate informa- 
tion). In addition a complete typescript of the 
subject’s self-interview was included in the packet 
sent to half of the judges. Instructions were the 
same as those for the minimal information group 
except that these judges were asked to read the 
self-interview prior to taking the criterion test. Since 
there were three different subjects to be predicted, 
one third of the judges received the criterion test 
pertaining to Ruth, one third received the one per- 
taining to Ray, and one third received the one 


pertaining to Ron. 
RESULTS 


The mean accuracy scores and standard 
deviations for the treatment combinations are 
included in Table 1. In order to assess the 
differences in predictive accuracy between 
these groups, a 2 X 2 x 2 X 3 factorial analy- 
sis of variance (Lindquist, 1956) was used, 
the factors being type of professional train- 
ing (P), presence oF absence of the self- 
interview information (I), correctness of the 
minimal information (M), and subject to be 
predicted (T). The results of this analysis 
indicated that only the P X Į interaction was 
significant ($ < .001). It is suggested, there- 
fore, that the effect of increased information 
depends on the type of professional training 
of the judge. From Table 1 it can be noted 
that this interaction is due to the superiority 
of the physical scientists over the psycholo- 
gists in the self-interview condition but 
superiority of the psychologists over the 
physical scientists 1m the minimal informa- 
tion condition. None of the interactions o: 
main effects involving accuracy of the en 
information and/or subject to be predicte 

„as significant. 
was stent oss A 4 
hologist and p 


x 3) were then 
hysical scientist 


syc y j 
ma ior P alely; in order to investigate the 
ee af information while holding profes- 
effe! 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF THE ACCURACY SCORES FOR ALL 


TREATMENT 


COMBINATIONS 


Self-interview 


Correct minimal 


Incorrect minimal 


Ruth Ray Ron Ruth Ray Ron 
? M 6.80 7.20 6.80 6.60 5.40 7.60 
Psychologist SD 1.72 0.75 1.94 1.50 1.02 1.62 
k e a M 9.00 9.20 9.40 6.40 7.60 9.60 
Physical scientist SD 2.68 2.79 1.74 1.20 1.36 2:15 
No-self-interview 
Correct minimal Incorrect minimal 
Ruth Ray Ron Ruth Ray Ron 
; M 6.00 7.80 8.00 8.80 7.00 8.20 
Psychologist SD 1.79 117 245 3.54 228 244 
LRE M 6.40 6.80 6.80 5.40 5.60 7.40 
Physical scientist SD 1.50 0.98 0.75 2.06 0.80 2.24 
sional training constant. For the psycholo- DISCUSSION 


gists, none of the interactions or main effects 
was significant though the main effect of the 
self-interview information approached signifi- 
cance (.10 > p> .05). It should be noted 
that the direction of the difference was nega- 
tive, i.e., the use of the self-interview tended 
to lead to poorer predictions. Within the 
physical scientist group, the main effect of the 
increase information condition was significant 
(p < .001). The direction of this difference 
was positive, indicating that use of the self- 
interview led to improvement in predictive 
performance for the physical scientists. Ad- 
ditional factorial analyses were run in order 
to assess the effect of professional training 
while holding amount of information constant. 
Within the self-interview information condi- 
tion, the only significant result was the effect 
of professional training ($ < .001) with the 
physical scientists performing more accurately 
than psychologists. In the minimal informa- 
tion condition the main effect of professional 
training was again significant (p < 08). Silh 
the psychologists performing more accurately 


than the physical scientists. 


It would appear that psychologists 2a 
better predictors of the behavior of RT 
students than are physical scientists, Lee 
when minimal information is avallah 
Though it may be that psychologists ui 
developed more accurate knowledge of t 
behavioral base rates in the college studen 
population than have physical scientists, h 
amount and intensity of association W! 
college students that the judges in each gr oe 
have had in recent years must be taken i” 2 
account. The physical scientists were, pHi 
posely selected so as to minimize their 4° 
sociation with college students while Poy" 
chologists were selected who had maximu 
association with students. It remains re 
further research to investigate whether me 
may be personality factors associated Ta 
the choice of work setting which may to 
related to interest in and perhaps ability 
predict the behavior of college students- 

When the self-interview informatio? 
utilized in making predictions, physical a 
entists are found to be more accurate P 
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a pa are psychologists. It would 
Peiter > : erefore, that physical scientists are 
inci able to utilize the information con- 
Phan within the self-interview in order to 
a ae their predictive accuracy. Soskin 
cholo ‘singe 1959) has suggested that psy- 
Riedie / tend to overestimate a subject’s 
i ie ive propensities when using test 
ai thus tend to decrease their predic- 
by PREN on multiple-choice instruments 
ceog foils more indicative of mal- 
( 1955) he Hathaway (1956) and Cronbach 
Biot have further suggested that psy- 
Rs a are too concerned with the unique- 
etd t the individual to be predicted and 
o neglect their conceptions of the 

in any par- 
most clinical 
on i 0 asis is placed 
. Mcreasing the clinician’s sensitivity to 
É aspects of 
personality which differentiate a par- 
may be 

eee physical scientist does not @ 
or Sh reference toward t 
ilize que end of the continuum and 
curate the available material more ac- 
aed particularly when the individual to 
ce dicted belongs to @ “normal” popula- 


i Pethaps the finding of major importance 


aes of the criterion tests is that al- 
ests. improvement in scores is possible, the 
ce quite difficult and all the means 

around the expected chance mean. 


Th 

o 

taden, there is some evidence from an 

juq taduate population that on retest 
items in & 


Ses tend to respond to the 


an eos 

Pine consistent with their original Te- 

tee SS. (Thorson, 1962) and may not be 
der still 


spond; 
nding on a random basis, the rea 


a . 
al Y question whether the criterion tests 


Or 
for dan adequate or meaningful opportunity 
predictive 


abit Psychologist to test his 
Tuct s. No attempt has been made to con- 
holo items which would be judged by PSY- 
Os 8ists as relevant information though 
“s the items appear to have adequate 
city Since there is nO criterion 
o le by which material may be assessed 
its predictive importance and its 


Téley, 
ance to the functioning of @ psychologist 


e 
avail 
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at work, the experimenter is, of necessity. 
left to his own judgmental devices. Cons 
struction and choice of criterion measure 
would appear to be an area in which much 
important research still remains to be done. 
It was expected from prior research 
(Jentsch, 1958) that judges would differ in 
the accuracy of their predictions depending 
upon which subject they were asked to pre- 
dict. Of more interest is the lack of inter- 
action between the subject to be predicted 
and any of the other variables. This means 
that though the amount of information in- 
fluenced the accuracy of prediction, the ease 
with which the subjects could be predicted 
remained essentially the same. Coupled with 
Jentsch’s findings, it makes more feasible the 
notion that only a limited number of subjects 
need to be used in research such as this 
while still allowing for generalizability of 
results to a meaningful larger population. 
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NOTES AND COMMENT 


APPLICABILITY OF THE NORMS OF THE WECHSLER INTELLIGENCE 
SCALE FOR CHILDREN TO FIVE-YEAR-OLDS 


MICHAEL BELDOCH * 


Teachers College, Columbia University 


The WISC was administered to 90 children between the ages of 5-0 and 6-11, 
29 of whom were between the ages of 5-0 through 5-3, in accordance with 
the selection procedures of a school for gifted children. The youngest children 


were found to have a mean Full S 
than that of the group 5-4 in age an 


cale IQ approximately 10 points lower 
d older. A retest 11 months later on 14 


of the children of the youngest age group and 15 of the older Ss showed a rise 
in IQ scores of 9 points for the youngest group, those of the children originally 


tested when 5-4 or o 
interpreted as demonstrating a nee 


Ider remaining unchanged. The results of the study were 
d for revision of the extrapolated norms 


now provided in the WISC manual for the age range 5-0 through 5-3. A 


question was raised as to 


ae use of the WISC with subjects at the 
My age ranges of the test has been criticized 
ie ber of investigators. Delp (1953) noted 
*Vidence that the difficulty of the test for children 
com e lower age brackets may make results in- 
tome ee with those of other instruments. In 
With mn’ the Stanford-Binet Intelligence Scale 

the WISC, Krugman, Justman, Wrightstone, 
chig om (1951) found that the younger the 
avor the greater the difference in scores in 
Scale T the S-B relative to the Verbal and Full 

Q scores on the WISC. 

(1oqg5" , Standardizing the WISC, Wechsler 
tested his sample of normal children 


lo 


withi 
includ, Ls months of their midyear and did not 
5 “de subjects of 5 years, 0 months through 


ea 
anual? 3 months of age. Nonetheless, the test 
for ey, Provides tables of normalized scale scores 
S an, ory 4-month interval between the ages of 
that 15 years. Anderson (1953) has written 
iti 
ìs qj 
tor Seal to see how Wechsler can arrive at norms 
Sani, 4-month period and be consistent with 
here ee objection to interpolation and extrapola- 
© Cony Cores and his insistence that a person’s results 
testin, Pared with norms based upon results of 
ef Persons of his own age [p. 4781. 
errj n 
the dame to the lowest age groups, especially 
€r members of the group, Fraser (1959) 


E$ 
N 
OW at the College of the City of New York. 


the usefulness of the WISC with 5-year-old children. 


maintained that the standardization sample of 
WISC performances had not been adequate. 
The present study attempted to test the clinical 
impression that the use of the WISC with 
children 5 years, 0 months though 5 years, 3 
months of age, norms for whom are included in 
the WISC manual but are not based on standard- 
ization data, put them at a distinct test dis- 
advantage when compared with older subjects. 


METHOD 


In the Guidance Laboratory of Teachers College, 
Columbia University, a psychological clinic, almost 
200 children were administered the WISC in a pro- 
gram of selection of students for a municipally 
operated school for the gifted in New York City. 
Of this group, 90 were between the ages of 5-0 
and 6-11 at the time of testing. All testing was done 
by graduate students in psychology. 


TABLE 1 


IQ ScoRES ON FULL SCALE OF THE WISC ror CHILDREN 
5-0 tHRovcH 6-11 


Ages N M SD t 
29 111.03 15.23 —1.68* 
19 122.00 10.13 
5-8—5-11 18 120.22 13,53 
6-0—6-11 24 121.00 14.92 
*p< .025 
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TABLE 2 


Retest IQ Scores on FuLL SCALE or THE WISC For 
CHILDREN 5-0 THROUGH 6-11 


Age at 

original 

testing N M SD t 
5-0—5-3 14 121.57 13.20 —.43 
5-4—6-11 15 123.33 7.97 


Table 1 shows the mean IQ scores and standard 
deviations of the Full Scale of the WISC for the 
sample of 90 children, grouped into 4-month jn- 
tervals for ages 5-0 through 5-11 and in a year- 
interval for the 6-year-olds. It will be noted that 
the youngest age group (5-0 through 5-3) was sig- 
nificantly lower in mean Full Scale IQ score than the 
combined three older groups (p= 025), the three 
older groups having been combined because of highly 
similar mean IQs. In order to determine if this 
difference was due to the age factor or arose through 
some special sampling error, it was decided to test 
the children once again after the lapse of 11 months.2 
Tf the youngest children had been placed at a test 
disadvantage, the Passage of time should reveal a 
sharp rise in the mean IQ of this group while those 
of the older children should remain relatively 
constant. 

Approximately 1 year later, it was possible to 
contact 29 of the children and reschedule them for 
testing. Of this retest group, 14 had been 5-0 through 
5-3 at the time of the original testing while 15 had 
been 5-4 through 6-11. The graduate students who 
administered this test were different from the original 


RESULTS 


Table 2 shows the mean IQ scores and standard 
deviations on the Full Scale of the WISC for 
each of the two retested groups. From these 
results it can be seen that the younger children 
now obtain scores very similar to those of the 
older subjects and that the previously noted dif- 
ferences have almost entirely disappeared. The 
scores of the older children remained virtually 


2 The writer is indebted to Rosalea Schonbar for 
this suggestion. 


NOTES AND COMMENTS 


the same. ¢ tests comparing the samples retested 
with the rest of the original samples show no 
significant differences to obtain between the 
smaller retest groups and the rest of the NV of 90 
from which they were drawn. 


Discussion 


It seems clear that the extrapolated norms for 
ages 5-0 through 5-3 availables in the wey 
manual are in need of revision since they ei: 
to place children of that group at a distinct be 
disadvantage. The results of this study Mier 
question the advisability of the use of the wi 
with children of this age, in confirmation © 
Delp (1953). Since even among children Ne 
essentially superior intellectual ability, the su i 
jects of this investigation, the WISC scores be 
spuriously low, it might be argued that wi 5 
youngsters of lesser capacity, even greater di! j 
culty will be encountered, The subjective wi 
sions of the psychologists who examined H 
subjects in the original testing were that d 
younger children were often unable to ne 
the directions printed in the manual and v 
“at a loss” with the test materials. It was thonen 
that the major difficulty had been experience the 
the Performance Subtests, but an analysis of 
Verbal Scale versus the Performance Scale 
scores failed to support this hypothesis. 
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RESEARCH CREATIVITY IN PSYCHOLOGY GRADUATE STUDENTS 


MARTHA T. MEDNICK? 


University of Michigan i 


The validity of the Remote Associates Test (RAT) was investigated. Graduate 
student advisors rated individuals on a research creativity check list. These 
ratings were intercorrelated with RAT and Miller Analogy Test scores and with 


grade point averages. The ratings correlated significantly only with RAT scores 


(r= .55, p < .005 for 43 Ss). This supports the use of this test as a selection 
device where research creativity is a desired trait. 


Within an associative framework, creative 
es is regarded as the process of forming 
ae combinations of associative elements. The 
pes mutually remote the elements utilized, the 
sti i creative is the solution. An additional 

a ation of this definition is that the solution 
ass Useful or meet specified requirements. The 
in Ociative approach, which has been described 

greater detail by Mednick (1962) has led to 
thina el OPment of a measure of the creative 
Tost ing process called the Remote Associates 
nick (RAT) (Mednick, 1962; Mednick & Med- 
Word 1962). The test is composed of 30 three 

items of the following type: 


cottage 
class * 


Example 1: rat blue 
Example 2: railroad girl 


aot words are chosen from mutually remote 

edipte clusters and the task is to find a 
ea E link between them. The link is strictly 
ieee and does not follow rules of logic, 
able a formation, or problem solving. Consider- 
orm: ormative data has been gathered on various 

S of this test. 

advis, dnick (1962) reported that when faculty 
olors were asked to judge whether their psy- 
Tese BY graduate students were high or low in 
rch creativity, the resulting judgments 
highly positively to the students’ RAT 
The criterion measure used was roug! 


lp: F 
` his research is supported by the Cooperative 


c 4 
tra, arch Program of the Office of Education, Con- 


and did not, of course, give any notion of the 
value of the contribution of the middle range 
of scores. This study represents an attempt to 
assess this relationship in a more detailed fashion. 


METHOD 


Creativity Rating Scale (CRS) 


In the course of his work on scientific creativity 
and productivity, Taylor (1957) developed a 
Thurstone-type research creativity rating scale. For 
our own work, 24 of the items were reworded so 
as to make them suitable for the rating of graduate 
students.3 The scale values used are identical to 
those used by Taylor. When these scales were dis- 
tributed, the research advisor was asked to rate only 
those individuals whom he knew well. 


Remote Associates Test 


The RAT was administered to graduate students 
entering the Psychology Department at the University 
of Michigan in 1959 and again in 1960; a group of 
Northwestern University graduate students was also 
tested. This gave us a total of 81 Michigan and 
Northwestern students. At the time of the ratings, 
23 of these students had terminated their graduate 
education and 15 others had not worked intensively 
enough with any one research advisor to be rated in 
a reliable manner. CRS ratings were obtained on the 
remaining 43 students. The ratings were gathered 
between 1 and 2 years after RAT administration 
and the raters were not aware of the RAT 
scores achieved by their advisees. Miller Analogy 
Test (MAT) scores and grade point averages (GPA) 
were obtained from subject’s records but were not 
available on the entire group. 


ct : 

Soiva toe Martha T. and Sarnoff A. Mednick, RESULTS AND DISCUSSION 
rs shit 

Tosti Visiting Research Associate, Psychological The means, ranges, and standard deviations of 
mar Kommunehospitalet, Copenhagen, Den- the RAT scores of various graduate student sub- 

2 

Ans 3 i f the revised scale may be obtained fro 

Worn. Wers stems: 1. cheese; 2- Copies © m 

Otking, to sample RAT items ; the author. 
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groups are given in Table 1. The norms for the 
two undergraduate groups were also included in 
order to provide some comparison data. It can 
readily be seen that the specially selected group 
of honors students did better than unselected 
undergraduates and that the graduates as a whole 
obtain scores similar to the former group. The 
characteristics of the group which received ratings 
appear to be quite similar to those of the graduate 
student group as a whole. 

The average CRS scores ranged from 18.0 to 
56.2 with a mean of 40.7. The internal reliability 
of the CRS was checked in the following manner. 
The items were arranged in order of their scale 
values. These were then divided into odd and 
even halves and two separate CRS values were 
computed for each individual. The Spearman- 
Brown reliability coefficient was .93. 

The correlations which were obtained between 
the various measures are presented in Table 2. 
We should note that the mean MAT score of 
this group was 71.1 with a standard deviation of 
12.12. The norm group mean was 65.8 with a 
standard deviation of 13.5 (Miller, 1960). The 
effect of this restriction of the distribution was 
evaluated by a correction of the correlation of 
the MAT and RAT (Guilford, 1956). The cor- 
relation of the two tests was thus raised to .44, 
This indicates that the distribution of the MAT 
Scores in our group is not different enough from 
those of the original norm group to have an 
important effect on the correlations reported here. 

The correlation of greatest interest from our 


TABLE 1 


REMOTE ASSOCIATES TE 


ST SCORES or PsycHoLocy 
GRADUATE AND Un 


DER-GRADUATE STUDENTS 


Group N M 


Range SD 

Psychology graduate students 
Rated 43 19.8 10-27 4.3 
Drop-outs 23 20.8 09-28 4.9 
Not rated 15 20.0 16-26 3.1 
Total 81 20.1 09-28 4.0 

Undergraduate students 

Freshmen 312 16.3 00-28 52 
Psychology honors 23 20.0 11-28 42 


NOTES AND COMMENTS 


TABLE 2 


MATRIX or INTERCORRELATIONS OF 
RATING SCALE AND Test SCORES 


Measure RAT GPA MAT 
CRS 
A .55** 06 —.08 
N 43 16 24 
RAT 
r =f A1* 
N 26 a 
GPA 
—.03 
A 25 
*p<.025. 7 
+*+ p<. 005. 


point of view is, of course, that of the RAT = 
the CRS. It is evidently possible to pre st 
judgments of graduate student research perfor ë 
ance from these test scores. This Tse typ 
of rating was not si nificantly relate 

scores or to GPA, This property of the ees 
recommend it for use in selection of indivi heat 
in those instances where research creativity 5 

as is measured by the CRS is a desired trait. 
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MMPI PROFILES OF EPILEPTICS: 


A FURTHER EVALUATION 


EDWARD J. JORDAN, JR+ 


Research Service, Medfield State Hospital, Massachusetts 


This study investigated (a) the validity 
(b) whether the signs may reflect epilepsy specifically 


of MMPI-derived epilepsy signs, 
or brain disorder in 


general, and (c) whether age and/or IQ affect incidence of signs. 32 epileptics 
were compared with 19 neurological patients, 19 psychotics, and 26 neurotics, 
all male veterans, in percentage of patients surpassing given numbers of signs. 


Optimal cutoff points failed to discriminate epileptics from any 
psychotics, and this was due to unique characteristics of the psychotic group. 
The signs are thus not regarded as generally valid for either epilepsy or brain 


group but 


disorder. Age and verbal IQ apparently do not affect incidence of signs. 


Hovey, Kooi, and Thomas (1959) report a 
Set of 14 signs in MMPI profiles which dis- 
Criminated epileptic from psychiatric patients 
eo the .001 confidence level. A cutting point 
of Seven or more signs correctly identified 67% 
a the epileptics in the sample and misclassified 
S epileptic only 18% of psychiatric patients. 
repli e present study is primarily an attempt to 
“blicate the findings of Hovey, et al. (1959) on 
of iaependent sample of patients. In the course 
a his, the possibility was explored that the signs 
re a function of brain disorder in general rather 

an of epilepsy specifically, and the relationships 
ex, epilepsy signs to age and intelligence were 

amined, The latter two questions have not 
ee far been dealt with in the literature and 
es of some possible importance to the inter- 

ation of the signs. 


METHOD 


Gat sample consisted of 96 patients in a North 
y Ma Veterans Administration Hospital, includ- 
chia epileptic, 19 other neurological and 45 
Psych cases, The latter group was made up © 

Sh Stics (13 schizophrenics and 6 manic-depres- 
hysteri and 26 neurotics (comprising anxiety states, 
Sives) ics, obsessive-compulsives, and neurotic depres- 
all groups bore discharge diagnoses. Diagnoses 


NE vast c 
an Weed were based on abnormal EEG tracings 
lopicgy cally authenticated seizures; other neuro- 


ss included a variety of brain involve- 


1 . 
Was his investigation was begun while the author 
tion oan psychologist at the Veterans Administra- 
dye ,.°SPital, Durham, North Carolina. Thanks are 
Makin, Noble David and Russell F. Tomlinson for 
Thanke the appropriate records available for study. 
Dona ate also due to Theodore X. Barber and 
Yaluab) M. Isaac of the Medfield State Hospital for 

anusa; SU8gestions and critical reading of the 

Script, 


ments but had no history of seizures. The psychiatric 
groups had been inpatients on the psychiatric service 
of the hospital. The psychiatric groups were con- 
stituted by drawing every third relevant case on 
file in which an MMPI had been given during the 
hospitalization covered by the final diagnosis. The 
epileptic and other neurological groups were made 
up of all patients who met the criteria for inclusion. 

Patient’ MMPI profiles were scored for the signs 
given by Hovey et al. (1959). Results were tabulated 
as percentages of subjects in each group reaching 
or surpassing cutting points from one to all 14 signs. 
In this, the psychiatric group was considered both 
as a whole and in its two component groups. For 
each between-group comparison, fourfold contingency 
tables were prepared to show the percentages of 
patients in the two groups obtaining and not 
obtaining the maximally discriminating number of 
signs. These contingencies were evaluated statistically 
by the chi square test for independent groups. 

The correlations of number of signs with age and 
intelligence were computed by the Spearman rank- 
order method. Intelligence here is Wechsler-Bellevue 
verbal IQ, as estimated from Ohio Literacy Test 
scores according to the regression equation reported 
by Strong (1959). 


RESULTS 


Table 1 presents the percentages of patients in 
each group reaching or surpassing all possible 
cutting points. Validation of the signs requires 
at some cutting point a relatively high percentage 
of epileptics and relatively low percentages of 
patients in other groups. This situation occurs 
only in the comparison between epileptics and 
psychotics at the 5- and 6-sign cutting points. 
where 56%—26% and 44%—16% splits are 
found. As the table shows, the plurality of 
epileptics over psychotics at these two points is 
statistically significant beyond the .05 level of 
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TABLE 1 
PERCENTAGES OF PATIENTS IN Each Group REACHING OR Passinc Various CUTTING POINTS 
Number of signs 
Groups 1 2 3 4 5 6 7 8 9 10 11 12 13 14 
: A 5 rf c 0 0 
ileptic (N = 32) 94 88 81 75 Sh 44 31 25 16 3 3 3 
eia (N = 19) 95 90 79 68 53 37 16 10 5 0 0 0 0 0 
Psychiatric (V=45) 91 82 71 67 44 36 22 22 11 4 4 2 2 0 
Psychotic (N = 19) 89 79 63 63 26, 16, Sa Sa 5 0 0 0 0 P 
Neurotic (N = 26) 92 85 76 69 58 SH 35s 35% 15 8 8 4 4 


Note.—Entries in the same column with subscripts a and b differ beyond the .05 confidence level. 


confidence (minimum x*= 2.84, p < .05 for a 
one-tailed test.*) 

Neither the neurotic nor the neurological group 
is significantly lower in signs than epileptics; 
for these comparisons the lowest value is above 
.25. In fact, the proportion of neurotics at several 
of the cutting points is actually higher than the 
corresponding proportion of epileptics. 

Table 1 also shows that the signs statistically 
differentiate the psychotics from the neurotics at 
the 6-, 7-, and 8-sign cutting points. 

Table 2 gives the correlations of age and 
estimated verbal IQ with number of signs ob- 
tained within each group and for all groups 
combined. No correlation coefficient reaches an 
acceptable confidence level. The highest degree 
of correlation, —.25 among the epileptics be- 
tween estimated verbal IQ and number of signs, 
reaches merely the .18 confidence level, for a two- 
tailed test (¢ = 1.40, using the formula presented 
by Siegel, 1956). 

Discussion 


The present results indicate that the epilepsy 
signs proposed by Hovey, Kooi, and Thomas do 
not perform their intended function in a popula- 
tion of North Carolina veterans. The only rele- 
vant discrimination is between epileptics and 
psychotics, and there are two limitations on the 
value of this finding. 

First, the discrimination seems more associated 
with uniqueness of the psychotic group than of 
the epileptic group. Table 1 shows that the 
psychotics drop off markedly at the 5-sign 
cutoff point and remain below all other groups 
up to the 9-sign point. Of these differences 


2 The one-sided statistical test was used here and 

wherever epileptics were compared with other 

ince any tendency of the epileptics to have 

Epa E than a control group would be prima 

fewer i nce of the signs’ invalidity. All comparisons 

Eo eee oa epileptics were evaluated by two-tailed 
n 


statistical tests. 


only some with the neurotics are large enough for 
statistical acceptance, but the average of the 
psychotic groups’ maximum separation from com- 
parison groups is nearly twice that of the epileptic 
group. Furthermore, the optimal cutting points 
for differentiating epileptics are less uniform than 
are those for psychotics, The signs, then, are 
somewhat better as negative indices of psychos! 
than as positive indices of epilepsy. ; f 
The second limitation to the discrimination i 
epileptics from psychotics lies in its relatively 
low true-positive and high false-positive at 
These are considerably less favorable than a 
found by the original investigators. Generally 
speaking, the successful identification of age 
half the epileptics does not seem worth 1 
price of incorrectly naming around 20% of PSY 
chotics as epileptic. Since psychotics are apt a 
outnumber epileptics in many settings and sr 
the majority of patients diagnosed psychotic a 
not epileptic, the actual number of false-positiva 
cases in any unselected population would ae 
likely to dwarf the roughly 50% of epilepti $ 
correctly named. The resulting increase in frita 
less referrals for epilepsy workup would overlo at 
available facilities in most settings to an exte 
that would make use of the signs impractical. 


TABLE 2 


TED 
RANK-ORDER CORRELATIONS OF AGE AND ian s 
VERBAL IQ with Numer or Sicns OBTAIN! 


Groups Age vIQ 
Epileptic (N = 32) —.05 i 
Psychotic (N = 19) 10 a 
Neurotic (N = 26) —.03 0 
Other 12 

Neurological (N = 19) al, f 
All groups .03 Er 


NOTES AND 


As the selection of patients before testing be- 
came more accurate, the practical embarrassment 
of this false-positive rate would decrease. But 
very high accuracy in weeding out nonepileptics 
would be required before the actual number of 
false positives became a negligible percentage of 
Patients judged by the signs. Yet it should be 
Temembered that the signs are intended to assist 
Physician’s judgments, not vice versa. Also, little 
More than half the true epileptics would be 
identified unless a lower cutting point were 
chosen, and the present study does not indicate 
reliable differentiation of epileptics at any lower 
Cutting point. 

The crux of the difficulty in using the signs 
With the present population is the fact that the 
ite Carolina epileptics obtain many fewer signs 

n Hovey, et al.’s epileptics. The percentage 
curve of North Carolina epileptics over all num- 
ers of signs is significantly lower (= 8.31, 
’<.05) than that of Hovey and associates 
1959) two epileptic groups, by the two-tailed 

olmogoroff-Smirnoff test (Siegel, 1956, P- 135). 
an is is seen more clearly in the fact that Hovey 

associates’ two epileptic groups had means 
(y. 9.6 signs (original group) and 8.1 signs 
sS alidation groups), while the present epileptic 
oe Pas a mean of 5:2 signs, Comparisons 
oe psychotic and neurotic groups 10 the two 

les show negligible and nonsignificant dif- 
erences, 
on it is only in the epileptic sample er n 
able and the earlier findings are no oo pa A 

erer The question thus arises of possible di 

meres between the samples on one OF more 
pil acteristics influencing the tendency to obtain 
son PSY Signs. The two samples seem similar in 
Who. Tespects: they are both groups of veterans 
ve required hospitalization, and their 
Sentia] diagnoses were apparently basra io Gr 

Be a Y the same criteria. On the vara es a 
data nd verbal IQ investigated here, compara 
et al ¢ Rot reported for the sample of = 
Det 1°59), but the low insignificant correlatio 
Signs > cach of these variables and number at 
either 2008 the present epileptics suggests tha 
Signs ņ Variable is critical to the difference M 

e ma the samples. ” 
fto Rida epileptic samples do, however, oats 
ited idely separated geographical areas 0 : 
their st States. Hovey et al. (1959) carried ou 
don oady on Utah veterans, while this study was 
tion ih, North Carolina residents. This distinc- 
the Sa; ay imply a number of differences between 
their ples, some of which might be relevant to 
Constitu, erence in signs. For example, Negroes 

uted 19%, of the North Carolina sample, 
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and are no doubt less strongly represented in 
the Utah sample. Another possible factor lies in 
the difference in dominant religious influence in 
the two areas, Mormon as opposed to Baptist. 

Whatever demographic variables may be oper- 
ating here, it is suggested that their influence might 
well lie in a difference in attitudes toward epilepsy 
reflecting itself in different patterns of MMPI 
responses by epileptics in the two samples. Other 
patient groups studied here do not seem dif- 
ferentially influenced despite subcultural dif- 
ferences. 

The failure to find more general validity to 
the signs in the present population makes 
equivocal the finding that organics without 
seizures are not different in signs from epileptics. 
In this population it is just one aspect of the 
signs’ inadequacy. However, it leaves open the 
possibility in the findings of Hovey et al. (1959), 
that the signs discriminate epileptics because they 
have brain dysfunction. 

The epilepsy signs originated as a means of 
differentiating true epileptics from patients with 
“spells,” i.e. epileptic-like behavior not accom- 
panied by abnormal brain-wave patterns or clear 
cut seizures. This study casts no light on the 
signs’ value for this purpose. However, neither 
does the validation procedure of Hovey et al. 
(1959). These authors validated their signs on 
several psychiatric groups not selected for 
“spells.” It is precisely these latter differentia- 
tions that the present study cannot confirm. In 
the original study the epilepsy signs also dis- 
criminated patients with paroxysmal and non- 
paroxysmal EEGs. The present findings also do 
not bear on these results. 

It does seem clear that the epilepsy signs are 
not universally valid and useful. Where they are 
valid, the possibility remains that they are signs 
of organicity, not epilepsy specifically. It may 
tentatively be suggested that age and intelligence 
will not prove critical variables in future study. 


3 Hovey et al. (1959) used a third psychiatric 
group of “personality problems,” which has no 
counterpart in the present study. 
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SEX AND PERSONALITY DIFFERENCES IN 
RELATION TO FANTASY? 


C. W. LaGRONE 


Texas Christian University 


In an attempt to determine (a) the nature of the daydreams of representative 
college students, and (b) how variations in these daydreams are related to 
differences in sex and personality, typical daydreams written in narrative form 
were collected from 96 women and 123 men. Results show that daydreams of 
men differ significantly from those of women both with respect to need drive 
and content area. Evidence that some personality variables among men and 
women are related to differences in daydreams is suggestive but not conclusive. 


Fantasy as obtained from clinical study or via 
projective methods as applied to normal or ab- 
normal subjects has received a good deal of at- 
tention. Despite the data which have been ac- 
cumulated, little is known, as Singer and Schonbar 
(1961) have pointed out, concerning the day- 
dreams of normal people and how variations in 
their daydreams relate to specific personality 
characteristics. The present study was thus under- 
taken to gather data relative to two questions: 
What kinds of daydreams do typical college 
students have, and how are variations in these 
daydreams related to differences in sex and 
personality? 

The approach here employed is unique in two 
ways. First, each subject wrote in narrative form 
his most typical daydream. Singer and Schonbar 
used the questionnaire method to study the rela- 
tion of personality characteristics to reported day 
and night dreams in women graduate students. 
Shaffer and Shoben (1956, p. 206) report data 
concerning daydreams of normal people but they 
too used the questionnaire method. The second 
way in which the present study is unique is in the 
classification of daydreams on two dimensions: 
need drive and content area. The Shaffer and 
Shoben data list only types of daydreams, and 
while some of the types such as “Money or 
possessions” and “Sexual” clearly refer to con- 
tent, other types such as “Worry” and “Homage” 
cannot be identified as to content, nor can the 
motivation be inferred. 


PROCEDURE 


The subjects for the study were 96 women and 
123 men who were taking the sophomore level course 
in General Psychology at Texas Christian Unviersity.2 


1 This study was supported by the Hogg Founda- 


i University of Texas. , =. 
get other reasons, 45 students taking a junior 
lective course in Abnormal Psychology were ex- 
maed from this sample because the means for the 


The data were collected anonymously and were ob- 
tained from 96% of the members of the classes 10 
the following manner: each student was given an 
envelope which contained the Minnesota Multiphasi¢ 
Personality Inventory with answer sheet and ye 
additional sheets of paper. At the top of the firs 
sheet were the following statements: 


(1) Are you male or female? (2) Write in story 
form your most typical (characteristic, representa 
tive, or frequent) daydream or fantasy. Please 
give as detailed an account as you possibly can. 


At the top of the second sheet were the following 
statements: 

In whatever manner seems most appropriate 
you, write down as many statements as yOu sA 
concerning your daydreams. I want to know pren 
things as how often you daydream, the dine 
kinds that you have, how you feel about A A 
daydreams, and everything else about your i 
dreams that you can remember. 


The students were told to complete the materials 
out of class and to return them on a certain E 
An identifying number was written on each of t® 
different sets of materials. The fantasies were © p 
independently by two raters. The coding sheet wee 
double entry table involving need drive on one ee 
and content area on the other, Thus, each 14” 
checked for each daydream the cell which in n 
opinion corresponded to the primary need drive rive 
the primary content area in which the need on 
was expressed. If a secondary need drive Or cor de 
area seemed evident, it was also checked. Inclu for 
also on the coding sheet were three 5-point scales o 
ratings of Acceptance, Frequency, and Variability 
the person’s daydreams as indicated from the K Sge 
ments on the second sheet. The raters’ coded TBM 
ments and the MMPI scores were punched 0” 

cards for analysis on a 1620 computer. 


Women in Abnormal on all of the scales © the 
MMPI (except the Mf Scale) were higher Palogy 
means for the women in General Psych? 
($ < 01). Cf. Wise (1959). 
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RESULTS AND DISCUSSION 
Rater Agreement 


rig: ene between the two raters was fairly 
a . The correlation for Acceptance was .60, for 
a gea .79, and for Variability .64. In 42% 
aoe daydreams, the raters were in agreement 
the rning both the primary content area and 
mee ee need drive. There was 65% agree- 
= concerning primary need drive and 62% 
‘oot concerning primary content area. Al- 
ag a secondary need drive and/or content- 
the Hise recorded only in about one-fourth of 
oe aydreams by either rater, one rater coded 
so candar what the other coded as primary m 
a of the daydreams, both with respect to need 
ve and content area. 


AN ‘i 
ex Differences 


dea Square tests for each rater show that the 
nf Fi ‘ams of men differ significantly from those 
fontani both with respect to need drive and 
respect area (p< .001 for each rater). With 
and y to the variables of Acceptance, Frequency, 
i eu ability; only one statistically significant 
ir oie was found. Men are more accepting of 
laydreams than are women (¢ test, P< .02). 

in ties Manner in which men and women differ 
n thes daydreams can be seen in Tables 1 and 2. 
combing tables the two raters’ codings have been 
wice ned so that the totals which are given are 
Were fo large as they would be if each rater 
able aered separately. As can be seen from 
lew Ey the most frequent need drive in men 3s 
as fe ice. For men having New Experience 
quent P primary need drive, the two most fre- 
fees ail areas in which this need drive is 
erco are Adventure or Travel and Sexual 
urse. The need drive in women which is 


TABLE 1 


ELAT 

IVE F 

VE FREQUENCIES AND RANK ORDER OF NEED 
Drives IN MEN AND WOMEN 


N Men Women 
eed drives T x 
otal Rank Total Rank 
aseme 
4 ievement 5s 10 ee: 
<r poto omi 
Aug ion i : o 12 
Domjan á a 2 9.5 
New pence E 8.5 2 9.5 
Nure \Petience 66 L 13 4 
Recognise 2 n5 7 6 
Secu tion 2 27 2 
Usion, 50 
Curity 12 7 4 7 
orang, 15 5 A : 
2 n5 
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the most frequent by a wide margin is Affiliation. 
For women having Affiliation as their primary 
need drive, the most frequent content area in 
which this need drive is expressed is Marriage 
or Family. 

Although large differences between men and 
women exist in frequencies of specific need drives 
(for example 50.6% for Affiliation in women as 
compared to 10.5% for Affiliation in men), the 
relative frequencies of the various need drives 
are fairly similar in men and women. The rank- 
difference correlation between the relative rank- 
ings of need drives in men and women is .71. 
This correspondence, however, does not hold for 
content areas where the correlation is only .11. 
The marked difference in rank for content areas 
between men and women is particularly evident 
in the case of Marriage or Family, Adventure 
or Travel, and Money or Possessions. 

There are several interesting similarities and 
differences in the data reported by Shaffer and 
Shoben and that of the present study. Both 
studies find Money or Possessions to have a 
high frequency, but sex differences are much 
greater in the present study. In the Shaffer and 
Shoben data Physical Appearance ranks fairly 
high, especially in women, whereas in the present 
study it has the lowest rank for both men and 
women; also sexual daydreams have a much 
lower frequency in the present study. These 
variations could have resulted from differences in 
method, population, or the manner of classifying 
the daydreams. 

Chi square tests for differences in need drive 
and content area for the raters were made be- 
tween groups of men and women who were rated 
high and low on the variables of Acceptance, 
Frequency, and Variability; those rated as 1 or 
2 were compared with those rated as 4 or $. 
In these comparisons the following two statis- 
tically significant relationships were noted either 
as the result of combining chi square tests for the 
two rater’s judgments or as the result of the chi 
square tests being independently significant and in 
the same direction. 

In men there is a difference in content-area 
between those who are high and low on Vari- 
ability (p < -01). Men who are low in Variability 
daydream more in the content-areas of Adventure 
or Travel, Money or Possessions, and Sexual 
Intercourse than do men high in Variability. Men 
high in Acceptance daydream more than those low 


3 The writer will send to the reader upon request 
otocols of representative daydreams 
d tables showing the interrelation- 
and content areas among 


mimeographed pr 
and mimeographe' 
ship between need drives 
men and women. 
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TABLE 2 


RELATIVE FREQUENCIES AND RANK ORDER OF CONTENT AREAS IN MEN AND WOMEN 


Men Women 
Content areas : Total Rank Total Rank 
Adventure or travel 37 2 3 10 
Death, illness, or serious 2 10 10 6 
accident 
Marriage or family 11 9 81 1 
Money or possessions 40 1 8 7 
Physical appearance 1 11 i} 11 
Pre- or extra-marital 27 5 28 2 
Tomance 
Recreation or hobbies 13 8 4 8.5 
Sexual intercourse 24 6.5 4 8.5 
Varied content determined 24 6.5 13 5 
by life situations 
Varied content secondary 36 3 17 4 
to need drive 
Vocation 31 4 20 3 


in Acceptance in the content-areas of Adventure 
or Travel, Pre- or Extra-Marital Romance, and 
Sexual Intercourse; those low in Acceptance day- 


dream more frequently in the area of Money or 
Possessions (p < .01). 


Personality Differences 


In order to determine the manner in which 
need drives and content areas are related to 
differences on the various scales of the MMPI, 
the following comparisons were made. For each 
of the scales, and for both the men and women 
considered separately, those individuals whose 
scores were at least one half a standard deviation 
below the mean were compared with those whose 
scores were at least one half standard deviation 
above the mean. Relatively few of these com- 
parisons were statistically significant, and in 
view of the large number of comparisons which 
were made, it may well be that all of the dif- 
ferences are a function of chance, 

The chi square tests which were significant 
(p < .05) were three related to need drive and 
one related to content area in the case of men and 
two related to content area in the case of women, 
On the Depression Scale, the men who scored 
high had more frequently the need drives for 
——— and Security, whereas among those 
who scored low, the need drive for New Experi- 
ence was more frequent. On the Hypomania 


Scale, the need drive for Autonomy was rae 
frequent among the men who scored eae 
whereas the need drive for New Experience 
more frequent among those who scored a ant 
the Social Scale, Recognition was more fred New 
among men in the high group, whereas an 
Experience was more frequent in the low £ ofile 
On a similar comparison for the psychotic Pt age 
(Sc and Pa) the content-area of Money of 
Sessions was more frequent among the men in 
low group (p < .01). + abate 
For the women the two scales producing $ con- 
tically significant differences with respect to e 
tent-area were the Depression Scale an 


‘ho 
Interest (Mf) Scale. When the women scale 
scored high on the D Scale and on the Mf the 


were compared with those who scored low, ore 
content area of Marriage or Family was ine 
frequent among those who scored low on 

of these scales. 
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RELATIONSHIP BETWEEN TIME IMAGERY AND RORSCHACH 
HUMAN MOVEMENT RESPONSES 


RONALD B. KURZ 


Syracuse University 


In order to test the hypothesis that M is related to the slow-fast dimension 
of the time concept, 46 female undergraduates were given the Harrower 
Multiple-Choice Rorschach Test and the Time Metaphor Test in a group 
administration. Correlations between M and the ratings of preferences for the 
metaphors as satisfying descriptions of time revealed that the Ss who produced 
relatively few M preferred swift time images while Ss who perceived larger 
numbers of M found the slow or static time images more satisfying. The 


results were considered support for th 


e notion that M, as a measure of 


capacity for delayed need satisfaction, is inversely related to the level of 
motivation for the rapid passage of time. 


Ee perception of moving human beings (M) 
PS the Rorschach has been related to impulse 
Ontrol and the ability to delay the immediate 
Stratification of impulses (Rapaport, Gill, & 
chafer, 1046, p. 214). A number of studies 
ave attempted to clarify this relationship and 
= e demonstrated that subjects who produce 
atively large numbers of M tend to have 
eo control over overt motor activity 
Inger & Herman, 1954; Singer, Wilensky, & 
ra caven, 1956) and expression of affective 
Sponses (Meltzoff & Litwin, 1956). Apparently 
= Person who has more M in his Rorschach 
pu has greater capacity than the person who 
and ew M to inhibit expression of his impulses 
to delay the satisfaction of his needs. 

N concept of delayed gratification has also 
ieg related to time concept or time sense. 
twi pon (1961) has shown a relationship be- 
Stient impulse control, time estimation and 
otha oe and delinquent behavior. Siegman 
u that a group of delinquents had a shorter 

t pe time perspective and lower time estima- 
at Score than a group of nondelinquents, and 
tria oE the delinquents there was & positive 
rs ation between the subject’s future time 
i aah and their scores on a task of motor 
rae control, Le Shan’s (1952) finding that 
a children had shorter future time 
finds es than middle-class children and the 
Davide of Barndt and Johnson (1955) and of 
chi,’ Kidder, & Reich (1962) that delinquent 
th Ten have shorter future time perspectives 
Nondelinquents may be considered further 


evidence for the relationship between time vari- 
ables and the ability to delay the immediate 
satisfaction of impulses. 

In the present study an attempt was made to 
investigate the relationship between the delayed 
gratification aspect of M and the slow-fast 
dimension of the time concept. Taking a motiva- 
tional approach (Siegman, 1961), it may be 
assumed that the person who has difficulty in 
delaying the immediate satisfaction of his needs 
for the sake of some other and more distant 
goals is highly motivated for time to pass 
quickly. Waiting is uncomfortable for him; time 
must pass swiftly so that he may be gratified 
without excessive delay. Conversely, it may be 
assumed that the person who is well able to 
withstand delays has little need for time to pass 
rapidly. The difference in motivation should be 
reflected in the ways a person thinks about or 
conceptualizes time. Specifically, the person who 
has little capacity for delayed gratification ought 
to find it more satisfying to think about time 
in rapid-moving terms than in slow terms while 
the person with relatively more capacity for 
delayed gratification should find the slower terms 
more satisfying. In regard to the Rorschach, 
one would expect persons who produce relatively 
few M to prefer to conceptualize time in swifter 
terms than persons with relatively large num- 
bers of M. 

Recent work by Knapp and Garbutt (1958) 
has resulted in the development of a set of 
metaphors or images concerning time called the 
Time Metaphor Test, by means of which the 
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TABLE 1 
CORRELATIONS OF RATINGS OF THE PREFERENCE FOR EACH METAPHOR WITH M 


Metaphor Correlation with M Metaphor Correlation with M 
a galloping horseman +.48** a devouring monster a 
a fleeing thief +.40** budding leaves = A 
a fast moving shuttle +.35* an old woman spinning —.0 
marching feet +.31* a stairway leading upwards — 04 
a speeding train +.30* a string of beads —.06 
a whirligig +.29* a winding spool —.07 
a bird in flight +.24 an old man with a staff -Al 
a dashing waterfall +.19 a quiet, motionless ocean - 2, 
a tedious song +.03 a road leading over a hill x 
a space ship in flight +.01 a burning candle =e 
a vast expanse of sky 00 wind-driven sand =ar 
a large revolving wheel .00 drifting clouds — 34 
a massive glacier 00 
*p <.05. 
** p < 01 
subjects may indicate their most satisfying and alone was not considered M. For example, on gai 


least satisfying ways of conceptualizing time. 
This technique was used in the present study to 
measure the degree to which the subjects pre- 
ferred slow or rapid images of time. It was 
predicted that the subjects with relatively few 
M would find swift images more satisfying as 
ways of conceptualizing time, while the subjects 
who perceive relatively large numbers of M 
would find the slower images more satisfying. 


METHOD 


In a single group testing session, 46 female under- 
graduate psychology students were given the 
Harrower Multiple-Choice Rorschach Test (Har- 
rower & Steiner, 1945), and the Time Metaphor 
Test (Knapp & Garbutt, 1958). 

The Rorschach Test was administered by pro- 
jecting slides of the 10 blots on a screen in a semi- 
darkened room. Each subject had an answer booklet 
which listed 30 possible responses for each blot. 
The 30 choices for each blot were divided into three 
groups of 10 choices each. The instructions required 
the subjects to study the blot, read through each of 
the three groups of answers for the blot, then to 
underline one answer in each group which they 
thought was the best description of the inkblot or 
any of its parts. The subjects were required, there- 
fore, to underline three answers for every blot. 

Each blot was exposed for 3 minutes. The Rorschach 
a were scored for M in the manner prescribed 
te k, Beck, Levitt, and Molish (1961). That 
by Beck, hoites which clearly indicated human 
2 ing ee considered M choices. Human content 


II “Two people playing pat-a-cake” was score 
but “Two clowns” was not. next. 
The Time Metaphor Test was administered hic 
This test consisted of 25 images (see Table 1) a 
might be considered poetic descriptions of pe 
(Knapp & Garbutt, 1958). The subjects were P 
sented with the list of 25 metaphors, precede 

the following instructions: 


Below are listed 25 phrases which might be m 
ployed by a poet or a writer to symbolize ug! 
sense of time. I should like you to read piar 
this list of phrases and then indicate before ¢ in 
how appropriate you think this phrase 15 


5 5 jrst 
evoking for you a satisfying image of time. oe 
select the five phrases that seem to you ZUA 


satisfying and before each place the number ing 
Then pick out the next five most satisty! on 
Phrases and before them place the number the 
Continue this process until you have place ases 
number “5” before the five least satisfying PP" 

in your opinion. 


RESULTS 
the 


The mean number of M responses on és. 
group was 4.0, with a standard deviation of 1°” 

In order to test the hypothesis that ges, 
related to the swiftness of the time ar 
product-moment correlations for the rable | 3 
each metaphor with M score were comp in 
These correlations are given in Table is 8 
ranked order, It can be seen that there witt 
marked tendency for images involving, 3 M; 
Movement to be correlated positively with 
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while static images and those involving slow 
movement are correlated negatively with M. 
To clarify possible confusion about the positive 
and negative correlations in the table, it should 
be pointed out that because of the procedures 
used by the subjects in responding to the met- 
aphors, the least preferred metaphors have the 
highest numerical values. A positive correlation 
therefore indicates that high M people placed 
the swift metaphors lowest in rank order. In 
other words, the subjects who gave relatively 
More M on the Rorschach preferred to describe 
ae by using static or slow-moving images 
ease the subjects with relatively few M pre- 
erred the images involving rapid movement. It 
should be noted that all of the top 10 metaphors 
gel motion and that perhaps only 2 of 
fick marching feet” and “a tedious song,” do 
sibl express swiftness. Among the last 10, pos- 
in Yy only “an old woman spinning” and “a wind- 
Pa Spool” express rapid motion, while the rest 
cor either static or slow. Chi square with Yates 
or rection, comparing swift images with slow 
Pod pein images among the 10 most positively 
W 10 most negatively correlated metaphors, 
aS 5.0, p< 05. 
tte a result of a factor analysis of the 25 
aphors, Knapp and Garbutt (1958) described 
ele distinct clusters. In the present study the 
iny ‘onships between these clusters and M were 
estigated. The first cluster, termed by Knapp 
Mice it the Dynamic-Hasty Cluster, was 
is zed by images of rapid movement. 
cluster included the following metaphors: 
loping horseman, a fleeing thief, a fast 
ashi shuttle. a speeding train, a whirligig. a 
ing waterfall, a space ship in flight. When 
fan ratings by each subject for this cluster 
„53 taphors were correlated with M, the r was 
Th $ <.001. 
asiy second cluster, called the Naturalistic- 
o Tee Cluster, was characterized by references 
Sisteg Ure and lack of rapid motion, and con- 
Clouds of the following metaphors: drifting 
hi.” “ind-driven sand, a road leading over a 
a Re quiet, motionless ocean, budding leaves, 
Corret, PANSE of sky. The product-moment 
on lation between M and the median ratings 
he Second cluster was —.41, p<0l 
Cluste third cluster, named the Humanistic 
Surrogat seemed to involve human or human 
Sludeg e figures or human artifacts, and in- 
tevolyi the following: a tedious song, 2 large 
Wom as Wheel, a devouring monster, an old 
Spinning, a string of beads, a winding 


d 


spool, an old man with a staff, a burning candle. 
Median ratings on the Humanistic Cluster cor- 
related .00 with M. 


. Discussion 


The data of the present study seem to support 
the hypothesis that the subjects who report 
seeing relatively more human movement in ink 
blots prefer to conceptualize time in slow or 
static terms, while those who report seeing 
relatively little human movement prefer to con- 
ceptualize time in terms of swift movement. 
These data appear to be related to the findings 
that M is related to the ability to delay im- 
pulse gratification (Meltzoff & Litwin, 1956; 
Singer & Herman, 1954; Singer & Spohn, 1954; 
Singer et al., 1956). The high M person is 
capable of delaying satisfaction of his needs for 
relatively long periods of time without undue 
discomfort. For him, the rapid passage of time 
is unimportant in the eventual gratification of 
his needs and he finds it satisfying to think of 
time as stable or moving slowly ahead. How- 
ever, the orientation of the low M person is 
one of quick sequences of tension and relief. 
He must achieve gratification of his impulses 
within a brief period of time and he prefers to 
conceptualize time as passing rapidly so that 
he can be released from discomfort quickly. 

An alternate explanation of the data which 
does not take account of the slow-fast dimension 
could be made by referring to the “poetic” 
quality of the metaphors. It could be maintained 
that the swift images are less poetic or imagina- 
tive than the slow or static ones. Since the sub- 
jects who produce numerous M are said to be 
more creative or imaginative (Rorschach, 1942), 
they would naturally prefer the slow or static 
metaphors. However, this explanation does not 
seem to be supported by the finding of zero 
correlation between M and the images of the 
Humanistic Cluster which appear to be as poetic 
as those in the Naturalistic-Passive Cluster. 

The lack of relationship between M and the 
Humanistic Cluster is itself a surprising finding. 
It would be expected that the subjects who show 
sufficient interest in human activity to perceive 
it in meaningless blots would also show interest 
in describing time in similar terms. However, 
the data suggest that the slow-fast dimension of 
time is probably the most important source of 
common variance between M and the metaphors 
and that interest in human activity 1s unimpor- 
tant. Further investigation of the relationship 


276 


between M and other time variables, such as 
time estimation and future time perspective, is 
now being carried out. 

The exclusive use of women subjects in this 
study raises the question of the generality of 
the results. Certainly if men had been available 
for and used in the study, limits in the ability 
to generalize to both sexes would not be a 
problem. Although the impression of the writer 
(and apparently that of many other husbands 
who have waited for their wives) is that the 
sexes do differ in their attitudes and motives 
concerning time, the absence of empirical 
findings regarding this point leaves the question 
of generality an unanswered one, 
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CHARACTERISTICS OF HOMOSEXUALLY INVOLVED 
INCARCERATED FEMALES +* 


WILLIAM G. MILLER anp THOMAS E. HANNUM 


Iowa State University 


Cau purpose of this study was to examine 
erences between a group of homosexually 
vol ved and a group of nonhomosexually in- 
ved female prisoners. 
ane Prisoner was included in the homosexual 
enga ce if it were certain that she was actively 
Bre, in homosexual relationships while in- 
it we ated and in the heterosexual group only if 
Sie certain that she had avoided any homo- 
ack jelationships during incarceration. Due to 
rue hy any heterosexual choice, the degree of 
omosexuality could not be determined. 
© data were collected from testing, case 
ty data, and counseling interviews. The inter- 
k and group selections were made by the 
ologist regularly employed at the institution. 
Multi, P ata included scores from the Minnesota 
Ence R asic Personality Inventory, Kuder Prefer- 
Otis €cord, Wide Range Achievement Test, and 
Touting eck Scoring Mental Ability Test, all 
tion © 7 administered upon admission. In addi- 
askeg o estions pertaining to homosexuality were 
and many items of a biographical nature 
aken from the case history. 
tween actor analysis using intercorrelations be- 
temp, People on variables was pursued. An at- 
"ational made to cluster the jntercorrelations 
8touns Y into homosexual and heterosexual 
dence | This analyses failed to provide any evi- 
Of factors common to either group an 


histo 
View. 
Psyc 


Were ¢ 


l 
An 
wine qatended report of this study may be ob- 
part Ithout charge from Thomas E. Hannum, 
mee a of Psychology, Iowa State University, 
ition S or for a fee from the American Docu- 
tint a stitute. Order Document No. 7398 from 
D% Servi lary Publications Project, Photoduplica- 
to) S A Library of Congress, Washington 25; 
aries an it $2.50 for microfilm or $1.75 for photo- 
Dlication Make checks payable to: Chief, Photo- 
n Service, Library of Congress. 


was not pursued further. Rather the means of 
each group in each variable was computed for 
further description and analysis. 

Comparison of the groups on the variables 
indicated a high degree of similarity in almost 
all comparisons. The MMPI mean T scores, al- 
though generally high, were not significantly dif- 
ferent. The MMPI profiles, coded by Welsh’s 
method, provided interesting but nondifferenti- 
ating information. 

A chi square test of the degree to which the 
groups differed in the distribution of occurrence 
of the 10 MMPI scales did indicate a significant 
difference existed in the case of Scale 1, the 
Hypochondriasis scale, the heterosexual groups 
scoring higher. 

The comparisons of Kuder scores and profiles 
indicated greatest disparities were on the Social 
Service scale with the heterosexuals scoring 
higher and the Music Scale with the homosexuals 
scoring higher. 

Examination of the remainder of the variables 
furnished little to differentiate the groups except 
for two direct questions concerned with the in- 
volvement in and approval of homosexual rela- 
tions. The fact that the homosexually involved 
persons stated their approval does not, of course, 
aid us in diagnosing homosexuality in persons 


reluctant to provide such information. 

Though none of the variables incorporated in 
this study differentiated the homosexual group 
from the heterosexual group to a degree suf- 
ficient for individual diagnosis, it is possible that 
other psychological measurement devices could be 
found to be more discriminative. It is also quite 

ossible, or even probable, that homosexuality is 
such a complex characteristic that no one such 
device will be found. A combination of several 


less discriminative measures may have to suffice. 
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FANTASIZED DANGER AS A FUNCTION OF PARENT-CHILD 
CONTROLLING PRACTICES? 


DONALD W. TIFFANY anp FRANKLIN C. SHONTZ 


University of Kansas 


Rogers (1959) proposed a “continuum which 
reaches from rigidity and fixity of psychological 
functioning on the one hand, to psychological 
flow and changingness on the other” (p. 96). 
Previous work by the authors reported an attempt 
to operationalize this continuum through the 
measurement of experienced control (C) in pre- 
adolescents via the Picture Q Technique. 

In the present study, degree of C served as 
the criterion for subject grouping. High C scores 
represented the restricted and rigid end of Rogers’ 
psychological continuum, while low C scores 
represented the fluid, free, and comfortable end. 
It was hypothesized that children who experi- 
ence high C (rigidity and fixity) reveal a 
characterstic mode of response to the environ- 
ment, viz., they evidence a basic distrust and fear 
of unfamiliarity, 

The subjects were 22 
randomly selected from a Midwestern grade 
school. They were administered the Picture Q 
Technique (PQT) and a fantasy task called 
Mother Hubbard. The PQT consists of 48 drawn 
parent-child interactions. There are two pictures 
of each specific parent 2-child interaction, in one 
the parent is male and in the other, female. The 
pictures range from extreme permissiveness of 
parent-child interaction to parental control of the 
child, such as inhibiting the child's behavior. The 
child is asked to describe himself by Q sorting 
the pictures on a 5-point scale. The weight as- 
signed to each Picture is determined by its 
placement. Total C scores are computed for the 


preadolescent boys 


1An extended report of this study may be ob- 
tained without charge from Donald W. Tiffany 
(Psychology Department, University of Kansas, 
Lawrence, Kansas) or for a fee from the American 
Documentation Institute. Order Document No. 7458 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress, Washington 
25, D. C. Remit in advance $1.25 for microfilm or 
$125 for photocopies, and make checks payable to: 
Chief, Photoduplication Service, Library of Congress. 

2The term “parent” also refers to parent sur- 


rogates. 


24 C pictures by summing the weights assigned 
to these pictures. r 

For the Mother Hubbard task the w 
was given 5 minutes to think of and write ans 
the consequences of Mother Hubbard fin : 
her cupboard bare. The responses were ya 
on 4 X 6 inch cards, shuffled, and given to t a 
clinical psychologists (judges) with ihe i 
tions to rank order the cards on the basis 0 ie 
degree of physical danger expressed in © 
response. i 

The 11 children with high C scores (me 
58: range 56-65) formed one group while 54: 
11 children with low C scores (median aie 
range 54-55) formed the other. The groups vier 
closely comparable in age, intelligence, and s¢ 
grade. 

Kendall’s coefficient of concordance for ee 
samples (df=21) for the judge’s soa wen 
the Mother Hubbard responses was signi 
(W = 67; p < 01). r 

The Mann-Whitney U test was used a 
the independence of the two groups on me 
of the weights assigned to the Mother u the 
responses by the judges. It was found man 5; 
two groups differed significantly (U = orts 
$ < .05; two-tailed test). This finding STE sort 
the hypothesis that children whose se fron? 
reveals high parental C fantasize more pe ae 
mental danger than children whose P the 
reveals low parental C. Confirmation tention 
hypothesis lends support to Rogers’ contem 
that degree of permissiveness in parent- 
interaction is a significant determinant pe me 
child’s perceptions of himself and his env 
ment. 


est 
sis 
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CROSS-VALIDATION OF STEIN’S SYMBOL-GESTALT 
TEST FOR BRAIN DAMAGE + 


OSCAR A. PARSONS,? HARRIET I. MASLOW 
University of Oklahoma School of Medicine 
axb KENNETH D. STEWART 
University of Nebraska 


Stein (1961) recently reported highly success- 
ul discrimination of brain damaged subjects 
rom non-brain-damaged subjects by use of a 
rapid, easily administered, age-corrected, per- 
Vaal ator task, the “Symbol-Gestalt Test.” 
While the test has been used by Stein on a 
number of samples in prepublication studies, 
oe may be raised concerning the generaliz- 
Aes of Stein’s findings to different populations, 

miners, and locales. 
ae the course of a series of studies of brain 
of ee it was possible to test the applicability 
dra: $ Symbol-Gestalt Test to two populations 
tr wn from widely separated Veterans Adminis- 
ation hospitals, Durham, North Carolina and 

lahoma City, Oklahoma. 
on e population for the Durham study was 

i prised of 20 clearly diagnosed cortically brain 
Hr patients (BD) and 20 non-brain- 
nee patients (NBD); the Oklahoma City 
atic, consisted of 20 BD patients and 30 NBD 
Bro ents, The diagnoses of patients in both BD 
ae fell mainly in the CVA, tumor, or 
iái matic head injury categories. The NBD had 
D Bnoses predominantly of neurosis Or psycho- 
oe to ogical reaction, All patients were male 
ara alized veterans with a mean age of 41.5 
e S and a mean education level of 9.7 years; 

Stoups were equated on these variables. | 
rati € Symbol-Gestalt Test followed the adminis- 

on of the Vocabulary subtest of the WAIS. 
t: ia An ext 


aineq ended report of this study may be ob- 


Versity without charge from O. A. Parsons, Uni- 
treet of Oklahoma Medical Center, 800 N.E. 13th 
fron? Oklahoma City 4, Oklahoma, or for a fee 
Documes American Documentation Institute. Order 
Ment No, 7459 from ADI Auxiliary Publi- 
on Project, Photoduplication Service, Library 
Vance eek Washington 25, D. C. Remit in ad- 
anq Te for microfilm or $1.25 for photocopies 
tion ake checks payable to: Chief, Photoduplica- 
eS, Library of Congress. 
Rese: 4 p aa was supported, in part, by PHS 
Aut tee Grants B-1459 and B-2507 to the senior 
iseases rom the National Institutes of Neurological 
2 Ose and Blindness, USPHS. 
alsog Sar A. Parsons and Harriet I. Maslow are 
Aq jaltiliated with the Oklahoma City Veterans 
nistration Hospital. 


The test protocols were scored according to 
instructions appearing in the manual; inter- 
scorer reliability was 95%. 

The results in both studies indicate a clear- 
cut difference between the performance of the 
BD and NBD. In the Durham population the 
BD attained a mean score of —.668 and the 
NBD a mean score of —.042 (¢ = 2.56; 
p < 02). In the Oklahoma City study the BD 
attained a mean score of —.482 and the NBD 
a mean score of —.059 (t= 4.23; p< 001). 
Stein utilized a cutoff point of zero to differenti- 
ate BD subjects from NBD subjects: positive 
scores (+) fall in the NBD category and 
negative scores (—) in the BD category. It 
should be noted the four mean scores in the 
present study carry a negative sign. Therefore, 
the application of Stein’s cutoff point to these 
populations results in a high percentage of false 
positives (31%). 

In attempting to account for these results, it 
is important to note that while the groups used 
in our study were in many ways comparable to 
Stein’s, e.g, Veterans Administration patients. 
sex, age, diagnoses, etc., their educational level 
was lower by approximately 1.5 years and IQ 
by 14 points. In the Durham BD group, Stein 
performance is not correlated with either educa- 
tion or vocabulary. However, in the three other 
groups the correlations are positive and signifi- 
cant (four of six correlations are above +.69). 
Correlations of this magnitude suggest that 
education and IQ play a greater role than Stein 
has emphasized. 

These findings indicate that the Symbol- 
Gestalt Test is sensitive to impairment associ- 
ated with cortical brain damage. However, 
standardization of the test on populations which 
have a greater range of education and intel- 
ligence is indicated before it can be used as a 
clinical tool with diagnostic power. 
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EFFECT OF INTELLECTUAL LEVEL AND NEUROPSYCHIATRIC 
STATUS ON THE DIVERSITY OF INTENSITY OF 
SEMANTIC DIFFERENTIAL RATINGS? 


CHARLES NEURINGER 
University of North Dakota 


The hypothesis that a person’s capacity to 
perceive concepts in a varied manner (as meas- 
ured by the diversity of semantic differential 
ratings) is related to intellectual level Was evalu- 
ated by Ware (1958) with disappointing results, 

Since previous research seemed to indicate that 
utilization of intensity rating spaces appears to 
be related to the emotional state of the indi- 
vidual, it was felt that further exploration of 
the relationships between intellectual level, 
neuropsychiatric disposition and diversity of 
semantic differential intensity rating spaces would 
prove to be fruitful. 

An 18 concept, 9 scale semantic differential, 
and the Information Subtest of the Wechsler- 
Bellevue Intelligence Scale, Form I were admin- 
istered to neuropsychiatric hospital patients 
(N =16) and to normal hospitalized patients 
(N= 16). The subjects were matched within 
+1 or —1 correct responses on the Information 
Subtest in order to control for intellectual level 
when examining the effects of neuropsychiatric 
status on intensity diversity of semantic dif- 
ferential ratings. The extreme, moderate, mild 
and neutral rating spaces on each scale were 
assigned a value of 3, 2, 1, and 0, respectively. 
The mean standard deviation of the intensity 
ratings were computed for each subject from the 
162 scale judgments made by that subject. The 
greater the value of the mean standard deviation, 
the greater the amount of intensity diversity of 
ratings on the semantic differential, 

A ¢ test of the mean standard deviation dif- 
ferences between the normal and the neuro- 


1 An extended report of this study may be ob- 
tained without charge from Charles Neuringer 
(Psychology Department, University of North 
Dakota; Grand Forks, North Dakota) or for a 
fee from the American Documentation Institute. 
Order Document No. 7456 from ADI Auxiliary 
Publications Project, Photoduplication Service, 
Library of Congress; Washington 25, D. C. Remit 
in advance $1.25 for microfilm or $1.25 for photo- 
copies, and make checks payable to: Chief, Photo- 
duplication Service, Library of Congress. 


psychiatric groups (1.16 and .85, respectively) 
yielded a value of 2.844 (significant at the ‘ A 
level). A Spearman Rho between the SE 
standard deviations and Information Sub 07 
scores of the 32 subjects was found to be ore 
(significant at the .72 level). The subjects W lit 
also divided into two groups via a median oe 
of the Information Subtest scores. The ri 
standard deviations for the above and be yy 
median groups were 1.04 and .97, respec oye 
A ¢ test of the differences yielded a value ae 
-583 (significant at the .55 level). The ge 
procedure was followed with the upper whe 
lower quartiles on the Information Subtest. re 
mean standard deviations for the upper 
lower quartile subjects were 1.03 and 312 
respectively. A ¢ test yielded a value of x i 
which was significant at the .76 level of e 
dence. It would appear that diversity of inten 6 
of semantic differential ratings seems to eñt 
related more to neuropsychiatric involvem 
than to intellectual level, ewaiyivente 
Although one would expect that intellig A 
should be related to the ability to perceive ity, 
produce scores reflecting diversity of ae y. 
this does not seem to be the case in this $ at 
It is strongly felt here that failure to find Sam 
a relationship may be due to the error of fa 
sidering intelligence as a unified entity and ting 
accepting single intelligence scores ag. Tella 
that unity. The effects of emotional disturba 
on perception of intensity diversity may his 
related to the neurotic’s need to organize ane 
world in a simple manner, making it more ES 7 
ageable. In this sense, lack of intensity ave 
cation may be related to the same dynamics 
produce rigidity in perception and thinking. 


, 
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DIGIT SPAN PERFORMANCE AS A FUNCTION OF 
NOXIOUS STIMULATION * 


SANDRA PYKE ayp NEIL McK. AGNEW 


University of Saskatchewan 


Easterbrook (1959) proposed that increased 
drive would reduce range of cue utilization. Such 
a reduction will facilitate performance on simple 
tasks (those involving a small number of cues) 
and impair performance on complex tasks, lead- 
ing to a drive by task complexity interaction. 

Seventy-five female university students were 
assigned at random to three equal groups. All 
Subjects were given two testings, on alternate 
forms, of the digits forward and digits backward 
tests. Group I was given a first testing under 
normal conditions with no shock or threat of 
Shock but, just prior to the second testing, elec- 
trodes were placed on the fingers of the non- 


Preferred hand and a 100-volt, 30-millisecond 


Shock was administered. Group II was shocked, 
en given a first testing followed by removal 
y the electrodes and a second testing with no 
3 Ock or threat of shock. When performing under 
ock condition, in addition to receiving a sample 
shock, experimental subjects were informed that 
Poor performance would lead to further shocks. 
aoup, III served as a control group to provide 
estimate of improvement due to practice. 
mi he Taylor Manifest Anxiety Scale was ad- 
ion tered to all subjects. To monitor the dura- 
ee ed the effects of the shock and nonshock 
: itions, continuous heart rate recordings Were 
‘ken from an additional 17 subjects not part 


e original sample. i : 
Pre Coring, (a) The number of digits in the series 
is ceding the one in which the individual made 
(b) E mistake was the number correct sor 
Tep he sum of the number of series incorrectly 

fated for each testing was the error score. 

1 
tainen extended report of this study may be a 
Psyleho Pout charge from Neil McK. Agnew, 
Saska Osical Research Center, 1214 College Drive, 

ene Saskatchewan, or for a fee from the 
No ican Documentation Institute. Order Document 
Proje 7460 from ADI Auxiliary Publications 
Bres cb Photoduplication Service, Library of Con- 
ere ashington 25, D. C. Remit in advance 
Make or microfilm or $1.25 for photocopies, and 
Ser ic checks payable to: Chief, Photoduplication 


& Library of Congress. 


Group I (nonshock, shock) and Group II 
(shock, nonshock) provide data suitable for a 
switchback analysis (Brandt, 1938) which tests 
for combined stress effects on first and second 
testing against combined nonstress effects on 
first and second testing. Only in the case of 
digits forward number correct was a signifi- 
cant high stress impairment obtained (t = 2.98, 
p < .01, df 48). Using an analysis of covariance 
procedure, there was no evidence of stress ef- 
fects on second testing on any score. In a 2 X 2 
analysis of variance high anxiety subjects 
show significant impairment in both stress and 
nonstress conditions only on the digits forward 
data, number correct, and only on first testing. 

Using a Friedman two-way analysis of vari- 
ance, it was found that shock, plus threat of 
shock, significantly increased heart rate during 
performance (p < .001, k= 4, n= 10), regard- 
less of whether the stress condition was intro- 
duced on first or second testing. 

The limited effects of anxiety, whether defined 
in terms of noxious stimulation or questionnaire 
scores, raise some questions regarding the clinical 
application of the digit span tests. 

The results obtained suggest a drive by prac- 
tice interaction since, when shock was intro- 
duced after some practice on the task, no con- 
sistent evidence of a drive or an anxiety effect 
appeared. Assuming that a practiced task is a 
less complex one, the findings are in keeping 
with the proposed drive by task complexity 
interaction. Increased drive may impair per- 
formance only when a task is complex or is 
relatively unpracticed. 
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FURTHER SUPPORT OF SOME FINDINGS ABOUT THE 
CHARACTERISTICS OF SMOKERS AND NONSMOKERS 


BRUCE C. STRAITS 


University of Chicago 


Smoking and nonsmoking are alternative and 
opposite behaviors of sufficient complexity and 
potential social importance to be of increasing 
interest to social scientists. A recent review of 
research findings relating to smoking behavior 
pointed succinctly to the psychological aspects 
of the problem as a fruitful area for research 
endeavor (Matarazzo & Saslow, 1960). It is the 
purpose of this paper to report some data which 
tend to support previously reported findings. 

Two questionnaires developed for use in the 
study consisted of a 270-item true-false “per- 
sonality inventory” comprised of items from a 
number of well-known personality scales plus 
items from a scale used to measure the extent 
to which an individual feels that what happens 
to him is dependent upon his own behaviors as 
opposed to the operations of chance (internal 
versus external control). The second question- 
naire was of a factual, biographical nature. The 
subjects for the study were 125 male college 
students initially, plus an additional 120 to check 
a near-significant finding concerning internal- 
external control. Of the original 125 subjects, 
54 identified themselves as smokers. 


The major findings may be summarized as 
follows: 


1. Smokers were significantly higher on the 
MMPI Pd scale and significantly lower on the 
Sr scale. 

2. Smokers were more “chance oriented” 
(p < .08) in the first sample, a finding which was 
consistent at the .10 level in the second sample. 

3. Consistently with previous research but not 
at a significant level smokers were found to have 
lower MMPI L and Si scores, 


4. Smoking was significantly related to both 


AND 


LEE SECHREST 


Northwestern University 


coffee and beer drinking as previously reported. 
However, the Pd and Sr scales were related only 
to beer, not to coffee drinking. Smoking was uF 
related, as might be expected from an ok 
indulgence hypothesis, to consumption of mi 
or soft drinks. r 

5. From the standpoint of biographical vari- 
ables smokers, when compared with nonsmokers, 
were found to: attend church less often; come 
more often from private preparatory schools; 
be more frequent among Catholics; be more 
likely to belong to social fraternities. One finding 
which was expected but not confirmed was that 
smokers were found to be more frequent 1M 
families in which both parents were reported tO 
smoke. 


The finding that smokers are more “chance 
oriented” than nonsmokers is particularly 10- 
triguing inasmuch as it offers some hint COn- 
cerning the manner in which heavy smoking MY 
be maintained in the face of strong evidence ts 
its danger. In effect the smoker may reduce th 
dissonance between his smoking and medite 
reports by adhering to the philosophy that all H 
is a gamble. g 

In general the findings are supportive of me 
vious descriptions of the typical smoker Ei 
convivial person lacking strong identification pA 
cultural standards, and of the nonsmoker as te 


at ease socially, sober, and imbued with a stroné 
ethic. 
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Brief Reports 


The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 


page and prepared according to the specifications 
given below. 


2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 


Documentation Institute to insure indefinite avail- 
ability, 


3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 


charge to all who request it as long as the supply 
lasts. 


4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 


Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 

To insure that the Brief Report will be no 
longer than one printed page, its typescript, 
including all matter except the title and the 


author’s lines, must not exceed 85 lines a 
eraging 42 characters and spaces in leng z 
Set the typewriter margins for short lines z 
42 characters, which are 3.5 inches DE x 
elite typing, and 4.2 inches long in pica. 

‘le keaciint of the Brief Report m 
be double spaced throughout. Except for 2 
short lines, it follows the standard style ia 
the 1957 revision of the APA Publicis A 
Manual. Headings, tables, and referie A 
avoided or, if essential, must be counte! aa 
the 85 lines. Each Brief Report must pa a 
companied by a footnote in the style be a a 
which is typed on a separate sheet and 
counted in the 85-line quota: 


b- 
1An extended report of this study may be ue 
tained without charge from John Doe (yee 
author’s full name and address) or for a fee sh 
the American Documentation Institute. Order tions 
ment No. — from ADI Auxiliary pubia cont 
Project, Photoduplication Service, Library © ~— 
gress; Washington 25, D. C. Remit in anam make 
for microfilm or $—— for photocopies, a cervic®s 
checks payable to: Chief, Photoduplication 
Library of Congress. 
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Extended report. Because the extended a 
port is intended for photoduplication, an tyle 
not copy to be sent to a printer, its a 0 
should differ in several ways from tha ort 
other manuscripts: (a) The extended TER 
should be typed with single spacing fig 
economy in duplication. (b) Tables 40 text 
ures should be placed adjacent to the d be 
which refers to them. A caption et al 
typed below each figure. (c) Footnotes $ hic 
be typed at the bottom of the page on WE 
reference is made to them. In other resp eci- 
the full report is prepared in the style SP 
fied by the Publication Manual. 
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PHOTOS PREFERENCE TEST (PPT) AS A MEASURE OF 
MENTAL STATUS FOR HOSPITALIZED 


PSYCHIATRIC 


BERNARD MEER 
Stockton State Hospital 


AND 


PATIENTS * 


ALBERT H. AMON 


University of North Carolina 


The Photos Preference Test (PPT) consists of 100 facial photographs, stand- 


ardized for like and dislike. This study at! 
responses to the photographs were relat 
newly admitted patients to 
retested 4 months later or 
controls were gi 
deviant responses than controls 
deviant responses than nonschiz 
less deviant responses on admissio 
more deviant responses (< .01); 


before then 


and (d 


deviant responses as they improve (p < 01), 


“a Much of the research with human faces as 
ee device stems from Murray’s 
and ) early study on the projection of fear 
fhuuliciousness and from the introduction 
1949. Szondi Test in this country by Deri 
eA Although the latter test was initially 
ca with a great deal of enthusiasm, re- 
studies have found little evidence to 


Su 
ae the assumptions underlying its use 
& povcimann & Klopfer, 1953; David, Orne, 


k pabinowitz 1953; Gordon, 1953; Gordon 
Kraus bert, 1954; Hurley, 1957; Mussen & 
ETN 1952; Short, 1954). Nevertheless 
oura nce with the Szondi Test has en- 
„aged investigators to find new ways of 


Sin š 
& pictures of human faces as projective 


tests 


sons, Tecwesting subjects to describe the per- 
(Glad & (Izard, 1952) or emotional state 
Photo Shearn, 1956) of the persons in the 
bout os; investigators have tried to learn 
Ore fy e personalities of the perceivers. A 
Staphs equent model for using facial photo- 
ests as a projective test is one that re- 
like gi inbjects to respond first in terms of a 
Sion D ike or favorable-unfavorable dimen- 
trib a then on a second occasion to at- 
Personality characteristics (Chambers, 

a grant from the 
ate of California, 
d here was 
et- 


T Thi 
Sebaran Y was supported by 
MAd the pè Of Mental Hygiene, St 
Tea rst of the two studies reporte! 
; © Western Psychological Association me 
an Jose, California, in April 1960. 


a state hospital were tested on adi 


tempted to determine whether deviant 
ed to psychological disturbance. 490 
mission and 
if discharged sooner. 369 “normal” 


ven the PPT. The results showed: (a) patients give more 

(p <.01); (b) schizophrenics give more 
ophrenics (p< .0001); (c) patients giving 
n are discharged sooner than patients giving 


) female patients show a decrease in 
while males do not. 


1957a; Flyer, 1951; Liggett, 1957), values 
(Fensterheim & Tresselt, 1953), and occupa- 
tional categories (DeRath, 1957) to the same 
photographs. An analysis is then made for 
characteristics attributed to the liked and 
disliked pictures in order to determine to 
which variables a subject reacts positively 
and to which he does not. A third model and 
one most closely related to the present study 
is that of obtaining normative reactions to 
the photographs and then measuring the 
deviations from the norms for different clin- 
ical groups (Chambers, 1957b; Eysenck, 
1954). 

In our work with a series of 100 pictures 
of human faces, we found marked individual 
differences in the number of pictures liked 
when groups of normal subjects were asked to 
give their first impression as to whether they 
liked or disliked each of the 100 photographs. 
Included in our sampling of normals were 
professional and nonprofessional hospital em- 
ployees, old people, college students, student 
nurses, labor groups, PTA groups, church 
groups, etc. No matter what comparisons 
were made between these groups, the stimulus 
value of the photographs remained relatively 
the same, that is, the rank order of the 
photographs in terms of a like-dislike dimen- 
sion changed very little from one group to 
the next. It is this high degree of consensus 
about the affective value of the various facial 
photographs that suggested the present study. 
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It is interesting to note that some years 
ago Bell (1948, p. 287), in reviewing the lit- 
erature on expressive movements, suggested 
the idea of constructing a test based on the 
social stereotypes that photographs provide. 
More recently Eysenck (1954) and Chambers 
(1957b) used this technique to differentiate 
normals from clinical groups. Eysenck, in 
fact, pointed out that it was the pictures 
which produce simple stereotyped responses 
that appear from a diagnostic point of view 
to promise more success than difficult non- 
stereotyped photographs. 

The fact that there is a great deal of agree- 
ment as to the affective quality of the photo- 
graphs indicates that the cues evoking these 
reactions are responded to by a great part 
of the general population. It is our proposi- 
tion that the failure to interpret these cues 
in the usual way may be indicative of the 
presence of a psychological disturbance. On 
this basis the following hypotheses were 
made: 

1. That hospitalized psychiatric patients 
will show more deviation in their affective 
reactions to the photographs than a “normal” 
control group. 

2. That patients on admission whose af- 
fective reactions to the photographs are more 
deviant from the standardization group will 
improve less quickly than those patients 
showing less deviation. 

3. That patients who qualify for dis- 
charge or indefinite leave of absence within 
4 months following admission will show a 
greater decrease in deviation from the stand- 
ardization group upon retesting than patients 
who do not so qualify. 


Photos Preference Test (PPT) 


The Photos Preference Test consists of 100 
facial photographs which are the full-face 
views from the booklets prepared by Camp- 
bell and Burwen (1956) for another purpose. 
The photographs include 10 pictures of each 
sex for four different age groups (adolescent, 
young adult, middle age, and old age). An- 
other group of photographs of 20 middle-aged 
men were included to increase the reliability 
of the test. The test requires that subjects 
give their first impression as to whether they 
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like (L) or dislike (D) each of the people 
in the photographs. The 100 pictures are 
in booklet form and can be administered in- 
dividually or to a group. 


Standardization Group 


Approximately 500 male and 500 female 
“normal” adults and adolescents were teste 
in Stockton, California, a community of about 
100,000 people. Vormal was defined as any- 
one not institutionalized at the time of test- 
ing. Subjects were asked to give their ag¢ 
education, and occupation but were told to 
omit their name. All testing was voluntary: 
A small percentage of subjects, about 570 
refused to take the test. A standardization 
group consisting of 200 males and 200 females 
was selected from the above subjects in suc 
a way that the distribution of the standar- 
ization group in terms of age (15 years a 
up) and education is similar to that of F 
1950 United States Census. This standardin 
tion group was used to determine the stimu 
value of each photograph. z 

To determine the stability of the stim ig 
values of the photographs, the 200 protoco 
for each sex were assigned to groups of 


ulus 


ge 

randomly, but constrained to the same a 

and education distributions. The a = 
c 


values of the photographs based on ea ake 
of 100 protocols were compared. The var 
order correlations were .95 for the two ™ u- 
populations and .96 for the two female PIP 
lations, confirming our informal aome 
about the reliability of the stimulus values n 
the photographs. Because we found consistan 
sex differences between males and Tema 
to certain age-sex categories, we decide! a 
retain separate norms for males and Tem 
despite the high overall correlation betwe 
them. 
Scoring the Photos Preference Test ee 
In this report the chief concern will 5 
with an overall congruence score, jects 
which indicates the similarity of a a - 
pattern of Ls and Ds to that of the stan 
ization group of his sex. r or 
First, e. scale values were obtained : 
each photograph, using a modification en 
method of successive categories (Torge 


ne 2 eC rSSe”~—“=s:SOOe eee 
eS a 


o — °°. 
sa 


PHOTOS PREFERENCE TEST 


1958). These values represent the degree to 
which each photograph was liked by the 
male and female standardization groups, 

respectively. 
$ For an analysis of the responses of each 
individual, a method similar to one reported 
by McReynolds (1951) was used. This 
method has the advantage of yielding a C 
aS which is independent of the proportion 
„Ls and Ds, and which provides for a 
petting of each deviation from the con- 
ensus, according to its degree of deviation. 
Braet assume a subject gives 40 Ds, he 
if We a score of 100 or perfect congruence 
Blots Ds are assigned to the 40 least-liked 
io Ographs. Any deviation from this pat- 
on ee his congruence score. A weighting 
Used oO for the extent of the deviation is 
nd = then total all the deviation scores, 
Dossible percentage this is of the maximum 
obtai e deviation, and subtract from 100 to 

un a final C score. 

reliable the C score tends to become less 
ikes e the more extreme the proportion of 
than and dislikes, subjects who gave fewer 
10 Ls or 10 Ds were not scored for C. 


Reliability 


ee Measures of reliability were obtained: 

96 fe e split-half reliability coefficient for 

Be È patients is .84, and (b) the test- 

Nurses reliability coefficient for 103 student 
» Corrected for restricted range, is .83. 


PROCEDURE 


Thi 

Was “eacpart covers two separate studies. Study I 
October ried out between November 1958 and 
an, g 1959, and Study II between June 1960 
2 repi ed 1961. Since the latter was an attempt 
tudy te the former, the procedure used in 
eht ift followed closely that of Study I. The 
ate “ferences that did occur will be mentioned 


Sup: 
ub jects 


Stu, d 
a new n For a period of approximately 6 months 
fe tal Mission patients at Stockton State Hos- 
ae ered Were no more than 50 years old were 
hadeg, Potential subjects for this study. Ex- 
ties, n Owever, were patients diagnosed as alco- 
4 arcotiç addicts, or as having any one of 


R Oia e 
na o disorders. Of the 134 male and 173 
or Patients who were available for the study, 
of the 


© of the males and 96 oF 55% 
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females met all criteria and gave protocols scorable 
for C. 

Following Study I, a group of approximately 60 
male and female new admission patients were given 
only the Photos Preference Test. The procedure 
was the same as in Study I except for the fact that 
this group of patients was told that the investigators 
were interested in how groups of patients as a whole 
responded to the test and were not interested in 
how any specific individual answered his test. 
Therefore, they were told to omit their name on 
the answer sheet. Since all normal subjects were 
given these instructions in contrast to the patients 
of the first study, it was felt necessary to determine 
how important was the effect of identifying oneself. 
The C scores and the number of like and dislike 
responses of these patients were compared with a 
group of patients from the first study, equated for 
age, education, and diagnosis. When no appreciable 
differences were found, the “nameless patients” were 
combined with the patients of Study I. These pa- 
tients were not retested and therefore could not be 
used to test Hypothesis 3.? 

Study II. Approximately 9 months following the 
completion of the first study, all new admission 
patients for a period of 5 months were tested. This 
time the age limit was raised to 55 years. Otherwise 
the criteria remained the same. Of the 187 males 
and 203 females who were available for this study, 
122 or 65% of the males and 131 or 64% of the 
females gave protocols scorable for C. 

Table 1 shows the distribution, by diagnosis, of 
all male and female patients used in the two studies. 

Control subjects consisted of males and females 
drawn from normals who had not been selected for 
the standardization groups. Within the framework 
of matching the controls with the patient groups 
for age and education, the selection was carried 
out on a random basis. Table 2 shows the mean and 
standard deviation of age and education for male 
and female patients and controls. 


Test Procedure for Patients 


Study I. All patients were tested in relatively 
small groups (15-25 patients) within 1 week of 
their admission. The patients were brought to the 
testing room by the ward personnel. Before the 
testing began, the patients were told that they were 
participating in a research study and that the 
results of the testing would be kept confidential. 


2 There were enough identifying data on the 
answer sheet (age and occupation) to determine the 
name of each patient. This the investigators did 
in order to obtain these patients’ diagnoses and to 
determine whether they were short- or long-stay 
patients. The investigators have no way of knowing 
whether any of the patients realized that his protocol 
could be identified even without his name. — 

83 We wish to acknowledge our appreciation to 
Robert B. Freeman who was responsible for the 
testing in Study I and for his general assistance in 


the first half of the project. 
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TABLE 1 


DISTRIBUTION OF PATIENTS IN Stupres I ann II 
COMBINED BY DIAGNosIs* 


Diagnosis Male Female 
Schizophrenic 

Paranoid 51 50 
Acute undifferentiated 22 30 
Chronic undifferentiated 36 35 
Schizo-affective 8 37 
Other il 11 

Total schizophrenic 128 163 


Nonschizophrenic, psychotic 


Involutional 9 30 
Manic depressive 2 20 
Depressive 6 0 
Paranoid state 1 i 
Nonschizophrenic, nonpsychotic 

Neurotic 28 34 
Personality trait disturbance oT 9 
Personality pattern disturbance 12 6 
Sociopathic personality disturbance 6 3 
Other 2 3 

Total nonschizophrenic 93 106 


^ Based on the American Psychiatric Association's Diagnostic 
and Statistical Manual of 1952. 


It was impressed on the patients that the test 
results would have no influence on the length of 
stay or the nature of their treatment in the hospital. 
The purpose of these instructions was to lessen the 
anxiety of the patients and make the test con- 
ditions more equal to that of the control group. 
Three tests were administered: the Photos Preference 
Test, the Draw-a-Person Test, and the MMPI, in 
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that order. The instructions for the PPT were the 
same as those given to the normals. Some patients 
protested that they could not judge whether they 
liked a person or not just by looking at a picture. 
In such cases they were told, “Write down e 
first impression,” or “Do the best you cana 
some cases where a patient was either especia y 
reluctant or appeared to be having difficulty wie 
the examiner helped out by filling in the eu CE 
as given by the patient. Because a follow-up a 
to be made of the patients, they were asked 
fill in their name on the answer sheet. oa 
Patients who were given a discharge or wen ir 
indefinite leave of absence within 4 months of t -i 
admission were retested with the PPT and Drar 
Person Test at the time of their departure. Pari 
who remained in the hospital for a period © th 
months were retested at that time. The Kee 
cutoff was chosen to separate the “short- ba 
“long-stay” patients because experience had shore 
that approximately half of the patients left 5 
hospital in the first 4 months of their hospital had 
Patients who left the hospital within 4 montbe ea 
to remain out of the hospital a minimum of 30 43) 
to be rated short-stay patients. 198 
Study II, Except for the fact that there W3 
change in the test administrator (Amon), ved 
Procedure for the first part of Study II oe Kj 
closely that of Study I. Approximately one-thir' ai 
the way through the second study, however; it to 
decided to administer the PPT twice in order 
determine the consistency of the patient’s ae É 
and so identify those patients who were respi the 
ing randomly. It should be noted that only ara 
data for the first administration of the PPT tests 
used in the comparisons to be reported. The wate 
that were administered from this point 0n, A 
the PPT, a 40-item exploratory questionnaire, 
repeat administration of the PPT, and the ped 
in that order, The Draw-a-Person Test was drop the 
because it was time consuming and because we 
results of the first study indicated that it a 
little promise. As in Study I these patients Y 


a 


1 


TABLE 2 


MEAN AND STANDARD DEVIATION OF AGE AND EDUCATION FOR MALE AND 
FEMALE PATIENTS AND CONTROLS 


p 
Study I Study IT 
Subjects N Age Education N Age Education 
Males M SD M SD M SD M A 
Patients 9 B6 95 10.7 28 122, 354. tit 10.3 ; 9 
Controls 99 32.2 10.2 10.8 2.8 108 34.7 11.4 10.6 
Females ae 28 
atients 138 35.4 9.4 10.7 2.6 131 36.5 10.3 10. 3 
cots 18 360 Ml te Zs n Yee ape Geese 


Not Some of the low education controls in Study I were used in Study II. 
ote.—S 


= —E —_ 7 
————=E—p>. C—O 
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TABLE 3 
MEAN C SCORE DIFFERENCES BETWEEN PATIENTS AND CONTROLS 


Males Females 

gan Pa- Con- Pa- Con- 

tudies N tients trl N t p* N tients trol WN t p 
Study I 99 75.7 785 99 1.71 <05 138 79.3 820 138 2.58  <.005 
S 

tudy IT 122 75.6 79.5 108 2.78 <.003 131 774 819 131 417 <.0001 
Studies I & IT 

combined 

Education 108 739 78.7 57% 2.43 <.008 114 77.3 821 61% 298 <.002 
<11 years 

Education 113 77.3 79.2 110 141 <.08 155 79.2 81.8 141 2.82 <.005 
211 years 


* P 
Qhe-tailed test. 
Made; (@ tae an insufficient number of normal: 
or subjects with less than 11 years of 


Tete: 

or vi at the time of discharge or leave of absence 

Ca 4 months following admission, whichever 
me first, 


RESULTS 


port a Fesults shown in Table 3 clearly sup- 
betwe e first hypothesis. In every comparison 

è in patients and normals, equated for 
iis education, the former have lower c 
Sponge thus showing more deviation in Te- 
In ad p to the photographs than the latter. 
the dition the findings in Table 3 show that 

ifference in C scores between patients 
normals is more striking among females 

males and among the less educated than 


amon 
8 the more educated groups. 


s were available as controls at the low educational level, two separate analyses were 
f education and (b) for subjects with 11 or more years of education, 


great variability of the patients’ scores.* Some 
of the highest and lowest C scores were those 
obtained from patients. We were interested 
in determining whether these C scores were in 
fact related to an independent measure of 
mental status. For this purpose, the patients 
were divided into those diagnosed as schizo- 
phrenic in one group and all other diagnostic 
categories in a second group. Since schizo- 
phrenia is a label given to the most patholog- 
ical patients, we predicted that their C scores 
would be significantly lower than for the 
nonschizophrenic patients. Table 4 gives the 


4Standard deviations were in all cases significantly 


greater for patients than for normals. However, 


Bente of the factors operating to reduce the these differences were not so large as to call into 
Hficance of the results in Table 3 was the question the validity of the t test (Boneau, 1960). 
TE 
TABLE 4 
MEAN C Score DIFFERENCES BETWEEN SCHIZOPHRENIC AND NONSCHIZOPHRENTC PATIENTS 
Males Females 
Non- 
S Schi at Schizo- schizo- 
Š tudies N ren sirene N t pt N phrenic phrenic N t p* 
tug 
Sty, YI 77.2 82.5 57. 3.21. KW 
.008 81 i 
Stuga l oe ee ae ae 288 02 
: 3 y 6) 81. 1 X <.0001 
Combines 128 727 79.7 93 450 <.0001 163 76 
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TABLE 5 


MEAN C SCORE DIFFERENCES AT THE TIME OF ADMISSION BETWEEN SHORT- AND LONG-STAY PATIENTS 


Males Females 
Short- Long- Short- Long- g 
Studies N stay stay N t pt N stay stay N t p 
Study I 54 784 731 37 214 <02 83 80.9 76.4 53 2.72 <.004 
Study II 80 77.7 73.5 32 186 <.04 95 77.8 75.8 33 .96 Pa 
Studies I & IT 134 78.0 73.3 69 286 <.003 178 79.2 762 86 239 <. 
combined 
ital 
Note.—Short-stay patients left the hospital within 4 months of admission while long-stay patients did not leave the hospita 
within 4 months of admission. e 
* One-tailed test. 
i 3 is 
results of this comparison for each study and deviant responses. Results relevant to thi 


for both combined. The difference in C scores 
between schizophrenic and nonschizophrenic 
patients is extremely significant in each study 
and for both sexes. As predicted, schizo- 
phrenic patients as a group show more devia- 
tion (lower C score) than the nonschizo- 
phrenic patients. 

A further analysis of the patient data re- 
vealed that patients diagnosed as schizo- 
phrenic tend to be less educated than patients 
not so diagnosed. In order to determine how 
much education contributes to the differences 
in C scores between schizophrenics and 
nonschizophrenics, correlation coefficients be- 
tween education and C score were calculated 
for all patients. These correlation coefficients 
are positive and significant, that is, the more 
educated patients tend to have higher C 
scores. However, the degree of relationship 
between these two variables is so slight, 
+.125 (p< .05) for males and + .189 
(p < .01) for females, that it accounts for 
very little of the variance in C scores between 
schizophrenic and nonschizophrenic patients. 

Finally, it should be pointed out that 
while normals have significantly higher C 
scores than patients (Table 3), their C scores 
are no different from those of the non- 
schizophrenic patients (Table 4). In short, 

it is the deviant responses of the schizo- 
phrenic patients that make for the difference 
between normals and patients. 

The second hypothesis predicted that pa- 
tients giving less deviant responses to the 
facial photographs on admission would leave 
the hospital sooner than patients giving more 


hypothesis are shown in Table 5. Focusing 0” 
the individual studies separately, we find oe 
three out of the four ¢ tests are significan : 
and the fourth (Study II, Females) while no 
significant is in the predicted direction. +, i 
t tests for the combined studies are sian 
cant at the .003 level for males and at H 
.009 level for females. Thus patients who pe 
high C scores on the PPT when testce=2urine 
the first week of their admission to the hose? 
tal tend to leave the hospital sooner tha 
patients who get low C scores. 

Since the C score is significantly Š 
to length of stay and also to diagnosis * net 
Table 4), the question was posed whet! 
diagnosis is also related to length of stay: 
and if it is, whether the C score length ° 
stay relationship obtains within the schizo 
phrenic and nonschizophrenic groups- 
answer to the first question is show? 2 
Table 6. The results show that diagnosis 3 
indeed related to length of stay, a 
strikingly for males (p< .001) than 30 
females (p < .05). In general, patients Es 
do not receive a schizophrenic diagnosis a i 
a much better chance of leaving the hosp! + 
within 4 months of admission than patie? 
receiving a schizophrenic diagnosis. 

Table 7 shows the relationship betweer 
C scores and length of stay® within €% 


related 
(see 


n in 


the 
5In this study the term “diagnosis” refers to | 


0” 
differentiation of schizophrenic from nonschiZ 
phrenic patients. 

® Length of stay for this analysis was 
of days in the hospital up to a year before 1 
absence or discharge. 


the number 
eave ° 


j 
7 
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TABLE 6 
RELATIONSHIP BETWEEN DIAGNOSIS AND SHORT-STAY AND Lonc-Stay PATIENTS 
Males Females 
Short- Long- Short- Long- 
Studies stay stay Peg $ stay stay ee p 
Study I 
Schizophrenic 36 25 10.11 <.01 57 24 1.07 ns 
Nonschizophrenic 44 if 37 10 
Study II 
Schizophrenic 25 27 8.32 <.01 44 37 4.70 <.05 
Nonschizophrenic 29 8 iý 18 
Studies I & IT combined 
Schizophrenic 61 52 18.69 <.001 101 61 4.89 <.05 
Nonschizophrenic 73 15 17 25 
Note.—See footnote 5. 
di 5 = 
agnostic group and for both groups com- cant for males (r= — 219, p< .02) and 


females (r= —.153, P< .05). Within the 


Med. First it should be noted that all 18 l Vithin th 
nonschizophrenic group this relationship is 


Correlati : i 
Vtrelation coefficients in Table 7 are nega- 


the at is, the higher the C score the shorter 
in ae of stay. Of particular importance 
dia “te table are the correlations within the 
Fixe Ostic groups for Studies I and II com- 


ed. The results show that the correlation 


significant for females (7 = —.189, p < .05) 
but not for males, although the correlation is 
in the predicted direction and close to sig- 
nificance (r = —.156, p< .10). 

The third hypothesis predicted that pa- 
tients who improve sufficiently to qualify for 


e À : 
peen C and length of stay within schizo- l to 
*enics, although quite modest, is signifi- leave of absence or discharge within 4 months 


TABLE 7 


AND LENGTH OF STAY FOR Dirrerent DiacNostic GROUPS 


RELATIONSHIP BETWEEN C SCORES 


Males Females 
SS ee s 
Studies N r t + N r t b 
Stuq 
yI 
Schi 5 —.202 182 <.05 
ton taphrenic w =e tes ae 35 —258 197 08 
chizophrenic 36 (259A ee 
T Patients 90 =z 328 <0 136 —.256 298  <.002 
Udy Ty 
Schi so —.133 118 ns 
Nenat hte So o= E w 46 —109 072 ns 
izophreni —.065 0.45 = 
P Daime e Bo Dis 187 <05 126 2137 154 ns 
Udieg 7 
& II combined 
Schi i w =i r9 a0 
Nonschienie ii =n Ar a 101 —189 189 <05 
Seren eats : 2 ‘002 
AU tate BS Dis ars <o %0 187 303 <00 
‘ 


O; 
tailed teas 
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TABLE 8 
DIFFERENCES IN CHANGE OF C SCORES For SHORT- AND Lonc-Stay PATIENTS 
Males Females 
Mean Mean r 
Studies N* change F p Ns change F p A 
Study I 
Short-stay 12 —1.31 55 ns 32 +43.1 8.32 <.005 
Long-stay 22 +1.62 25 —3.6 
Study II 
Short-stay 39 -+1.90 1.33 ns 47 -+4.90 1.88 ns 
Long-stay 26 —0.53 27 -+2.60 
Studies I and II combined 
Committed patients only 
" Ghareatay 29 +1.34 a2 ns 50 -+6.94 23.55 <.0001 
Long-stay 40 +0.28 39 —0.29 
Committed & voluntary 
patients i 
Short-stay 51 +0.84 00 ns 79 +4.18 10.36 <.00 
Long-stay 48 +0.78 52 —0.39 


n i i ital. Ow 

a Retesting of short-stay patients depended on our being notified by the wards that patients were leaving the ony patien 

often the ward personnel were too involved with more immediate problems than to think of contacting us. Also, quite a not retesté 
left the hospital on weekends or after hours. For the above reasons, approximately 50% of the short-stay patients were 


This explains our reduced N. 
* One-tailed test. 


of their date of admission (short-stay pa- 
tients) when retested at the time of their 
departure will show a greater decrease in 
deviation from the standardization group 
than will patients not so improved (long- 
stay patients). The latter were retested 4 
months following their admission. Table 8 
shows the results of an analysis of covari- 
ance of the change in C scores between 
short-stay and long-stay patients, controlling 
for the effect of the original C score. Here 
a striking sex difference occurs. For male 
patients there appears to be no relationship 
whatsoever between a change in C score and 
length of stay. On the other hand, this rela- 
tionship is very significant for female pa- 
tients. It should be noted that among females 
this relationship is more significant for com- 
mitted patients (F = 23.55, p < 0001) than 
for committed and voluntary patients com- 
bined (F = 10.36, p< .001). This is under- 
standable since voluntary patients can leave 
the hospital without necessarily being judged 
as improved, while committed patients have 


no such freedom. 


ite 
ts 
d. 


Other Findings 


Pa nly 

Since the MMPI was administered 4 0 

once in both studies, that is, at the psa 
admission, its interest lay in its relatio 


er 
to length of stay and to the C a Soine 
1: i t to de 
analyses were carried ou of leng 


whether the MMPI is predictive a- 
of stay. The short-stay and long-stay a 
tients were compared for each of the ri! 
standard scales. Of the 48 ¢ tests (suet 
and II, males and females) only ae 
significant at the .05 level, and no one a in 
consistently differentiated the two grouP 
both studies. In those instances Whe on 
scale showed a significant difference "ither 
study, the means of that scale were S 
reversed or showed no difference in the 
study. Another approach was to hav MPt 
experienced clinicians evaluate the 5 of 
profiles and rank order them in term 
“degree of pathology.” This was CATT owe 
for Study I only. Since the clinicians pei 
a high degree of consistency in their hae 
each MMPI profile was assigned the @ 
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rank as its score. These average ranks were 
then correlated with length of stay, and 
the results yielded low positive correlations 
which, however, were not significant. Finally 
an item analysis for short-stay and long-stay 
Patients was carried out for both studies. A 
oe of items differentiated the two groups 
oo for Study I. However, practically 
fits a of the items that were significant in the 
E study showed up as significant in the 
at nd study. The failure to find any con- 
“ed in the two studies was true for the 
a e and the female patients and discouraged 
ber use of other scales to predict 
A of stay. The overall results suggest 
Kine piratenes „the MMPI measures at the 
to È admission, it bears little relationship 
3 TA of stay, at least for this population. 
with “a analysis of the MMPI for patients 
fa high and low C scores similarly yielded 
8ative results. 
a Draw-a-Person Test which consisted 
vida S drawings, one male and one female, 
ha by six clinicians” for degree of 
Patient: The data for the male and female 
tankin J were treated separately. Since the 
ined 8s were fairly reliable, they were com- 
of dr to obtain an average rank for each set 
then awings. These average rank scores were 
nate with C and length of stay. 
et showed no significant relationship 
Other the _Draw-a-Person Test and the 
Wo variables for males or females. 


DISCUSSION 


oh us of this study support, for the 
Ogica] "a the general hypothesis that psycho- 
ectiy, isturbance is reflected in deviant 
faces e responses to pictures of human 
indeed The hospitalized mental patients do 
to the show more idiosyncratic responses 
Weve Photographs than do the controls. 
Atient r, it is the more seriously disturbed 
this qi S, the schizophrenics, that account for 
tha ifference, The results, therefore, indicate 
Ver, PS¥chological disturbance has to reach 
Brave proportions before the ability to 


7 
We wy: 
Shanes ee to acknowledge our appreciation to 
tiag Ro Dutton, Francis McFarland, Rollin Rose, 
or thei, Sy, Emanuel Skoor, and Richard Wolton 
* aid in ranking the MMPIs and DAPs. 
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respond to facial photographs is markedly 
affected. In order to determine whether edu- 
cation, too, was related to deviant responses 
to facial photographs, correlations were car- 
ried out between C and education for schizo- 
phrenics and nonschizophrenics, separately 
and for the controls. Only the male and female 
schizophrenic correlation coefficients were sig- 
nificant (+.19, p<.05 for the males and 
+.17, p < .05 for the females). The corre- 
lation coefficients for the nonschizophrenics 
were +.05 for the males and +.08 for the 
females, and for the controls +.11 for the 
males and —.04 for the females. One may 
conclude from the above that the ability to 
react to stimuli such as facial photographs 
in a way that is shared by most people is 
a relatively stable characteristic and is upset 
only under great psychological stress (schizo- 
phrenia). In addition, the data suggest that 
schizophrenics with a limited education may 
have a little more difficulty resisting per- 
ceptual disorganization than schizophrenics 
with more education. 

The second hypothesis in this study is 
really a question of the prognostic value of 
the C score, without regard to the type of 
treatment given to the patients. In a recent 
review of the literature on the prognostic 
use of psychological tests, Fulkerson and 
Barry (1961) point out that where tests are 
successful in predicting outcome it is because 
they accurately measure the variable “severity 
of illness.” The results in this study are 
certainly in line with this generalization. The 
C score has proved itself as a sensitive dis- 
criminator of the more from the less severely 
ill at the time of admission and has been 
able to predict length of stay to a moderate 
but significant degree despite the short- 
comings of length of stay as a criterion of 
improvement.° The results have also shown 
that, although diagnosis and C both cor- 
relate with length of stay, C makes some 

8 Length of stay, which has been used as a 
criterion of improvement in many studies (Fulker- 
1961), while objective, is far from 


i an ideal criterion because some patients 
stay or leave the hospital for reasons other than 


the state of their psychiat t 
variable that has been little studied but which 


plays an important role in determining length of 
stay is “family resources.” 
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contribution to the prediction of length of 
stay independent of diagnosis. . 
Was the C score effective in differentiating 
the short-stay from the long-stay patients 
because the sicker patients were responding 
autistically or because, not being able to 
attend to the task, they were responding 
randomly? To answer this question, most of 
the patients in the second study were given 
the test twice in the space of 1 hour. When 
the number of changes in response to the 
particular photographs was more numerous 
than could occur by chance once in a hundred 
times, the patient was judged inconsistent. 
The proportion of patients who were found 
to be inconsistent was surprisingly low, 10% 
of the males and 11% of the females. The 
critical question was whether the proportion 
of short- and long-stay patients was different 
for the inconsistent and consistent responders. 
The chi square turned out insignificant (x? = 
.54), which indicated that the C score’s 
sensitivity to differentiating short- and long- 
stay patients is not based on the fact that 
long-stay patients are simply more random in 
their responses to the photographs than short- 
stay patients. 
The results of the third hypothesis, assum- 
ing that the difference in findings for the 
males and females is reliable (Table 8), are 
most puzzling. Why should the change in C 
score reflect the improvement of the short- 
stay female patients but not the short-stay 
male patients? We tried to rationalize these 
results by suggesting that since men tend to 
resist psychiatric treatment more than women 
and since there is more economic pressure on 
men than on women from the environment, 
men are released from the hospital as soon 
as they manifest a superficial improvement in 
behavior. Women, on the other hand, may be 
kept longer until a more a significant change 
in their behavior takes place. If the above 
were true, we should expect that males would 
be released more quickly than females (con- 
trolling for diagnosis) and that the recidivism 
rate would be greater for the former than 
the latter. We found neither expectation to 
be confirmed in our sample. , 
A rationalization that is more speculative 


than the above but which may have more 
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merit in explaining our results concerns the 
difference in emphasis that men and we 
place on the interpersonal role. Early in a 
boys begin to direct their interests and ai 
directed by society toward things mechanic i 
and scientific, toward ideas and absirac” P 
in contrast to girls whose interpersonal a 
terests remain central. This difference 15 ~ 
only reflected in the choice of toys, ee 
and school subjects, but also in the ii 
of interpersonal communications nn wa 
ings and people, as shown by Jourar' ae 
Lasakow (1958). Support for this view Er 
from our standardization data which other 
that females tend to agree with each bent 
much more than men do as to which EE 
photographs they like and which they hott 
like. In short, they share their feelings 2 A 
likes and dislikes of people to a i the 
tent than men do. Thus, because A Í 
centrality of the interpersonal nee roto 
women, their ability to respond to the P tuate 
graphs appropriately is likely to aeu 
with the degree of disorganization of € f the 
processes. On the other hand, because “i 
more varied involvements of men, sate to 
of interpersonal affect reflected in ee ttle 
the photographs is less central, and t on 
their response to the facial photographs 1 dis- 
be less likely to fluctuate under centra = 
organization. The above would explain show 
male patients even when they improve ow 
little change in C score while women 5 
striking change. These speculations 2" 
gestive of some important differences 
the interpersonal needs of males an 
that might reward study. 
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FIELD DEPENDENCE AND OVERCOMING EMBEDDEDNESS* 


STEPHEN A. KARP 
State University of New York, Downstate Medical Center 


Witkin and his colleagues have suggested that field dependence involves 
ability to overcome the effects of embedding contexts. The present study 
evaluates this hypothesis in relation to an alternative one, that field dependence 
involves ability to resist distraction. 150 male college students were given a 
battery of 18 tests including tests of field dependence, WAIS subtests, Adaptive 
Flexibility measures, and measures of ability to resist distraction. A factor 
analysis was carried out on the matrix of intercorrelations between these 
variables, The results tend to support the Witkin hypothesis; the field 
dependence and other embeddedness tests load and define different factors 
than the distraction tests. The two kinds of factors do, however, tend to be 


moderately correlated. 


Since 1948, Witkin and his associates 
have carried out an extensive series of studies 
of perceptual field dependence (see Witkin, 
Lewis, Hertzman, Machover, Meissner, & 
Wapner, 1954, for a summary of the earlier 
work). On the basis of evidence obtained 
from these studies, they describe a broad 
dimension of psychological functioning 
which they call differentiation (Witkin, Dyk, 
Faterson, Goodenough, & Karp, 1962). They 
demonstrate that from knowledge of one’s 
relative level of differentiation, predictions 
can be made regarding aspects of his func- 
tioning in a variety of areas including per- 
ception, problem solving, intelligence, and 
personality. Among the measures employed 
to tap the differentiation dimension, a series 
of tests of perceptual field dependence have 
been most frequently and centrally used by 
these investigators. In view of this central 
role assigned the field dependence dimension 
in their work, Witkin et al. have carried 
out a number of studies for the purpose of 
defining the kinds of cognitive abilities 
measured by tests of field dependence. 

On the basis of evidence obtained from 
some of their early studies of field depend- 


1 Based upon the author’s dissertation at New 
York University in partial fulfillment of the re- 
quirements for the degree of Doctor of Philosophy. 
The author is indebted to Jacob Cohen, sponsor 
of the dissertation, and to Richard Loeff for his 
assistance in gathering and analyzing the data. This 
tudy was supported in part by a research grant 
( 628) from the National Institutes of Health, 
G States Public Health Service, with H. A. 


Witkin as principal investigator. 


ence, Witkin and his colleagues hypothesized 
a general cognitive style by which Pal 
formance in many perceptual and intelled 
situations might be characterized. TA 
viewed this cognitive style as rene a 
ability to find or manipulate an item W. A 
is embedded within a surrounding one 
To test this hypothesis, several studies aie 
carried out for the purpose of deter ae 
relationships between measures of field res 
pendence and other kinds of measu 

which, upon rational analysis, appeare cect 
require ability to overcome embeddedno® 
Witkin and his colleagues predicted ae 
such measures should be significantly © 


oat to 

related with and factorially similar 
measures of field dependence. Karp 
In one such study, Goodenough and scale 


(1961) gave the Wechsler Intelligence à 
for Children (WISC) and three tests st 
field dependence, the rod-and-frame ia 
(RFT), body-adjustment test (BAT), T 
embedded-figures test (EFT) to two 8! 


the 

of children (aged 10 and 12). OD sub- 
basis of rational analysis of the WISC S" 5 
tudies, 


tests and previous factor analytic S$ A 
they identified three subtests (Block ae 

Picture Completion, and Object AS d- 
as involving ability to overcome embe me 
ness. They predicted that these a ae 
along with the tests of field depen ee 
would define a single factor on the ming 
of the common requirement of overco! 

embeddedness. This was substantiated Dies 
results of two factor analyses (one ee 
group). Two other predicted factors emo 
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in both studies, a Verbal Comprehension 
factor (loading Vocabulary, Information, 
Similarities, Arithmetic, and Comprehension) 
and a Memory or Attention-Concentration 
factor (loading Digit Span, Arithmetic, and 
Coding), neither of which consistently and 
substantially loaded the embeddedness tests. 

In another study, Goodman * postulated 
relationships between measures of field de- 
Pendence and Flexibility of Closure (Botzum, 
1951; Pemberton, 1952; Thurstone, 1944, 
1949), based upon her evaluation of meas- 
ures of Flexibility of Closure as involving 
ability to overcome embeddedness. She ob- 
tained significant correlations between the 
two types of measures. Gardner, Jackson, 
and Messick (1960) have since found tests 
of field dependence and Flexibility of Closure 
to define a single factor. 

In a third study, Witkin et al. (1962) 
Predicted that tests of field dependence 
ped be significantly related to tests of 
adaptive Flexibility (Guilford, Frick, Chris- 
tensen, & Merrifield, 1957). This was based 
Pon their view that the latter measures 
al involved problem situations where the 
oo" provided an embedding context from 

ich some piece of information had to be 
extracted and restructured in order to solve 
G € problem. They found two such measures 
Í Adaptive Flexibility, Match Problems and 
eet Problems, to correlate significantly 

= EFT, BAT, and RFT. 
Con, he evidence from these studies suggests 
twe, erable interrelation and overlap be- 
ie measures of field dependence, Flexi- 
ity of Closure, and Adaptive Flexibility. 

a et al. have suggested, as @ basis for 
elie, Overlap, that ability to overcome the 
requis of embedding contexts is a common 
fact irement of tests defining each of these 
te In contrast, interpretations of Flexi- 
se of Closure by Thurstone (1944), 
Best ie (1951), and Pemberton (1952) sug- 
the at the tests loading this factor require 
tracy ect to overcome the effects of dis- 

Ing, rather than specifically embedding, 


0 . 
imps: The latter interpretation might 
Sp Y that an embedding context is but a 


eci; b; 
lal case of a distracting context. 


2 
(19n Published study, described in Witkin et al. 
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Evidence from two studies might be con- 
sidered as supportive of the Thurstone inter- 
pretation. Podell and Phillips (1959) and 
Jackson (1955) each obtained significant 
correlations between the EFT and a measure 
which appeared to reflect ability to overcome 
the effects of distracting, rather than em- 
bedding, contexts. 

The study of Podell and Phillips involved 
cluster analysis of a group of cognitive tests, 
including the EFT and Podell’s (1957) 
Form Embeddedness Test. The latter test 
requires the subject to locate a series of 
simple figures in each of two kinds of com- 
plex figures, one organized and one random. 
Their results suggest that the EFT and a 
total time measure of Form Embeddedness 
(for both random and embedding contexts) 
correlate highly with one cluster (Spatial 
Decontextualization). A second Form Em- 
beddedness measure (ratio of time taken to 
find figures in embedding contexts to time 
taken to find them in random contexts) cor- 
relates little with this cluster. This would 
imply that EFT performance may be as 
highly related to performance under con- 
ditions of distraction as it is to performance 
under conditions of embeddedness. 

Jackson administered a battery of tests to 
small groups of male and female subjects. 
The battery included the EFT and a Words 
in Noise Test. In the latter situation, sub- 
jects were asked to identify words hidden by 
background white noise. This background of 
noise can be viewed as providing a distract- 
ing context. Jackson obtained a significant 
relationship between the tests for men 
Gea 4, Ps 05) but not for women 
(r= .21). Although a different perceptual 
modality was used in this study—and results 
were obtained for men but not for women— 
these findings offer some support for viewing 
embedding and distracting contexts as related. 

The above results suggest that further 
evidence is needed before the issue of dis- 
traction versus embeddedness can be set- 
tled. The present study is designed to provide 
such evidence. It must be noted, however, 
that this study is not directed at investiga- 
tion of the proad issues of how or why one 
may be distracted. Rather, the kinds of 
distraction situations chosen for study all 
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involve irrelevant visual stimuli which sur- 
round or intersect critical test items which 
must be located or manipulated by the sub- 
ject. The distraction situations are thus 
similar to embeddedness situations, such as 
EFT, where critical visual items ‘are sur- 
rounded by irrelevant visual stimuli. The 
difference between the two types of situation, 
which leads to interpretation of one context 
as distracting and another context as em- 
bedding, lies in the relationship of critical 
to irrelevant items. In the distraction situa- 
tions, the figural properties of critical items 
remain intact. A triangle remains a triangle, 
despite the surroundings. In the embedded- 
ness situation, the critical item or its parts 
are organized into new, competing gestalts 
which serve to break up the original critical 
figure. In effect, a distracting context may 
be thought of as obscuring a critical item 
without changing the nature of the item, 
whereas an embedding context serves to 
obscure a critical item because it changes 
the nature of the item. 


METHOD 


Subjects were 150 undergraduate males attending 
Brooklyn College. All were paid volunteers. They 
ranged in age from 17 to 43 years (the second oldest 
was 27), in college class from lower freshmen to 
upper seniors, and included majors in virtually 
every subject offered at Brooklyn College. 

The following tests were administered to each 
subject: 

1. BAT. A measure of field dependence involving 
ability to adjust one’s tilted body to the upright 
while surrounded by a tilted field. Score for each 
trial is degrees of deviation from the true vertical 
at which the subject reports himself as vertical. 
Test score is the sum of six trial scores. 

2. RFT. In this test of field dependence the sub- 
ject is seated in a completely darkened room, where 
all he can see is a luminous rod surrounded by a 
luminous square frame. At the beginning of each 
trial both rod and frame are presented in tilted 
positions, the subject’s task being to adjust the 
rod to the vertical while the frame remains tilted. 
Score for each trial is degrees of deviation of the 
rod from the true vertical when the subject reports 
it to be upright. Test score is the sum of eight 
trial scores. 

3, EFT. This test of field dependence consists 
of a series of pairs of figures, each pair consisting 
of a simple geometric figure and a complex colored 
figure within which the simple figure is embedded. 
The subject’s task is to quickly locate the simple 
feire within the complex one, his score being 
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the time taken to do so. Test score is the sum of 
time scores for 12 trials. z 

4. Wechsler Adult Intelligence Scale (WAIS). Six 
subtests of the WAIS, Block Design, Object As- 
sembly, Comprehension, Vocabulary, Arithmetic, and 
Digit Span, were administered and scored in the 
standard manner (Wechsler, 1958). 3 

5. Match Problems. In this test of Adaptive 
Flexibility, the subject is shown a diagram of a 
lattice of squares which are formed by an arrange- 
ment of matchsticks. His task is to show how, by 
removal of a specified number of matchsticks, one 
can arrive at a given number of squares. Score 15 
the number of correct solutions in 14 minutes. 

6. Insight Problems. The test, another measure of 
Adaptive Flexibility, consists of 12 problems re- 
quiring the restructuring of the “given” in order 
to reach a solution. Score is the number of correct 
solutions in 22 minutes. 

7. Digit Symbol. This modified version of the 
WAIS subtest contains approximately twice as many 
items as the standard version. Score is the time 
taken to complete the test. . 

8. Distracting Contexts Test I (DCT I). This 
test is similar to the EFT; however, the context 
in which the simple figure is hidden was designe 
to serve a distracting, rather than an embedding, 
function.3 k 

9. DCT IIA. This test is scored for speed with 
which the subject locates each of a series of 
simple geometric figures which are interspers¢ 
among a matrix of 52 such figures. Here, distraction 
is provided by the presence of irrelevant simple 
figures surrounding the critical ones. A; 

10. DCT IIB. This test is similar to DCT IA; 
however, as an additional source of distraction, a 
series of semitransparent colored designs are gpa 
imposed over many of the simple geometric figur? 
to be located. suit 

11. Arithmetic Operations ‘Test. The subjects 
score is the time it takes to solve 24 simple arithm® 
tic problems (eg, 8—3+4). The problems at? 
spaced evenly on a single page; however, surroun t 
ing the problems are a series of totally irrelevan 
pictures, commands, sayings, and jokes. The subjee 
is instructed to do the problems as quickly as 
can, ignoring the irrelevant material. «ton 

In addition, two other tests (Form-Recognitt 
and Cancellation), administered to the present $0 
jects by another investigator as part of a diferen 
study of embeddedness and distraction (Loe i 
1961), were included in the present study aat 
hypotheses for purposes of exploring their factor 
content. The former test (Holtzman, 1955) involve 
speed of recognition of a series of impoverishe 
Pictures of animals, and the latter involves Spee 
of crossing out the letters “a, t, and c” each en 
they appear on a page of randomly arranged lette 4 

The hypotheses of the present study include 
general hypothesis, that tests involving overcom 


able 


3 Copies of the four distraction tests are vals 
at no cost from Cognitive Tests, P. O. Bo 
Vanderveer Station, Brooklyn 10, New York. 
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TABLE 1 


PREDICTED Factors AND TEST LOADINGS 


Factor 


Name 


Tests 


I 


Analytical Ability 


BAT, RFT, EFT, 
Design, Object 
bly, Match Pr 
Insight Problems 


II Verbal Comprehension Comprehension, 
T Vocabulary 
I Memory-Attention Arithmetic, Digit Span, 
I Arithmetic Operations 
V Overcoming Distracting DCT I, DCT HA, DCT 
Contexts I IIB, EFT, Block, De- 
y sign, Object embly 
Overcoming Distracting Arithmetic Operations, 
Contexts II DCT I, DCT HA, DCT 
vI IIB 
Overcoming Embedding EFT, Block Design, Object 
Contexts versus Time Assembly, Match Prob- 
vir Insight Problems 
viit Performance Speed Digit Symbol 


DCT II Equipment 


DCT IIA, DCT IIB 


em : 
PO beddedness (EFT, RFT, BAT, Block Design, 


emer’, Assembly, M 


ms) wi atch Problems, Insight Prob- 
tests , will load a different factor (or factors) than 
lg 2Volving overcoming distraction (DCT 


I, IA, 


ypo tithmetic Operations) ; and a series of specific 
| tiaga ma as to the nature of factors expected to 
umm and the tests expected to load such factors 
osnnized in Table 1). 
Upon ee of Factors I, I, and II was based 
taineg te Goodenough and Karp study which ob- 
Previo Similar factors with children. Factor VII was 
the usly identified by Cohen (1957) in studies of 
a te IS. Factors IV, V, and VIII were expected 
ang plt of the inclusion of the distraction tests, 
Flexibit ctor VI because of inclusion of the Adaptive 
et Measures. f 
pete chiens product-moment coefficients of correlation 
Prin, ip ined between each pair of the 18 measures. 
then aS components factors (Hotelling, 1933) were 
C Xtracted from the matrix of jntercorrelations 
Mputer, 


E dace 

bas cision about number of factors to rotate 
Mate tes upon two criteria, a rule-of-thumb esti- 
Si olving rotation of factors with latent roots 


Sina’ both the first four and the first eight prin- 


ed py aple structure, following procedures out- 
x attell (1952).4 Rotation was carried out 

. Five 

apliabitiy bles, giving zero-order intercorrelations, 
li estimates, unrotated factor loadings, and 
tansformation matrices have been deposited 
“Umen Merican Documentation Institute. Order 
it roj No 7507 from ADI Ausiliary Publica- 
Agres Ject, Photoduplication Service, Library of 


» Washington 25, D. C. Remit in ad- 


O 
tig 
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graphically by the present investigator while “blind” 
as to the identities of the test loadings being rotated. 
After rotation was completed the matrix of rotated 
factors was given to an independent judge (also 
“blind”) to attempt further rotations that would 
improve simple structure. 

For purposes of evaluation of hypotheses, loadings 
of +.25 or greater were considered significant. 


RESULTS 


Final rotated loadings of the 18 measures 
on four and eight factors are presented in 
Tables 2 and 3 and correlations among refer- 
ence vectors in Table 4. In the course of 
carrying out rotations it quickly became 
apparent to the present investigator and to 
two independent judges that the four-factor 
solution underestimated the number of inter- 
pretable factors present. These factors were 
crowded and many test loadings tended to 


TABLE 2 
Factor LOADINGS AFTER ROTATION OF Four Factors 


Factor 
Test 1 2 3 4 
EFT 82 —06 -17 00 
WAIS Arithmetic 34 0 06 57 
WAIS Object Assembly 59 07 09 —18 
Match Problems 51 10 05 10 
DCT IIA 14 46 —20 05 
Cancellation —10 68 10 10 
RFT 78 —10 —33 06 
WAIS Vocabulary 10 —10 64 09 
WAIS Block Design 66 03 05 —01 
Arithmetic Operations 00 56 0l 55 
DCTI o 56 0l —10 
Form-Recognition 44 13 09 —32 
BAT 74 —21 —27 07 
WAIS Comprehension —07 02 76 —01 
WAIS Digit Span 09 10 —09 78 
Digit Symbol —08 715 —01 4l 
Insight Problems 52 00 17 05 
DCT IIB 05 64 —10 08 


Decimals are omitted; toaings over +.24 a in 
4 tha 
type; and tests are reflected where necessary so 
tole scores represent greater field independence and 
Pite Iligence, more rapid problem solving, letter cancellation, 
aad form recognition, and less interference from distracting 
stimuli. 


Note.—I 


1.25 for microfilm or $1.25 for photo- 
copies and make checks payable to: Chief, Photo- 
duplication Service, Library of Congress. F 

Supplementary analytic rotations, to normal vari- 
max and pbiquartimin criteria, were also carried out. 
Results of these procedures, which are fairly con- 
sistent with those to be presented here, are reported 


elsewhere (Karp, 1962). 


vance $ 


2 


98 
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TABLE 3 
Factor LoApINGs AFTER ROTATION OF EIGHT FACTORS 


Factor 
Test 1 2 3 4 5 6 7 8 

ZF 733 -09 —O4 00 01 27 09 05 
VAIS Arithmetic 09  —07 09 60 17 01 09 E 
WAIS Object Assembly 49 —05 10 —09 10 27 —08 E 
Match Problems 59 10 —03 01 —23 55 —10 Ae 
DCT IIA —10 08 —03 06 62 -—09 —10 A 
Cancellation —02 56 22 —06 09 05 —40 oF 
RFT 76 05 —05 o7 -09 —04 00 ee 
WAIS Vocabulary 08 00 76 —07 —10 03 “u -= ; 
WAIS Block Design 56 —09 —01 08 04 39 07 we 
Arithmetic Operations 10 61 09 49 -10 -03 —47 ey 
DCTI —01 48 07 02 o4  —10 01 = 
Form-Recognition 38 07 10 —10 08 05 05 ar 
BAT 74 = —10 01 07 03 —05 —08 n 
WAIS Comprehension —09 02 70 08 04 —03 —08 -3 
WAIS Digit Span —09 08 —01 73 05 —09 —02 x 
Digit Symbol —06 6l —07 43 06 10 —4l =W 
Insight Problems 47. —10 02 05  —10 56 09 0. 

DCT IIB —05 52 07 —06 07 o  —03 09 


Note.—See note to Table 2. 


“hang” in a position between the hyperplanes 
and clusters of measures clearly loaded (e.g., 
10% of the loadings fell between .10 and 
.25). Percentage of loadings within the + .10 
hyperplanes (57%) was relatively low for 
the kinds of ability variables included in the 
present study. In contrast, the eight-factor 
solution suggested considerably better resolu- 
tion of factors, with 76% of loadings between 
Æ .10 and only 2% between .10 and .25. 
Regarding the general hypothesis, that 
tests involving the overcoming of embedding 


TABLE 4 


CORRELATIONS AMONG REFERENCE VECTORS AFTER 
OBLIQUE ROTATION or EIGHT AND Four 
(my Irazics) Factors 


NOOB 


23 =—.22 12 =.28 35 04 .06 
=—16 —J5 —.04 
10 21 —12 —16 —43 —.08 
00 33 
—.18 03 —.21 —.05 —.19 
—.18 
Al —.16 —.18 —.09 
—.27 01 —.01 
.05 18 
02 


contexts load different factors than tests by 
volving distracting contexts, it may be nea 
that there is a complete absence of be 
of significant loadings of tests involving a 
ferent kinds of contexts on either fonr m 
eight factor solutions. None of the fac ay 
obtained load both kinds of tests. It ™ 
also be noted from Table 4 that Races 
and 6 (defined by embeddedness tests) i 
not highly related to any of the factors W. 
load distraction tests (2, 4, 5, 7, 8), altho 
some obliqueness is present (correlations 
tween reference vectors ranging from - these 
—.27). It seems reasonable to vieW the 
data as providing substantial support for 
general hypothesis, with the reservation Ws 
factors defined by the embeddedness a 
are not orthogonal to all factors define 

the distraction tests. 

With regard to specific hypothes 
the kinds of factors expected and ies 0 
predicted as loading such factors, a setlé per 
point-biserial correlations was obtained Sn 
tween each combination of one predicted A 
one obtained factor. For this purpose, vie 
dicted factors were treated as dicho a 
variables (each test scored “1” if signi 
loading on that factor was predicte 
“Q” if not) and obtained factors as CO? 


es about 
the tests 


tinu- 
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ous variables (score for each test was its 
loading on a given factor). An obtained 
factor was matched with the predicted factor 
with which it was most highly correlated. 
The magnitude of the correlation suggests 
the degree to which the match is a good one. 
Obtained Factor 1 appears highly related 
to Predicted Factor I (Analytic Ability) with 
"w = 90 and .93 (p< .001) for four- and 
eight-factor solutions. Two tests that were 
not predicted as loading this factor, Arith- 
iv and Form-Recognition, significantly 
oad it in the four-factor solution. Form- 
€cognition also loads the eight-factor 
Solution, 
F Factor 2 is most highly related to Predicted 
ane V (Overcoming Distracting Con- 
ae II) with 7» =.71 and .64 (p < .005) 
ia the two solutions. Three of the four dis- 
necting contexts tests (DCT I, IB, Arith- 
an Operations) load this factor in both 
alyses. The fourth distraction measure, 
tic T ITA, loads only the four-factor solu- 
nem Tn addition, Digit Symbol and Cancel- 
anes significantly load Factor 2 in both 
‘ons, contrary to predictions. , 
(94 tained Factor 3 relates most highly 
4 and 85; p< .001) to Predicted Factor 
(Verbal Comprehension), loading WAIS 
fee ilary and Comprehension. No other 
aa Were expected to load Factor II; how- 
R P low negative loadings on Factor 3 of 
taneg \_ 33) and BAT (—.27) were ob- 
ed in the four-factor solution. 
vegntained Factor 4 relates significantly to 
on icted Factor III (Memory or Attention- 
TA centration), with r» = -85 (Ż < .001) 
tic ae solution. This factor loads Arithme- 
as irs igit Span, and Arithmetic Operations, 
Pac etal In addition, Digit Symbol loads 
Fact 4 in each solution. : : 
the ., C'S 5 through 8 were obtained only in 
high ht-factor solution. Factor 5 is most 
(Dey related to Predicted Factor VIII 
(be II Equipment), with 7» = .63 
on tyros): Loading of DCT ITA and IIB 
the = factor was predicted; however, only 
R rmer occurs. 
( ot 6 matches Predicted Factor VI 
With ©O™ing Embeddedness versus Time), 
al] ee = .89 (p< .001). This factor loads 
eddedness tests that are timed. 
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Factor 7, which loads Digit Symbol, 
Cancellation, and Arithmetic Operations, ap- 
pears to correspond most closely to Predicted 
Factor VII (Performance Speed) which only 
postulated loading of Digit Symbol. For this 
comparison, a point-biserial coefficient of 
correlation is not appropriate as only one 
test was predicted as loading Factor VII. As 
a rough estimate of relationship between 
Predicted Factor VII and each of the ob- 
tained factors, loadings of Digit Symbol on 
the latter were converted to standard scores 
(based upon the distribution of loadings on 
each factor). The highest standard score for 
Digit Symbol occurred on Obtained Factor 7; 
which was on this basis matched with Pre- 
dicted Factor VII.° 

Factor 8 related significantly (rm = .48, 
p < 05) only to Predicted Factor II; how- 
ever, the low level of this relationship and 
the nature of the three tests loading this 
factor (DCT I, Form-Recognition, and 
Comprehension) suggest that Factor 8 might 
best be viewed as a residual factor. 


Discussion 


The results thus provide support both for 
the general hypothesis and with regard to 
the character of four of the eight predicted 
factors (I, II, II, and V). Partial support 
may be found for two other predicted factors 
(VI and VID). Tests involving ability to 
overcome embeddedness clearly define Fac- 
tor 1 and suggest a high degree of overlap 
between measures of field dependence, WAIS 
“closure” measures, and measures of Adap- 
tive Flexibility based upon this common 
characteristic. 

Measures reflecting ability to overcome 
the effects of distracting contexts contribute 
to the definition of Factor 2. In addition, 
loading of Digit Symbol and Cancellation is 
substantial. This might suggest either of two 
kinds of possibilities, that Digit Symbol and 
Cancellation involve distracting contexts or 
that this factor might tap other kinds of 
abilities than the overcoming of distraction. 
One might, on an ad hoc basis, interpret these 


5 If one were to assume a normal distribution of 
test loadings, loading of Digit Symbol of .41 or 
greater on Factor 7 could be expected to occur by 
chance alone in 1 out of 50 cases. 
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tests as involving routine clerical performance 
where items that must be found and ma- 
nipulated are located within a matrix of 
generally similar irrelevant items. For Can- 
cellation, the irrelevant (and thus, by impli- 
cation, potentially distracting) background 
is provided by the randomly arranged matrix 
of letters among which the critical letters 
must be found. Digit Symbol involves, at 
least for the period of time until symbol- 
number pairs are learned, the successive 
matching of a series of test numbers with the 
same numbers in the legend. The legend 
provides an orderly matrix of similar items 
(number-symbol pairs), the correct one of 
which must be selected for each test item. 
Thus, one might consider Factor 2 to repre- 
sent ability to rapidly select critical items 
from a matrix of similar irrelevant items. 
In this view, the degree to which one is 
distracted by the irrelevant items might 
affect his speed of doing the task. 

Factors 3 and 4 closely match the Verbal 
Comprehension and Memory factors obtained 
by many investigators using one or another 
of the Wechsler scales. Loading of Digit 
Symbol on Factor 4 was not predicted, on 
the basis of results by Cohen (1957, 
1959) with the WISC and WAIS; however, 
such loading agrees with the results of 
Goodenough and Karp (1961), using the 
WISC with children, where Coding (the 
WISC version of Digit Symbol) loads their 
Attention-Concentration factor. 

The choice of a name for Factor 4 has 
been a subject of some discussion in the 
literature, the most frequently used choices 
involving either short-range memory (as 
memory for digits over an interval of a few 
seconds) or some combination of attention- 
distraction-concentration. Yet, the alternative 
interpretations may not be very far apart. 
Undoubtedly, tests such as WAIS Arithmetic 
and Digit Span involve memory. However, 
it also seems reasonable to consider inability 
to attend as one of the most likely reasons 
for failure of short-range memory. The 

present investigator would favor representa- 
tion of both interpretations in the name 
(eg., Memory-Attention ) until such time 


as further evidence suggests a review of this 


issue. 
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Factor 5 loads only DCT IIA, an outcome 
neither predicted in advance nor clear at 
this time. 

Factor 6 (Overcoming Embeddedness 
Against Time) loads all timed embeddedness 
tests as predicted. However, those tests 
which provide the least speed pressure 
(Match and Insight Problems) are most 
highly loaded on this factor, whereas the 
remaining embeddedness tests (EFT, Object 
Assembly, Block Design) have relatively 
small loadings. On this basis one, migh 
consider an alternative interpretation © 
this factor. 

Both Match and Insight Problems define 
the Guilford et al. (1957) Adaptive Flexi- 
bility factor which, as has been suggeste' 
above, may be highly similar to Analytic 
Ability. In fact, the presence of these tests 
on Factor 1 would support this view. How 
ever, it is also possible that these tests be 
by implication, the Adaptive Flexibility 
factor involve other kinds of abilities tha? 
overcoming embeddedness. One could, i 
example, postulate a kind of persistence- ii 4 
a-difficult-task ability or a speed-at-solving 
complex-problems ability to account 
Factor 6 variance. Such abilities might A 
required to a greater degree by Match EE 
Insight Problems than by the other po 
which load Factor 6. To evaluate such ae 
potheses it would be necessary tO inclu a 
in a single study, measures of persistence? 
problem solving ability (or any other Pore 
lated ability) which do not also inv? 
overcoming embeddedness. 

Factor 7 (Performance Speed) loads 
cellation, Arithmetic Operations, an¢ 
Symbol. The presence of Cancellation i 
this factor would tend to emphasize 
routine clerical performance aspect © ple, 
loading this factor as opposed, for examP | 
to the distraction and memory aspects ol 
Arithmetic Operations and Digit py fk 
(which have been accounted for in intetP 
tation of other factors which loa 
tests). The presence on this factor © 


sug- 
tests that are not subtests of the WATIS “pol 


tests 


gests that this may be the ni (1957; 
i i ohen 
specific factor obtained by WAIS an 


1959) in his analyses of the 
WISC. 
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There would also seem to be overlap be- 
tween Factor 7 and a Perceptual Speed 
factor identified in several studies. Davis 
(1956) and Pemberton (1952) describe this 
factor as involving speed of identification of 
a given stimulus within a mass of distractors, 
where the task is a simple one and where 
the distractors do not serve to distort or 
conceal the stimulus, as through embedded- 
ness. This description would seem appropriate 
for two of the three tests which define Fac- 
tor 7, Digit Symbol and Cancellation. It is 
not as relevant for Arithmetic Operations, 
Where rapid location of the critical items 
(arithmetic problems) would seem to repre- 
Sent a relatively minor aspect of the task. 
Success at Arithmetic Operations would 
appear to involve ability to attend to the 
Problems to be solved in the face of distrac- 
tion, rather than ability to locate the prob- 
ems among a matrix of irrelevant stimuli. 

ere, the irrelevant materials might be seen 
as “competing” with the critical problems 
a the attention of the subject. It is possible 
© advance a similar kind of interpretation 
je the presence of Digit Symbol and Cancel- 
ion on this factor, if we view the tasks 
volved as boring or tiresome because of 
i. repetitiousness. Here, other kinds of 
eee which have not been intentionally 
si troduced into the test situation (e.g., Out- 
ject) sounds, ideas which occur to the sub- 
atte may compete with the task for the 
ee of the subject. He might be 
ee not because the distractors are 
ye enticing, but because the critical 
F 1s so uninteresting. 
laa on 8 corresponds to no predicted 
actor and would appear to be a residual 
Riss a Similarly, Predicted Factor IV does 

relate highly to any obtained factor. 

Onsidering all seven interpretable factors 
tote in the present study, it may be 
View ir four of these factors have been 
Con, ed as involving, in some way, attention, 

Centration, or distraction. Factor 2 is seen 

Teflecting ability to overcome the effects 
4 alistracting (perceptual) contexts, Factor 

ure volving attention as an aid to short- 
Varia memory, Factor 5 as reflecting specific 
ang ce of Distracting Contexts Test ITA, 

Factor 7 as loading measures of Per- 
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formance Speed, which would appear to 
require some freedom from distraction. One 
might, in addition, consider Factor 6, in- 
volving all timed embeddedness tests, as 
potentially influenced by distraction. Yet, 
each of these factors appears quite distinct 
and is not highly correlated with any other 
factor. If nothing else, the presence of so 
many factors which directly or indirectly 
may be affected by some form of distraction 
would suggest that there may be no con- 
sistent kind of ability to attend or resist 
distraction which cuts across a great variety 
of situations and sources of distraction. This 
possibility is even more compelling when one 
considers that the college students of the 
present sample are, for the most part, above 
average in general intelligence and have 
demonstrated at least moderate scholastic. 
success, which would seem to entail some 
kind of ability to attend and concentrate. 

In general, the results would favor the 
position of Witkin and his colleagues, that 
measures of field dependence involve ability 
to overcome the effects of embedding con- 
texts and that other kinds of cognitive tasks 
involving this ability load the same factor 
as the measures of field dependence. The 
results further suggest that ability to over- 
come embeddedness is factorily different 
from ability to resist distraction, although 
some correlation occurs between factors 


representing these abilities. 


REFERENCES 


Borzum, W. A. A factorial study of the reasoning 
and closure factors. Psychometrika, 1951, 16, 361- 
386. 

Carrer, R. B. Factor analysis: An introduction and 
manual for the psychologist and social scientist. 
New York: Harper, 1952. 

Couen, J. The factorial structure of the WAIS be- 
tween early adulthood and old age. J. consult. 
Psychol, 1957, 21, 283-290. 

Conen, J. The factorial structure of the WISC at 
ages 7-6, 10-6, and 13-6. J. consult. Psychol., 1959, 
23, 285-299. 

Davis, P. C. A factor analysis of the Wechsler- 


Bellevue Scale. Educ. psychol. Measmt., 1956, 16, 


127-146. 
Garpner, R. W. Jackson, D. N., & Messick, S. J. 


Personality organization in cognitive controls and 
intellectual abilities. Psychol. Issues, 1960, 2 


(Whole No. 8). 


302 


GoopenoucH, D. R., & Karp, S. A. Field dependence 
and intellectual functioning. J. abnorm. soc. 
Psychol., 1961, 63, 241-246. 

Gumrorp, J. P., Frick, J. W., CHRISTENSEN, P. R; 
& Merrrrretp, P. R. A factor-analytical study of 
flexibility in thinking. Laboratory Report No. 18, 
1957, University of Southern California, Depart- 
ment of Psychology. 

Harman, H. H. Modern factor analysis. Chicago: 
Univer. Chicago Press, 1960. 

Horrzman, W. H. Tentative manual: The Holtzman 
Form-Recognition Test. Austin: University of 
Texas, 1955. (Mimeo) 

Horettinc, H. Analysis of a complex of statistical 
variables into principal components. J. educ. 
Psychol., 1933, 24, 417-441; 498-520. 

Jackson, D. N. Stability in resistance to field forces. 
Unpublished doctoral dissertation, Purdue Univer- 
sity, 1955. 

Karr, S. A. A factorial study of overcoming em- 
beddedness in perceptual and intellectual function- 
ing. Unpublished doctoral dissertation, New York 
University, 1962. 

Loerr, R. G. Embedding and distracting field con- 
texts as related to the field-dependence dimension. 
cada master’s thesis, Brooklyn College, 


STEPHEN A. Karp 


PEMBERTON, Carnot L. The closure factors related 
to other cognitive processes. Psychometrika, 1952, 
17, 267-288. 

Popett, J. E. Personality and stimulus factors in 
adult cognition: A developmental analysis of de- 
contextualization. Unpublished doctoral disserta- 
tion, Clark University, 1957. 

Popvett, J. E, & Pures, L. A developmental 
analysis of cognition as observed in dimensions 0 
Rorschach and objective test performance. J. Pers. 
1959, 27, 439-463. a 

THURSTONE, L. L. A factorial study of perception 
Chicago: Univer. Chicago Press, 1944. 

THURSTONE, L. L. Mechanical aptitude: IT. Analy- 
sis of group tests. Report No. 55, 1949, University 
of Chicago, Psychometric Laboratory. 

WECHSLER, D. Measurement and appraisal of adult 
intelligence. Baltimore: Williams & Wilkins, 1958. 

Wirxiy, H. A, Dyx, Rutm Bọ, Farerson, HANSA 
F., Goopenoucn, D. R, & Karp, S. A. Psycho- 
logical differentiation. New York: Wiley, 1962- 

Wirk, H. A., Lewis, Heren B., HERTZMAN, ©? 
MACHOVER, Karen, MEISSNER, PEARL B., 
Warner, S. Personality through perception. New 
York: Harper, 1954. 


(Received October 1, 1962) 


| Journal of Consulting Psychol 
1963, Vol. 27, No. 4, 303-309” 


COMPARATIVE EFFICIENCY OF SOME TESTS OF 


CEREBRAL DAMAGE * 


MAURICE KORMAN ann STANLEY BLUMBERG 


University of Texas Southwestern Medical School 


This study: (a) compared 40 Ss with cerebral damage with 40 matched 
nondamaged Ss on 4 commonly used psychometric instruments, as well as 
W-B Vocabulary and MMPI L Scale; and (b) investigated the relationship 
between these tests and 4 dimensions (laterality, severity, progressiveness, 
and diffuseness). Using artificial base rates, the rank order for “accuracy” was: 


associated with laterality (TMT, A 


Problems connected with the assessment of 
cerebral damage (CD) continue to be among 
the more vexing that the clinician faces in 
his diagnostic activities. Although the litera- 
ture is extensive, few studies, as Jenkins and 
Lykken (1957), and more recently Reitan 
ee), have remarked, deal with more than 
Bo test at a time. This prevents comparisons 

the relative discriminating power of various 
a tests in a situation where other param- 
a Ts, including base rates, are held con- 
peant. Furthermore, the independent variable 
tq, Most studies has too often been merely 
en damage” with no attempt made to 

X kateto relationships between various 

ee of cerebral damage and psycho- 

ric instruments. It can be argued that loss 

Peau variance can sometimes occur 
str gh the use of such overly global con- 

ucts (Korman, 1960). 
sem present study was undertaken as a 
EOR eiempi to explore simultaneously the 
i A ness of certain commonly used CD tests 

: Br eronp (CD versus non-CD) as well 
te tagroup (all CD) criterion situations. It 
= however, be pointed out that such a 

gn cannot, because of its artificiality, 


| 
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Memory for Designs Test (MFD), S 
Making Test (TMT), Bender Gestalt, Vocabulary, L. Combining the 3 


“best” tests increased accuracy slightly. 
minus B; MFD); severity (SAET); 


progressiveness (L; TMT, A minus B; age); diffuseness (age). 


piral Aftereffect Test (SAET), Trail 


Some variables were significantly 


yield results that have immediate applica- 
bility. 

We were also interested in testing the sug- 
gestion of Garrett, Price, and Deabler (1957) 
that hit rate might be increased by the 
joint use of the more valid tests. We chose 
the tests in question because of their rela- 
tively widespread use among working clini- 
cians. In some instances, furthermore, hy- 
potheses concerning relationships between 
such tests and various CD dimensions have 
been formulated. This is particularly true of 
the Trail Making Test (Reitan & Tarshes, 
1959), the Spiral Aftereffect Test (Sindberg, 
1961) and the Memory-for-Designs Test 
(Graham & Kendall, 1960). 

Finally, the intercorrelations among these 
tests were computed. Blau and Schaffer’s 
(1960) data suggest a moderate relationship 
among somewhat similar instruments. In 
addition the relationship of such variables as 
age, education, and vocabulary to CD meas- 
ures is of particular methodological im- 


portance. 
METHOD 


Subjects 

The CD group was composed of 52 patients from 
the in- and outpatient services of one county gen- 
eral hospital and two Veterans Administration 
hospitals. The only criteria for selection were that 
the patient have definite cerebral damage and that 
he be able to take the battery, of tests. For each 
patient, a rating sheet patterned after one developed 
by Reitan (1959) was completed by the staff or 
resident neurologist assigned to the case. This rating 
sheet provided information as to diagnosis, type 
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of lesion, degree of severity, whether focal and/or 
diffuse, localization, degree of progressiveness, and 
disease process. In addition, the number and kind 
of diagnostic techniques used (e.g., EEG, skull X ray, 
pneumoencephalogram, etc.) and the degree of 
confidence in the diagnosis were rated by the 
neurologist. The median number of diagnostic tools 
used with each patient was four. Degree of con- 
fidence in diagnosis was “good” or “excellent” in 
889% of the cases rated. The group was composed 
of 40 male and 12 female subjects, whose diagnoses 
fell into the following major categories: cerebral 
vascular disease, 16; tumor, 9; trauma, 7; epilepsy, 
7; degenerative or demyelinating disease, 6; cerebral 
vascular disease plus trauma, 2; inflammatory or 
infectious disease, 1; frontal lobotomy, 1; multiple 
cortical lesions, 1; trauma plus tumor, 1; and 
narcolepsy, 1. 

The distribution of patients according to the 
rated dimensions were as follows: cerebral laterality 
—18 left side, 16 right side, and 18 bilaterally 
damaged; severity—6 mild, 30 moderate, and 16 
severe; progressiveness—33 static and 16 progressive; 
diffuseness—12 focal, 23 diffuse, and 17 focal and 
diffuse. It should be noted that because of the 
neurologists’ uncertainty, 3 subjects were not rated 
on the static versus progressive dimension. 

Of the 52 CD patients, 40 were matched in pairs 
for age, education, and sex with a control group 
of non-brain-damaged patients from the in- and 
outpatient psychiatric and medical services of the 
same hospitals from which the CD patients were 
selected. These 40 CD subjects had a mean 
age of 45.37 with an SD of 15.80. Their mean 
years of education was 9.80, SD 3.20. Their distribu- 
tion by diagnosis was 12 cerebral vascular disease, 
8 tumor, 6 epilepsy, 4 trauma, 4 degenerative or 
demyelinating disease, 2 CVD and trauma, 1 tumor 
and trauma, 1 inflammatory disease, 1 lobotomy, 
and 1 multiple cortical lesions. The 40 control 
subjects had a mean age of 45.10, SD 15.45, and 
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mean education of 9.83, SD 2.53. Having been 
selected from the same hospitals and clinics, aS 
well as matched on age and education, the two 
groups were assumed to be of comparable pre- 
morbid intelligence. Twenty-four of the control 
subjects had psychiatric diagnoses as follows: 
schizophrenia, $; psychoneurosis, 6; alcoholism, 5; 
and personality disorder, 5. The remaining 16 sub- 
jects had a variety of medical diagnoses such as 
duodenal ulcer, pancreatitis, etc. There was no 
evidence of CD symptoms or diagnosis in the 
medical charts of the 40 control subjects. 


Tests 


Every subject was individually administered ad 
test in the battery. The order of presentation me 
staggered. Each test was administered and som 
in accordance with the standard instructions in t E 
test manual, except as noted below. The tests bi 
the Visual Motor Gestalt Test (Bender, 194 d 
scored according to the system proposed by PAS 
and Suttell (1950); the Trail Making Test (Reitan, 
1956), from which four separate raw time e 
were analyzed—Part A, Part B, Total, and A eae 
B; the Graham-Kendall Memory-for-Designs oe: 
(MFD) (Graham & Kendall, 1960), both unet 
rected and corrected scores; the Spiral Atert ‘ 
Test (SAET) (Psychological Research and Develo 
ment Corporation, 1958), with 8 trials presentes a 
ABBABBAA order using a 7-inch disk rotate I; 
a speed of 80 rpm; the L scale of the MMP! 
and the Wechsler-Bellevue I Vocabulary test. 


RESULTS AND DISCUSSION 
Cerebral Damaged versus Control Subjects 
of the 


e are 
ness 


Table 1 summarizes the performance 
CD and control groups, Although ther 
marked differences in the tests’ effectiv' 
as reflected in their overall hit rate, the 


TABLE 1 


Summary DATA COMPARING 40 CEREBRAL DAMAGED 


AND 40 NONDAMAGED SUBJECTS 


Non-CD CD 
cP’ 

T CP» ý CP" % 

M sD M SD Cuttingscore Qu p none? 
Age 45.10 15.45 45.37 15.80 
Education 9.83 2.53 9.80 3.20 00 
MFD, uncorrected 3.15 3.70 11.00 6.53 5-6 90.00 90.00 20o 
MFD. corrected 93 3.75 795 6.63 2-3 8250 7750 Soy 
SAET 7.50 1.05 5.15 2.23 7-8 g250 8500 Sey 
Trails B 87.30 52.95 179.95 102.60 107’-108"" 82.50 87.50 47,50 
Trails, Total 139.00 70.28 287.55 172.85 189-190" 8000 7250 80 
Trails A 51.70 26.85 107.60 79.05 71-72" 76.25 63.50 2000 
Bender 37.78 22.55 82.05 49.00 4445 73.75 77150 99.50 
Tails AB 35.60 45.95 72.35 64.05 63"-64"" 70.00 47.50 7750 
Vacabuley 20.93 7.80 15.90 7.36 1415 67.50 57.50 4000 
K 5.05 2.25 615 260 6-7 63.75 47.50 


a Correct predictions. 


Tests OF CEREBRAL DAMAGE 


Mean scores on all variables except age and 
education (on which the groups were 
matched) were significantly poorer ($ < .05 
or better) than the control group’s means. 
It cannot be emphasized too strongly that the 
Cutting scores reported in Table 1 must not 
be generalized to other populations and other 
problems. For the purpose of comparing the 
efficiency of these tests to one another, highly 
artificial base rates of .50 CD and .50 non-CD 
Were used. In clinical situations, where base 
Tates are typically asymmetrical, it is obvious 
that different cutting scores would be optimal, 
and that these would shift depending on the 
acceptable proportions of false negatives and 

false positives (see Meehl & Rosen, 1955). 
The most effective single test in correctly 
Predicting the presence or absence of CD was 
the MFD, uncorrected for age and vocab- 
ulary. Its efficiency was equally great in 
Predicting the absence as well as the presence 
Of CD, the hit rate being 907% in both in- 
ances, using a cutting score between 5 
and 6. These data compare fairly closely 
With the information given in the newest 
Manual for the MFD. Recomputing the 
Validation and cross-validation data of their 
ag 11 (Graham & Kendall, 1960, p. 162) 
ai ained a total hit rate of 84% using our 
ay ae score. However, if we use Graham 
Pa Kendall’s system of breaking down MFD 
ble into normal, borderline, and critical 
aas, only 32.5% of our CD group would 
ron telly. identified, while 2.5% of that 
Rup would be misclassified. Although we 
Ke in general agreement with Graham and 
Ndall’s conservative approach, which em- 
‘sizes interpretation of deviant scores 
Whee We wonder whether cutting scores some- 
asset between 5 and 11 would not have the 
With of lowering the false negative rate 
Our Out undue violence to base rate problems. 
experience with this test suggests that 
Fae scoring system likewise contributes 

Is difficulty. 

tene, corrected MFD scores (ie., the dif- 
tiore between subjects’ actual scores and 
dicap a ted on the basis of age and 
T a, did not do quite so well as the 
82 5o, ted scores, the overall hit rate being 
Partly | It seemed at first that this might 
Wwe, Y be due to the fact that our two groups 
already carefully matched for age and 
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education. Table 4, however, suggests an 
additional answer. In contrast to Graham 
and Kendall, we find a negligible correlation 
(r = — .09), between age and uncorrected 
MFD scores. Under these conditions correct- 
ing for age increases error variance. On the 
other hand, we find an appreciable correlation 
between MFD uncorrected scores and vocab- 
ulary (r = — .48), which drops only slightly 
upon correction (r = — .34). Aside from this 
negligible difference, perhaps MFD scores 
should not be corrected for vocabulary at all, 
inasmuch as vocabulary level may be less 
of an irrelevant variable than had been 
thought. These data seem to bolster the 
questions which Hunt (1952) has raised con- 
cerning the need for correcting MFD scores. 

The SAET also yielded an overall hit rate 
of 82.5%. For our sample it seemed to do 
slightly better with true positives than with 
true negatives. Because investigators have 
dealt with the SAET more as a technique 
than as a test, thus bypassing the problems 
of standardized administration and scoring, 
comparisons are difficult to make. The hit 
rates of six studies reported in recent years 
range from 67.5% to 96.4%, with a median 
of 87.2%. In the present study, we utilized 
the preliminary manual provided by the Psy- 
chological Research and Development Cor- 
poration, Tampa, Florida, the makers of the 
apparatus. Now that methodological studies 
such as that of Sindberg (1961) are being 
published, it is to be hoped that this tech- 
nique will take one more of the characteristics 
of a methodologically sound testing pro- 
cedure. The distribution of scores in our 
samples tends to corroborate Sindberg’s con- 
clusion that a longer series of trials increases 
between-group discrimination. The majority 
of studies (see Blau & Schaffer, 1960), it 
should be recalled, have used four or fewer 
trials. In contrast to the significant cor- 
relation between SAET and age of —.39 
reported by Goldberg and Smith (1958), we 
find a correlation of only .08. 

Analysis of the four different scores from 
the Trail Making Test indicates that Part B 
and Total Score differentiate the CD group 
from the controls better than do Part A or 
A minus B. As shown in Table 1, using 
optimum cutting scores, the total hit rate 
for Part B is 82.5%, for Total Score, 80%, 
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for Part A, 76.3%, and for A minus B, 70%. 
Our results are in general agreement with 
previous studies indicating that persons with 
brain damage tend to perform significantly 
more poorly on the Trail Making Test than 
do control subjects without brain damage 
(Armitage, 1946; Davids, Goldenberg, & 
Laufer, 1957; Reitan, 1955, 1958). An ex- 
ception to this generalization is the study of 
Brown, Casey, Fisch, and Neuringer (1958) 
which found CD and psychotic groups sig- 
nificantly different from normals, neurotics, 
and character disorders, but not significantly 
different from each other. In our study, 8 
control subjects had diagnoses of schizo- 
phrenia; the total control group was signifi- 
cantly superior to the CD group. In addition, 
separate analysis of the scores of the schizo- 
phrenic subjects revealed only two mis- 
classifications (false positives) on Part A, 
Part B, and Total Score of the Trail Making 
Test. 

A cutting score between 44 and 45 yielded 
a total hit rate of 73.8% on the Bender, 
with 77.5% of the CD group scoring 45 or 
above, and 70% of the controls scoring 44 
or below. The group mean scores of 82 and 
37.8 are comparable to those reported by 
Bowland and Deabler (1956). They obtained 
a mean of 80 from the CD subjects, 42.8 from 
the schizophrenics, 28.1 from neurotics, and 
16.4 from normals. Our total hit rate of 
73.8% is somewhat less than the 80% re- 
ported by Goldberg (1959). 

Vocabulary tests have not been typically 
evaluated as tests of cerebral damage. On 
the contrary, instruments built around the 
differential score technique have implied a 
fundamental constancy of vocabulary per- 
formance regardless of brain changes. Such 
recent data as that of Williams, Lubin, and 
Gieseking (1959) clearly suggest that this is 
not the case. Likewise, our CD group, which 
had been carefully matched for educational 
level, as well as age, with the non-CD group, 
nonetheless had significantly lower vocab- 
ulary scores. Whereas the control subjects 
had a mean vocabulary of 20.93, the CD 
subjects had a mean vocabulary of 15.90. 
A cutting score between 14 and 15 yielded 
correct predictions in 67.50% of all cases. 
However, the test’s capacity for correctly 
identifying nondamaged patients is consider- 
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TABLE 2 


PREDICTIONS OF CEREBRAL DAMAGE mie VARIOUS 
COMBINATIONS OF THE THREE “Best” TESTS 
(MFD, Traits B, SAET) 


Number of tests 


with “positive” CD subjects Non-CD subjects 


findings (N = 40) (N = 40) 
5 2 
3 25 2 
12 2 
i 2 10 
0 1 26 


ably higher than its efficiency for detecta 
CD. It would seem that tests which utiliz 
vocabulary scores as correction factors me 
unwittingly be blunting the sharpness of t E 
discriminatory power. It is interesting , 3 
note in Table 4 that vocabulary relates aa 
nificantly to all the measures of CD of 
study, the median r being .33. dis- 

The MMPI L scale has the lowest ia 
criminating power of any of our varia = 
Its total correct hits are only 63.759; A 
this at the price of over 50% false cee 
Its low discriminating power is not supin er 
although our results do lend some supp? a 
Doehring and Reitan’s (1960) notions idly! 
the CD patient’s need to maintain as 18 
acceptable a self-concept as he can. the 

Table 2 reports our attempt to test ting 
efficacy of the three most discrimina af 
tests that were also independent when vai 
jointly. The three measures are MFD, Ti 
B, and SAET. All of our subjects’ recom 
were examined for the number of eee 
dicating cerebral damage using the cu nats 
scores of Table 1. Table 2 indicates ae 
using a cutting score of two or noe the 
with “positive” results, 37 or 92.5% ° wile 
CD subjects are correctly classified, ate 
only four or 10% of the control subjects al 
inappropriately labeled damaged. ‘The | cre- 
hit rate of 91.75% represents a slight "° e 
ment over our most successful test, ified 
MFD, which by itself correctly cianal 
90% of our subjects. Because of the up? 
and more diverse sample of behavior uld 
which this index rests, however, it cross” 
probably show less shrinkage upon q o 
validation than selection ratios base 
single tests. 


TESTS OF CEREBRAL DAMAGE 


Dimensions of Cerebral Damage 


One-way analyses of variance, or é tests 
when appropriate, were run on all the vari- 
ables of this study including age and educa- 
tion with the CD subjects categorized with rel- 
evance to cerebral laterality, severity, progres- 
siveness, and diffuseness. In cases where the 
results exceeded the .05 level of significance, 
degree of overlap and optimum cutting scores 
Were determined. The interrelationships of 
these four criterion dimensions were investi- 
gated by means of chi square. In only one 
Mstance did the results approach significance. 

‘atings of diffuseness were related to 
bilaterality (p < .10). 

ree measures significantly differentiated 
the CD subjects according to laterality of 
damage. As indicated in Table 3, a cutting 
Score between 59 and 60 seconds on the 

tail Making A minus B score yielded a 
total correct classification of 74% of the 

2 brain damaged patients. Those patients 
With bilateral damage performed similarly 
° those with left-sided damage on the Trail 
beens Test. Seventy-two percent of the 
laterally damaged patients and 72% of 
he left-side damaged patients had A minus 
Rog ores of 60 seconds or more, whereas 
ad 7 of the right-side damaged patients had 

Ores of 59 seconds or less. 

here are both similarities and differences 
“tween our results and those obtained by 
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Reitan and Tarshes (1959) in the only other 
published study relating Trail Making Test 
scores to laterality of brain lesions. Using 
converted T scores instead of raw time scores, 
Reitan and Tarshes obtained a 75% hit rate 
for left-side patients (compared to our 72%) 
and 72% hits for right-side subjects (com- 
pared to our 79%). In both studies the mean 
difference scores between the left- and right- 
side damaged patients were significantly 
different, with the left-side damaged patients 
performing worse on Part B in relation to 
Part A. In contrast to the similarity be- 
tween the bilaterally damaged group and 
the left-side group in our study, Reitan 
and Tarshes (1959) found a significant dif- 
ference between subjects with bilateral and 
those with left-side damage, but no significant 
difference between bilaterally and right-side 
damaged patients. 

The MFD scores significantly differentiated 
laterality among the brain-damaged patients 
with higher hit rates for the bilaterally 
damaged patients than for those with either 
left- or right-side damage. As shown in Table 
3, the optimum cutting scores result in 74% _ 
and 70% total correct classification for MFD 
uncorrected and corrected scores, respectively. 
Graham and Kendall (1960) hypothesized 
that the MFD test would be especially dif- 
ficult for patients with lesions of the right 
hemisphere, but found that the test did not 


TABLE 3 


SIGNIFICANT RELATIONSHIPS BETWE! 


EN VARIABLES AND DIMENSION: 


S OF CEREBRAL DAMAGE 


an 
Dimensions and tests Cutting score % Correct predictions 
Laterality 
1. Trails ” . 14% (Lt 72%} Rt 19%; Bil 72%) 
alls A-B 59/-6 Total: 74% (Lt 72%; pe 
S MFD, uncorrected 9-10 Total: 74% (Lt 67%; Rt 64%; a a 
+ MTD, corrected 5-6 Total: 70% (Lt 61%; Rt 64%; Bi “0 
Severity 
l SAET 5-6 Total: 67% (Severe, 69%; Moderate, 67%) 
Stat; ú 
TiC Progressive oh ner 
ay : Static, 91%; Progressive, 387% 
x 4 Total: 74% (Static, o3 Y 4 
3 Trails A-B 557-56" Total: 67% (Static, 70%; Progressive, BE 
Š. Age “46-47 Total: 59% (Static, 64%; Progressive, 9/0 
fuseness i 
L Age 52-53 Total: 73% (Focal, 83%; Diffuse, 70%; Focal and Diffuse, 70%) 


om 5 
Mild cases were excluded because of small N (N = 6). 
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differentiate right- from left-side damaged 
patients. Our bilaterally damaged patients 
and those with lesions on the left side per- 
formed worse (had higher scores) than did 
those with right-sided damage. However, our 
hit rates in identifying left- and right-sided 
lesions are not much above chance (in the 
low to middle 60s), while the rate for cor- 
rectly identifying bilaterally damaged pa- 
tients is 89% and 83% for uncorrected and 
corrected scores respectively. The differences 
between the three CD groups classified as 
to laterality cannot be attributed to age or 
education. 

The dimension of severity, rated in terms 
of extent of disability, was discriminated 
only by the SAET. Because only 6 mild cases 
were available, they were excluded from 
the analysis. The 16 severe cases tended to 
report seeing the aftereffect less frequently 
than did the 30 moderate cases. With a 
cutting score between 5 and 6, correct 
predictions could be made with nearly the 
same frequency for both of these groups, 
yielding an overall hit rate of 67%. Although 
the separation is only moderately accurate, 
these data corroborate the report by Sindberg 
suggesting that while location of damage is 
unimportant with reference to SAET per- 
formance, “extensive or severe damage to any 
part of the cortex markedly reduces the 
probability of SAET report [Sindberg, 1961, 
p. 136].” 

The neurologist’s rating of the progres- 
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siveness of the disease process was dichoto- 
mized into static versus progressive. The 33 
patients with static lesions tended to get 
higher scores on the Z scale and on Trails 
A minus B, and to be older. As Table 3 
indicates, the L scale and Trails A minus 
identified correctly 74% and 67% of the 
total sample, respectively, while age identi- 
fied only 59% of the subjects correctly. This 
dimension of progressiveness is confounded 
with disease process. For example, of 15 pa 
tients with cerebral vascular disease, 14 were 
rated as static; all 9 tumor cases were 
rated progressive; and all of the trauma cases 
were rated static. The 6 cases of degenerative 
or demyelinating disease were evenly divide 
between static and progressive. , 

Only age distinguished focal from diffuse 
lesions with the diffuse group showing K 
tendency to be older than either of the othe 
two groups. A cutting score between 52 a 
53 correctly identified 73% of the total © : 
group. All of the subjects with degenerativ 
or demyelinating disease and most of tho 
with cerebral vascular disease were rated E 
diffuse. The relationship of age to difran 
apparently reflects the propensity of ol si 
people to suffer from disorders of a degene™ 
tive or arteriosclerotic type which tend 
lack a clear focus. 

Table 4 shows the magnitude of th 
relations among the variables of this stl a 
based on an W of 80 (40 CD and 40 n0 
damaged subjects). 


e cor 


TABLE 4 


INTERCORRELATION MATRIX 


MFD 
un- MFD 
Trails Trails Trails, Trails cor- cor- Vocab- Educa- ë 
SAET A B Total A-B Bender rected rected ulary tion AS 

$ —AS 40 10 11 06 05 38 06 23 —01 T 
SAET 36 37 40 20 35 40 39 32: =02 ‘ot 
Trails A 78 92 A3 05 at 30 —.30 —.33 Z2 
Trails B 96 72 33 43 39 —38 —25 9g 
Trails, Total -50 38 43 37 42-30 eg 
Trails A-B 05 27 28 —.28 —03 gg 
Bender -60 Ai. —.30 —.10 ~ 09 
MED, uncorrected 96 —48 = —.30 =a 
MFD, corrected = ell At 
Vocabulary a 01 


Education 


Note.—.22 significant at .05 level, .28 significant at .01 level, 


Tests OF CEREBRAL DAMAGE 


Of the 45 correlations among the 10 test 
= analyzed (including Vocabulary and 
au 32 are significant at the .01 level. The 

ree most promising independent measures 
aa uncorrected, SAET, and Trails B) 
aa a mean intercorrelation of .40. Al- 
ae these variables seem to share to some 
ee a common underlying factor it 1s 

» from the fact that only some 16% of 
tay | cosa is accounted for, that they tap 
Dis ively discrete abilities, even though they 
fhe me equally effective in discriminating 

T from the non-CD subjects. 
ane mean correlation between vocabulary 
c he other nine measures is 34. This 
Siow agg seems to reflect the fact, discussed 
Ge that vocabulary apparently decreases 
ae me extent with cerebral damage. By 
io aan education is less closely related 
aS ig messitres of this study (mean 7 = 17), 
Eileen (mean r = 12 y. The latter, in par- 
quite t, is a surprising finding; it reoccurs 

$ consistently, however, as witnessed by 
is si act that only one out of 9 correlations 

Aitant at the .05 level. 
hee the dimensions of brain damage 
eg Igated here are undoubtedly too gross 
ica all patients with lesions in the left 
eras regardless of the lobe involved 
ilar bly cannot be expected to perform sim- 

SA a beginning was made in relating 

© commonly used tests of cerebral damage 
eten mensions somewhat less global and 

geneous than brain damage. 
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PATIENTS AND THERAPISTS ASSESS THE 
SAME PSYCHOTHERAPY * 


HERMAN FEIFEL anp JANET EELLS 


Veterans Administration Outpatient Clinic, Los Angeles 


This study analyzed the perceptions of both patients and their psychotherapists 
at the close of psychotherapy as to changes taking place and ideas about 
what was helpful and not helpful. The participants consisted of 63 out- 
patients and 28 psychotherapists. Data were secured by administering an 
open-end questionnaire. Major findings and conclusions were: (a) therapists 
stressed changes in symptomatic relief and improvement in social relationships, 
whereas patients focused on self-understanding and self-confidence; (b) 


patients underlined the opportunity to talk over problems and the “human 


” 


characteristics of the psychotherapist as helpful, and therapists highlighted 
therapeutic technique and support to the patient as most beneficial; (c) 
expectancy and conceptual disparities about therapy between patient and 
therapist should be minded to maximize treatment benefits. 


Despite the contribution of various in- 
vestigators (among others, Dollard & Mowrer, 
1953; Fiedler, 1953; Heine, 1953; Leary & 
Gill, 1959; Robbins & Wallerstein, 1959; 
Rogers, 1961; Strupp, 1960), agreement 
concerning process aspects and recognizable 
and predictable effects of psychotherapy is 
still lacking. Evaluation of psychotherapy 
is undoubtedly a complex and difficult area of 
research. Numerous approaches have been 
attempted (Rubinstein & Parloff, 1959; Zax 
& Klein, 1960), yet the authors have been 
impressed by the meager heed given to the 
patient’s viewpoint of what happens in psy- 
chotherapy and its influence on him. Most 
reported studies have ruddered psychothera- 
peutic understanding and explanation mainly 
to the therapist’s grasp. 

The direct, phenomenological path (Blaine & 
McArthur, 1958; Board, 1959; Chance, 1959; 
Lipkin, 1948) has been treated with some 
success notwithstanding a number of ob- 
stacles, i.e., (a) neither patient nor therapist 
can be a completely unbiased observer, each 
varying in what he can and will report, (b) 
the mutual desire for success, wish to please 
the therapist, and the “hello-goodbye” effect 
of Hathaway (1948), may unwittingly affect 
responses, (c) limited consonance may exist 
between what either patient or therapist 
states about occurring changes and actual 


1A condensed version of this paper was pre- 
sented at the 1962 American Psychological Associa- 
tion annual convention in St. Louis. 


behavior noted by others, and (@) both 
patient and therapist may be unaware A 
what facets of therapy are actually “a 
countable for improvement and attribu 
success to irrelevant factors, overlooking 
those having meaningful and lasting effects- a 

Some of these impediments can be p% g 
tially overcome for the patient by extendin 
the distance between therapist and ran A 
operation in his thinking, e.g., assuring ing 
that replies will be kept confidential, getti”® 
the questionnaire administered by an 
turned to someone other than the thera 
and emphasizing the impartial nature O a 
investigation. Additionally, follow uP pi 
later time may yield data more liberated frog 
immediate “transference” effects, albeit nei 
ject to possible memory distortions of t 
own. 

Our view is, despite the stumbling blo 
marring this way of advance, that hal in 
volved parties in psychotherapy are st ith 
the most favored position to provide US Jace 
promising leads concerning what takes Pe 
To maximize comprehension of the oe ye 
peutic experience, harmful as well as POS f 
aspects should be examined. Undesi" neg 
features of psychotherapy have been 3 
lected in most investigations. Avoidane iy. 
these is understandable. It appears vo 
while, however, to hypothesize that C? hers 
features of therapy may be useful and of jn 
not, and to look for evidence of b° 
assessing therapeutic functioning. 
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PATIENTS AND THERAPISTS ASSESS PSYCHOTHERAPY 


The present study was designed to serve 
as the exploratory phase of a research pro- 
gram aimed at probing helpful and nonhelp- 
ful aspects of psychoanalytically oriented 
Psychotherapy as practiced in a Veterans 
Administration Mental Hygiene Clinic. The 
ìnitial phase accounted for here analyzes 
the perceptions of both patients and thera- 
Pists at the close of a mutually engaged-in 
Psychotherapy along these lines. The thera- 
ists’ abilities in predicting responses of their 
‘Spee to questions about the therapy are 
also appraised. Additionally, stability of the 
Patients’ judgments concerning their therapy 


1S evaluated after the passage of a number of 
Years, 


METHOD 


0 Faoice of an instrument to measure the reactions 
ion, S. participants was that of an open-end ques- 
inaire. While other alternatives such as check- 
RoR ane scales, Q sort self-descriptions, etc., can 
ae ute useful data, the consideration was that 
ere piul clues might be overlooked if responses 
ators imited to variables occurring to the investi- 
rs, Further, there was the estimation that it 
cult to fully convey exact feelings and thoughts 
i sh checking or sorting a series of statements. 
eng OCS not minimize disadvantages which open- 
(a) _diestionnaires themselves possess, notably, 
Bitte oe may be difficult to categorize ac- 
imports, because of complexity or obscurity, (b) 
sig ant variables may be ignored through over- 
Persone. Pressing concern with other areas, (c) 
int May assess the experience at varying levels 
€rpretation or inference making response 
Parison difficult, and (d) transposition of rich 
Complex data obtained in open-end inquiries 
“EY Se asia categories is bound to entail a 
uas of oversimplification and distortion 
at this E Nevertheless, the judgment was that 
Westionn, typothesis seeking point the open-end 
Oses in ae technique could serve heuristic pur- 
therapy, xtending insights about what happens ìn 


throu 


Ger cia questionnaires were administered to 
dividu. ie who had’ terminated a course of in- 
Vie Sn, Psychotherapy involving at least 10 inter- 
Clini, 2 & Veterans Administration Mental Hygiene 
majora ne 1957-58. They represented the vast 
Clinic sa „Of patients completing therapy in the 
of sees. this time period irrespective of number 
Siong ions, The criterion of a minimum of 10 ses- 
the es set up to maximize meaningful replies to 

estions posed. The questionnaire asked: 


l, 
What changes, if any, do you feel occurred 
2. Whee perange of your therapy? , 
about the th itself did you fin 
helpful? the therapy yi 
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3. What about the therapy itself did you find 
not helpful, or even set you back? 

4. Have you any suggestions for ways in which 
your therapy might have been made more 
helpful? 


At the same time similar questionnaires were 
given to tħe 28 therapists who had these patients 
in therapy. The therapists included 12 psychiatrists, 
12 clinical psychologists, and 4 psychiatric social 
workers functioning in a predominantly psycho- 
analytically oriented manner. They were asked to 
answer the first three questions and, additionally, 
to predict how their patients would answer these. 

Sixty-three (73%) of the patients completed 
their questionnaires, while therapist responses were 
obtained from all 28 (100%). Background informa- 
tion concerning age, education, occupation, diagnosis, 
reason for termination, and a number of other 
factors about the patients were obtained from the 
clinic files and therapists. 

An effort was made to reckon the extent to 
which the 73% return on patient questionnaires 
might reflect a biased sample afiecting the results. 
Fortunately, therapist responses as well as case- 
history data were available on the 23 patients who 
had not returned their questionnaires. Replies of 
the therapists concerning amount of improvement 
judged to have occurred, types of change reported, 
number of therapy sessions, diagnosis, reason for 
termination of therapy, and factors such as age and 
sex were compared for those patients who had 
returned their questionnaires with those who had 
not. No significant differences were found between 
the two groups and it was tentatively concluded 
that the 739% return sample of patients was not 
biased in any known way. 

Approximately 4 years after the original question- 
naire had been sent out, a follow-up letter was sent 
to 61 of the original 63 participating patients (2 
patients, in this interim, had returned to psycho- 
therapy with their original therapists and it was 
deemed advisable to exclude them) requesting they 
again answer the questionnaire on the basis of how 


they now felt about their past therapy. Forty-five 
(74%) of the patients completed this follow-up 
questionnaire. This is a substantial feedback con- 
sidering that 1 patient had died and seven 
questionnaires were returned because the patients 


could not be located. This means that only 8 
patients, who apparently received their question- 
naires, did not contribute to the follow-up study. 
Here, too, no reliable differences were 1n evidence 
with regard to diagnosis, number of sessions, previ- 
ous report of satisfaction by patient, therapist 
judgment of degree of improvement, etc., between 
the 45 patients responding to the follow-up question- 
naire and the 16 who did not. 


Subjects 

Of the 63 patients comprising the basic sample of 
this study, 58 were male and 5 female. Their ages 
ranged from 24 to 57 with a mean of 37.5 years. 
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The number of years of schooling varied from 
4 to 18 with a mean of 14.0 years indicating 
the general educational level of the group to be 
considerably above that of the general population 
of veterans. Forty percent of the group were 
college graduates. Occupationwise they were less 
outstanding, with 43% either unemployed or work- 
ing at unskilled or semiskilled trades. Ten percent 
were diagnosed as psychotic, 36% as psychoneu- 
rotic, 35% as character disorder, and 19% as 
psychosomatic or psychophysiological reaction. In 
78% of the cases therapy had been terminated by 
mutual agreement between patient and therapist, in 
9% the therapist had advised termination, while 
in 13% the initiative had been taken by the patient. 
Seventy-one percent were married and an ad- 
ditional 13% had been married at some time. 

Concerning frequency of therapy sessions, 67% 
had been seen on a weekly basis, 24% more often, 
and 9% less often. The number of cumulative inter- 
views for the patients varied from 10 to 324, with 
42% having been seen from 10 to 50 sessions, 21% 
from 51 to 100 sessions, and 37% for more than 
100 sessions. Forty-eight percent were seen by 
psychiatrists, 38% by psychologists, and 14% by 
psychiatric social workers. About two-thirds of the 
patients were treated by regular staff members 
with a minimum of 4 years’ therapeutic experi- 
ence. The other third was seen by either psychiatric 
residents or fourth-year clinical psychology trainees. 
Approximately two-thirds were seen by male thera- 
pists and the rest by female therapists. 


Classification of Responses 


Responses to the questions by both patients and 
therapists were sorted into categories based on 
analysis of their content. Those dealing with 
changes were adequately classified into 19 rubrics, 
those with helpful factors into 18 rubrics, and those 
covering nonhelpful aspects into 17 rubrics.? It was 
observed, nevertheless, that a number of the rubrics 
culled out still tended to overlap and be somewhat 
correlative. Although face reading suggested further 
combinations, we decided to keep these separate 
because of their individual citation. However, in 
order to obtain better comparative perspective, the 
response rubics were further organized into major 
area categories. 

Reliability of categorization was checked by 
having another investigator make an independent 
classification of a random third of the responses, 
using the same rubrics, Since the questions were 
open-end with no limit to the number of changes, 
etc, which a person might report, percentage of 
agreement was based on the total number of classifi- 


2 These tables have been deposited with the 
American Documentation Institute. Order Docu- 
ment No. 7455 from ADI Auxiliary Publication 
Project, Photoduplication Service, Library of Con- 
gress, Washington, D. C. Remit in advance $1.25 
for microfilm or $1.25 for photocopies and make 
checks payable to: Chief, Photoduplication Service, 


Library of Congress. 
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cations assigned. The two investigators assigned 
identical category numbers to 79% of the responses. 
In view of the complex and often murky nature o 
some of the replies, this was gauged to represent an 
acceptable level of reliability of categorization. 


RESULTS 
Changes Occurring in Therapy 


In response to the first question “What 
changes, if any, do you feel occurred 10 
you because of your therapy?” 90% of the 
patients mention one or more changes for the 
better. The therapists of these patients also 
report considerable change, 84% indicating 
more than one beneficial change. Data on the 
therapists’ determinations concerning the con: 
dition of these patients at the close 9 
therapy recorded at a different time for 
other purposes showed that 62% of the pa 
tients were judged moderately or much 34% 
proved, 30% slightly improved, and 8 f 
unimproved or worse. Even keeping 19 at 
the probable bias toward proclaiming ie 
able results, these figures imply that ae 
patients and therapists felt that therapy ký 
generally helpful. 2 

Changes acai most often by the pä; 
tients are “more realistic evaluation of E 
(52%), “more self-confidence” (35%), # 


“more ability to handle promien. Cin, 
i i ist 
Therapists of these patients 1 uredu 


realistic evaluation of self” (39%), 
tion in neurotic fears” (38%), and on 
self-confidence” (35%), as the most cint aj 
changes. Significant differences i iie 
between the groups appear concerning fe 
proved social relationships” (chi sd¥ 
10.42) and “reduction in neurotic 


more 


H 
fears’ 


(chi square 16.58), with the therapists men 
tioning these changes more frequently: 
When the various rubric changes pose 
grouped into major areas (Table 1); at 
involving insight, followed closely bY the 
titude and behavioral alterations, 1ea° pe- 
patients’ list, whereas those relating ve era 
havioral and symptom relief head the p m 
pists’ entries. Both behavioral and symp ly 
relief changes are mentioned significa pa 
more often by the therapists than me rot” 
tients. They, seemingly, note change: cti 
ing improved social relations and redu“ pe 
in self-defeating behaviors more than ib 


patients who emphasize intrapsychic oF 


s mit 
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TABLE 1 


RELATIVE FREQUENCY OF MAJOR AREA CHANGES 
OCCURRING IN THERAPY 


Patients Therapists 

Area of change % Rank % Rank 
y Insight 52 1 37 4 
> Attitude 48 2 38 3 
i Behavioral 47* 3 71* 1 
` Symptom relief  25** 4 57* 2 
» Other 10 5 8 5 


Ji 
ren ote. — Percentages represent percentage of cases giving a 
* ae classibable in this general area. 


*b gio. 


op feeling changes. One should be alert 
si the Possibility that these may reflect dis- 
Milarities in conceptualization rather than 


8 E ] 
map dissonances over actual changes taking 
ce, 


Helpful Aspects of Therapy 


t a answer to the question “What about the 
wardby itself did you find helpful?” the 
ah ortunity to talk over problems with 
er is placed high by both patients 
thar) and therapists (19%) as is “car- 
belie outlet for tensions, help in expressing 

is ngs,” with 19% in each group specifying 
iste et of therapy as advantageous. Both 
isties S also stress “attitudes and character- 

tay of the therapist” (167) as a positive 
toi ee feature. The therapists, addi- 
of cries ehlight “support especially in time 
ationshi (19%) and the “transference re- 
te ee ips” (16%). These latter two items 
signin verbalized at all by the patients and 
the — differences exist between them and 
8.80) erapists (chi square 11.14; chi square 


hen the helpful aspects are classified 
Vey major areas (Table 2), those of talking 
telati one’s problems and the therapeutic 
the Uship (therapist as a person) head 
Rive replies of the patients. The therapists 
Dorty ocedence to therapeutic skills, Op- 
tional] ity to talk over problems, and emo- 
differ Support of the patient. Significant 
Which nces are evident in the frequency with 
Ani therapists regard therapeutic technique 
Cor, Satisfaction of emotional needs when 

Pared with the patients’ designation of 
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these facets. The therapists’ cognizance of 
technique is understandable. It may also be 
postulated that difficulties which patients 
have in recognizing and coping with their 
emotional wants (e.g., gratification of de- 
pendency’ needs) make them less discerning 
of such elements as being beneficial for them. 
Again, the issue arises whether these ob- 
served differences characterize variance in 
reference points more than opposing views. 


Nonhelpjul Aspects of Therapy 


In response to the question “What about 
the therapy itself did you find not helpful, 
or even set you back?” over one-third (37%) 
of the patients state they cannot think of 
anything which interfered with treatment. 
Among those particularized, the most fre- 
quently reported statements are “therapists” 
feelings of irritation, anger, boredom, etc.” 
(11%) and the “frequent change of thera- 
pist” (10%) to which they were subject. On 
the therapists’ side, 16% assert that “noth- 
ing” was detrimental to therapy. Their senti- 
ments about nonhelpful aspects underline 
mistiming in “interpretation and confronta- 
tion” (15%), termination problems (14%), 
and communication to the patient of negative 
personal feelings (14%). The only reliable 
difference between patients and therapists 
is the more frequent declaration by the pa- 
tients that “nothing” was nonhelpful about 
their therapy. 

Table 3, major nonhelpful areas of therapy, 
reflects essentially these findings. Therapist 


TABLE 2 


RELATIVE FREQUENCY or Major 
HELPFUL AREAS OF THERAPY 


Patients Therapists 
Areas of help % Rank % Rank 
1. Opportunity to talk 
over problems 37 i a 2 
2. Therapeutic relationship 
(therapist as a person) 25 2 30 4 
3. Therapeutic skill and E 3 jaf 
technique 3 
4. Administrative factors 14 4 10 5 
5. Satisfaction of emotional m 
needs (support ge 5 35 3 
6. Other 2 6 o 6 


ntages represent percentage of cases giving a 


Jote.—Percer s 
ie Lita this general area. 


response classifia! 
ep < 0l. 
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TABLE 3 
RELATIVE FREQUENCY OF Major 
NONHELPFUL AREAS OF THERAPY 
Patients Therapists 
Nonhelpful area % Rank % Rank 
1. None 37* 1 16* 3 
2. Therapeutic relationship 
(therapist as a person) 14 2 11 4 
3. Administrative factors 13 3 21 
4. Therapeutic skill and 
technique oe 4 34 
5. Other 5 5 8 5 
6. Absence of emotional 
support 2 6 5 6 


Note.—Percentages represent percentage of cases giving 
a response classifiable in this general area. 

*p< 05. 

p< O01. 
statements significantly point up damaging 
features related to professional functioning 
when compared with those of their patients. 


Also, therapists express significantly fewer 
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“nothing was not helpful about therapy” 
statements than do the patients. A word o 
explanation may be in order to clarify the 
number two ranking (21%) of administrative 
factors by the therapists. This incorporates 
instances where therapy was terminated t00 
soon not due to misjudgment on the part © 
the therapist but rather because of his leaving 
the clinic, the patient’s having to move to 
another town because of occupational reasons, 
etc. 


Suggestions 


Question 4 “Have you any suggestions ye 
ways in which your therapy might have bee 
made more helpful?” was asked only of t 
patients. Thirty-five percent had none, 
make. Of those volunteered, the pricha 
ones are less frequent change of therap a 
(10%), more therapy time via longer i 
view sessions or more frequent visits (109%) 


al 


TABLE 4 
ACCURACY OF THERAPISTS’ PREDICTIONS OF Major AREAS 


Number of Number of 
patients correct pre- % correct 
giving this dictions by predictions 
Areas response therapists by therapists 
Changes 
Insight 33 9 27 
Attitude 30 17 57 
Behavioral 30 18 60 
Symptom relief 16 10 63 
Other 6 2 33 
Helpful 
Opportunity to talk over problems 23 15 65 
Administrative factors 19 3 16 
Therapeutic relationship 
(therapist as a person) 16 8 50 
Therapeutic skill and technique 10 2 20 
Satisfaction of emotional needs 
(support) 4 0 0 
Other 3 0 0 
Nonhelpful 
None 23 5 » 22 
Administrative factors 9 2 22 
Therapeutic relationship 
(therapist as a person) 9 1 11 
Therapeutic skill and technique 6 1 17 
Other 3 0 a 
Absence of emotional support 1 0 0 


ae z gi 
Note.—Percentages represent percentage of correct predictions based on the number of responses in this category 


patients. 


TE 
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and more guidance and direction from the 
therapist (10%). Another 10% desire “more 
patience, interest, and warmth” from the 
therapist. 


Predictions of Therapists 


Mi able 4 presents the accuracy of the 
erapists in predicting how their patients 
would respond to the first three questions 
posed them. In the area of changes, their 
f Pications are best for symptom relief, 
plowed closely by those related to be- 
ae and attitude changes. These areas, 
the entally, are the major ones in which 
3 therapists felt therapeutic changes had 
ctually occurred. 
keno acerning acknowledgments by the pa- 
icta of what was helpful in therapy, pre- 
eed of the therapists are most satis- 
Bier ty in the areas of opportunity to talk 
ship Problems and the therapeutic relation- 
ue sidering the open-end nature of the 
ionnaire, the overall degree of prediction 
Y the therapists is moderately respectable. 


Background Variables 


en {sponses of the patients were examined 
ab] ae to a number of background vari- 
ducati hese encompassed the patient’s age, 
ere tional level, and occupation. Differences 
bre dicts evidence only as often as would be 
ze Ai on the basis of chance. Further 
a Er was done of the kind of therapy 
nosis P Men EEN, supportive) and diag- 
Were fosey” reliable differences in responses 
antici und only as frequently as would be 
the apated by chance. Additional analyses of 
Sonal ee "were made in terms of profes- 
the ¢ ‘scipline, sex, and experience level of 
tiong lerapist, Here, too, no important rela- 
Whe iPs emerged. It made little difference 
Psyehinn the patient had seen a social worker, 
fema atrst, or psychologist, had a male or 
stae S therapist, or been in therapy with a 
trai ee mba advanced level psychology 
€, or psychiatric resident. 
ol 
Nem Findings 
thera Noted previously, 4 years after their 
Akai bY had ended, 45 (74%) of the patients 
answered the questions originally asked 
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of them. The aim was to gain information 
about the patients’ views of their therapy 
after they had been graduated, so to speak, 
in the outside world—with transference feel- 
ings presumably muted and perspectives 
heightened. Responses to the questions were 
coded afresh using the identical rubrics as 
before. 

Seventy-eight percent of the patients now 
mention one or more positive changes com- 
pared with 90% previously. The specific 
changes itemized most repeatedly are still 
essentially the same as originally asked of 
them. “More realistic evaluation of self” 
(42%) heads the list, followed by “more 
ability to handle problems” (25%), and 
“more self-confidence” (22%). 

The major area of change reported now 
(Table 5) by the patients is behavioral 
(47%) whereas before it was insight (52%). 
However, their follow-up responses are not 
significantly different than their original ones. 

With respect to the question “What about 
the therapy itself did you find helpful?” the 
patients again highlight the same aspects 
they did initially. The major ones mentioned 
are the “opportunity to talk over problems 
with someone” (22%), “catharsis; outlet for 
tensions, help in expressing feelings” (20%), 
and “attitudes and characteristics of the 
therapist” (14%). The status quo holds con- 
cerning major areas of help (Table 5) with 
opportunity to talk over one’s problems and 
the therapeutic relationship remaining the 
leaders. Although not reliable, there is now a 
lessening tendency to select administrative 
factors as being helpful. 

Follow-up responses to the third question 
“What about the therapy itself did you find 
not helpful or even set you back?” do signal 
differences from the patients’ original replies. 
The most frequent nonhelpful aspects re- 
ported now are “therapist too passive, per- 
missive” (22%) and “interpretations and 
confrontations given when not prepared for 
them” (20%). There is a noticeable fall-off 
in the “nothing, everything was helpful” 
rubric from an original 37% to 20% (Table 
5). Examination of the nonhelpful areas 
shows that faultfinding is centered mainly 
on the therapist’s professional skills. This 


critical orientation is presently significant at 
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TABLE 5 
Comparison OF THERAPIST AND ORIGINAL PATIENT Major AREA 
RESPONSES WITH FOLLOW-UP RESPONSES OF PATIENTS 
Patients Patients x? xt Patients 
Therapists (original) (follow-up) Therapists Therar (original 
versus versus ees 
f % i % f % patients patients patier 
N =63" N =63 N =45 (original) (follow up) (follow up) 
Changes 5 
Insight 233 37 33 52 17 38 2.60 0.00 1.7 
Attitude 24 38 30 4 i4 3i -81 30 2.32 
Behavioral 450 71 3000 47 210047 6.46* 5.77* 01 
Symptom relief 36 57 16 25 3 2 11.82** 7.35** 03 
Other 5 8 6 10 8 18 0.00 1.56 94 
Helpful 
Opportunity to talk 7 
over problems 23 37 3 g 14 31 0.00 14 al 
a 
Tea Ronship 19 30 16 25 8s 18 0.16 1.54 50 
Therapeutic skill 24 38 10 16 5 n 6.81% 8.41% 18 
Administrative factors 6 10 9 14 1 £$ 30 = 3.22 
Satisfaction of 
‘emotional needs 22 35 4 6 s n 14.01% 6.72** 28 
Other 0 0 1 2 4 9 a> = = 
Nonhelpful 
None 10 16 23 37 9 2% 5.91 09 2.68 
Therapeutic oz 
relationship 7 11 9 14 6 14 -07 .02 ” 
Administrative factors 3 21 8 B 10 22 oO 0.00 110 
Therapeutic skill 2100 34 6 9 2000 44 9.24%% 94 15.65 
Other 5 8 3 5 6 13 — 35 -A 
Absence of emotional ae 
support 3 5 1 2 1 2 — 


scinally inv 
a Although the N of therapists involved is 28, they responded individually for each of the 63 patients originally i" 


Hence, our N reflects 63 therapist-patient pairs. 
* p< 05. 
p< 01. 


the .01 level when contrasted with the pa- 
tients’ original replies. The patients, at this 
time, vent displeasure and specify discon- 
tent more openly and clearly than hereto- 
fore, 95% now doing so compared to 43% 
previously (Table 5). Administrative factors 
are ranked as the second most nonhelpful 
area. 

Concerning the final question “Have you 
any suggestions for ways in which your 
therapy might have been made more help- 
ful?” the new suggestions indicate no reliable 
differences from those mentioned 4 years 
back. But, as manifest in responding to the 
nonhelpful question, there is enhanced 
“speaking-up.” Only 13% of the patients 
now fail to make suggestions as contrasted 
with 35% originally. Among the most per- 
sistent ones made are those related to tech- 
nique, i.e., “more direction and active par- 
ticipation from the therapist” (10%), “in- 


olved: 


n 

: jews 

creased length and frequency of sensi 
es 


(11%), use of hypnosis, joint therapy 5 tion 
with spouse, etc. Assuming larger propor ith 
now are recommendations connected “ 
administrative features, e.g., 
change of therapist, more attract : 
a shorter waiting period for therapy: ‘i 
medical examinations, etc. Additional ye 
and personal interest from the therapist 
tinue to be called for. are 
We thought it worthwhile to also comfi 
the follow-up responses of the patients irec- 
those of the therapists and see if aPY ter 


ive room) 


tional shifts had taken place. In the thei” 
of changes, the therapists maintaine ptom 
significant noting of behavioral and sy0P a 


relief changes when compared with the till 
tients. In helpful areas, the therapists “tje 
disclose reliable accentuation of thera in 
skill and satisfaction of emotional np 
contrast to the patients. In the non 
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realm, the patients have come around to 
accepting the therapists’ concern with tech- 
nique. The previous signficant difference be- 
tween the two groups on this score has 
Vanished. Indeed, percentagewise, this area 
1S mentioned more often by the patients 
than the therapists. Additionally, the past 
Statistical difference in the area of “nothing 
Was nonhelpful” has disappeared in the wake 
of the patients’ sharpened self expression. 
joy, the effect of 4 years intervening 
Phas on these patients has been to clarify 
he they did not like about therapy while 
aintaining their impression of positive 
enefit. 


DISCUSSION 
T thought-provoking contrast is in the 
e accent on insight changes compared 


Š nce symptom relief and behavior by 
stone age This seems to limn the touch- 
y which each group tests the world. 

the — is his inner subjective feelings; 
Saat his patient’s freedom from 
comm, ing behaviors and acquired social ac- 

odation. 

tribute in substantal proportion, at- 
With € their assistance to being able to talk 
atmo Someone about their difficulties in an 
ance spalte of interest, warmth, and toler- 
the „> OPposing vein, a goodly number of 
Strongly Pes conceive their aid as issuing 
ihoa from their professional mastery. Un- 
B,, edly idols of the den are at work here. 
Drege, qrames of reference aside, hierarchical 
in Gat if not actual weight differences 
tio therapeutic multiple regression equa- 
a tee manifest. The leitmotiv that sounds 
im Prak again in the patients’ replies is the 
oe of sharing uncertainties and ur- 
Wig, S With an individual who will listen 
respect and treat with dignity their 
ul The administrative aspects, both 
and nonhelpful, which are cited are 
fr TA in the service of undergirding and 
th a the therapeutic relationship but zeroes 
Mow: what he is doing performancewise. 
Laing ng the patient to express himself, sus- 
nizi 8 him in his emotional exigencies, recog- 
`a” the influence of attitudinal components 
kingi? heeded by the therapist, but the 
remains his technical excellence. 
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The consideration is not that expectations 
and concepts of therapeutic implementation 
vary or are noncomplementary between thera- 
pist and patient; rather, that lack of aware- 
ness of these diverging calculations and hopes 
may often make for avoidable disruption and 
lag, particularly in the early stages of ther- 
apy. Interestingly enough, after the patients 
obtain distance from their therapy experi- 
ence, they tend, although not significantly, to 
minimize insight and attitude changes. They 
veer sharply toward skill and proficiency areas 
in discussing what was not salutary to their 
therapy. This may well echo the patient’s 
newly won capacity to attend to the beat of 
sociocentric drums. Yet, one wonders about 
the relation of this fresh verbalization of 
what was presumably undifferentiated or not 
understood before to befitting consensual 
“explanatory” knowledge. 

Be that as it may, an implication of force 
is that the therapist’s contribution, as far 
as the patient observes it, is personal as well 
as technical, a theme which Carl Rogers has 
been justly heralding for some time. Of perti- 
nence is not only the personal feelings 
(countertransference) of the therapist as af- 
firmed by Freud, Sullivan, and others but also 
the patient’s perception of them. The notion 
that the therapist “cures” with what he 
himself is as well as his studied arts must be 
troika-ed with the patient’s anticipations and 
reactions to these. It is plain that cognitive 
characteristics of treatment are only one of its 
meaningful components. Our findings, least- 
wise from the patients’ outlook, tend to 
bolster Rogers’ view that attitudinal elements 
in the therapy relationships are a conse- 
quential ingredient accounting for change. 
Whether they are more salient than technique 
is not distinct but they apparently possess 
subjective antecedence for a considerable 
number of the patients. ; 

The enlarged capacity of the patients, at 
long last, to forthrightly speak up concerning 
“lacks” in their previous therapy lends 
credence to the idea that “transference” 
feelings are far from dissolved in many pa- 
tients at the termination of their therapy. 
Pfefier’s (1959) work certainly reinforces this 


datum. ‘ : 
Our findings are congenial with those of 
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Board (1959). He also studied patient and 
physician judgments concerning outcome of 
psychotherapy in an outpatient clinic. His 
results showed that patients equated success- 
ful therapy with being permitted to express 
their problems, gaining self-insights, and 
having interested and understanding thera- 
pists. Unsuccessful therapy was associated 
essentially with noninterest on the part of the 
therapist and assignment to multiple thera- 
pists. This aspect of frequent change of 
therapist is an accompanying one in our data. 
It behooves clinic administrators to temper 
this development as far as feasible. It is 
humbling to realize the rejection and “lost” 
feeling which so many patients experience on 
being transferred from one therapist to an- 
other, particularly when it is not linked to 
therapeutic motive—this despite preparation 
by the therapist. 

A few words of caution. We should not over- 
look the variety of aspects which both pa- 
tients and therapists relate are helpful and 
nonhelpful. For instance, note that a small 
but definite number of patients feel that 
“guidance and direction” are useful to them. 
Some are neurotic. A psychotic patient voices 
the opinion that his therapist was “too con- 
trolling” for him. We must guard against 
the thinking that “since this is the kind of 
thing most patients indicated was helpful, 
therefore it is the thing most helpful in 
therapy.” Clinical experience has demon- 
strated steadily that the same type of therapy 
is not necessarily most helpful to all patients 
even within identical nosological groups. We 
need to understand and integrate the informa- 
tion and implications for therapy offered by 
both our majority and minority groups 
among patients and therapists. 

Another caveat. These are data from pa- 
tients responding to a prevailing psycho- 
analytically oriented form of psychotherapy. 
Would they be similar in reacting to another 
theoretical style of psychotherapy? 

Finally, it is somewhat reassuring to obtain 
indications that positive changes do take place 
as a result of therapy. This, notwithstanding 
the tendency for patients and therapists to 
highlight its favorable outcomes. 
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As a test of the hypothesis that malformation in the body of the child 
constitutes a threat to the bodily integrity of the parent, and will result in 


a reud and Adler early in their careers 
te hifested an awareness that the individual’s 
actions to his body constituted some of 
i Most interesting of psychological phe- 
eee calling attention to and writing 
aa commentaries on such aspects of 
sism Ody-self relationship as anxiety, narcis- 
oa nd the inferiority complex. Curiously, 
j ologists as well as laymen have seemed 
eae what insights these pioneers had 
am ae with little attempt at empirical check 
elicit eir validity. Schilder’s (1950) effort to 
t beliefs, feelings, and memories about 
A ey in questionnaires developed by him 
eae to group and individual therapy 
aaa the first attempts at objective and 
pa collection of data. But by far the 
i aaa and ambitious strivings in this 
anction are found in the work of Jourard 
pSecord (1954, 1955a, 1955b; Secord, 
a set Secord & Jourard, 1953). They devised 
Scale questions to form a more objective 
the; than Schilder had provided, terming 
instrument a Body-Cathexis (B-C) 
Secord and Jourard (1953) wrote: “By 
2 cathexis is meant the degree of feeling 
VariguiSfaction or dissatisfaction with the 
343] » Parts or processes of the body [p. 
erent This is, of course, a somewhat dif- 
Meanj Use of the term than the psychoanalytic 
Stin, ng, which indicates the amount of 
àn lee, energy invested in one’s body as 


hha 
© B-C Scale, which is described more 


C, 
ae. 


parents of both malformed and norm: 


body-cathexis responses differing from those of parents of normal children, 
two forms of the Body-Cathexis Scale were administered to samples of 
al children. Support for the hypothesis 
was discovered in that there was a substantial correlation between responses 
to their children’s bodies and responses 
parents of the malformed children, while little or no correlation existed in 
this respect for the parents of normal children. Parents of malformed children 
also expressed more dissatisfaction with their children’s upper extremities. 


to their own bodies on the part of 


fully later, consists essentially in requiring 
the subject to indicate his degree of satis- 
faction or dissatisfaction with each of various 
parts of his body in terms of a five-point 
scale. The procedure, although subject to 
some of the objections and criticisms common 
to questionnaires and self-inventories, has 
the advantages of relative simplicity, proven 
value in a substantial amount of previous 
work, and demonstrated internal consistency 
and reliability (a reliability coefficient of .81 
has been obtained with it). 

Of most relevance to the research reported 
here is a study by Remy (1953). He used 
college males and females as subjects and gave 
them both body-cathexis (B-C) and self- 
cathexis (S-C) questionnaires. All subjects 
answered three questionnaires on each vari- 
able, filling out one for representing their own 
feelings on each, another two as they believed 
their mothers felt about their (the subjects’) 
bodies and selves, and a third pair repre- 
senting their perception of their fathers’ 
feelings. All subjects also took Maslow’s Test 
of Psychological Security-Insecurity (Mas- 
low, Hirsh, Stein, & Honigman, 1945). 

The main results of the study were that 
perceived mother and father attitudes about 
the subjects’ bodies were positively correlated 
to a substantial (r’s ranged from .56 to .84) 
and significant degree with self-evaluations, 
and that both parental cathexes and BC-SC 


scores correlated to a substantial degree with 
feelings of security. 
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Problem 


If the foregoing correlates of body cathexis 
seem somewhat a demonstration of the obvi- 
ous, they are no less valuable for that. They 
are specially worth while, moreover, in 
suggesting validity for the scale itself. 

A distinctly less obvious correlate or 
determinant of body cathexis in the indi- 
vidual may lie in the social evaluation of the 
body of his progeny by himself and others. 
In instances where from birth or later ac- 
cident the body of the child is in some way 
malformed, rejection and subtle defensive 
maneuvers may be found, though the existing 
evidence which would bear upon the fore- 
going point is at most quite general and 
indirect. 

For instance, Wittreich and Radcliffe 
(1955) had subjects wear aniseikonic lenses 
of varying strengths when viewing two 
figures, one normal, the other mutilated. On 
a scale purporting to measure “resistance to 
induced aniseikonic distortion they found 
that the threshold for the mutilated figure 
was significantly higher than for the normal 
figure.” The investigators in this case decline 
to offer any plausible interpretation of their 
findings, but to us a reasonable suggestion is 
that the phenomenon is a manifestation of 
perceptual defense. 

Gilder, Thompson, Slack, and Radcliffe 
(1954) achieved similar results to those of 
the foregoing investigation when they uti- 
lized full-sized mannequins as well as adult 
amputees as stimuli. One of their findings 
was that the perception of the amputee and 
even the mutilated mannequin was apt to 
provoke strong emotions of anxiety as well 
as anger. They interpret these reactions as a 
function of the threat that actual or even 
simulated amputation constitutes to the in- 
tegrity of the body image of a normal person 
since he identifies with the person viewed. 
The type of emotional response encountered 
is seemingly dependent upon the meaning the 
body injury has for the subject and his 
particular way of dealing with such stress. 

If persons unrelated to amputees experi- 
ence such threat and manifest such emotional 
reactions, then how much more likely it 
would be that the parent of an amputee 
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child will experience threat and manifest 
defensiveness in anger and rejection? The 
question of whether he does or not is SI& 
nificant from several considerations, the most 
obvious of which is that of the psychological 
adjustment of both parent and child to the 
circumstance of the latter’s deformity. Feel- 
ings of shame, guilt, and resentment have 
all been observed in parents of amputees by 
the senior author in clinical interviews with 
such parents in connection with her work 
with the Child Amputee Research Project 
at the University of California, Los Angeles, 
although how such reactions are related t? 
bodily integrity and body cathexis was not 
obvious. That the family stresses imposed by 
chronic parental illness are accompanied by 
affective and defensive reactions on the part 
of the children of such parents has bee? 
effectively shown in Arnaud’s (1959) study. 
That effects may be produced in an opposit 
direction seems highly probable. ad 

Out of the beliefs that the amputee chi J 
does constitute a threat to the bodily ™ 
tegrity of the parent, it was expected tha 
reactions to such threat might be manifeste? 
in the responses to the Body-Cathexis ques 
tionnaire. More specifically it was our itl 
potheses that (a) parents would respond Ties 
rejection of those parts of their child’s bo a 
where malformation of the child’s body © a 
isted, and thus would show extreme dissat 
faction for them in terms of the B-C 5°% 7 
and that (b) because of identification be 
the child, a substantial correlation ™ in 
responses of parents of amputee chile 
toward their own and their children’s bodi 
would be found. 


PROCEDURE w 
e: 
o samp d 


child ® 
as subject? 


For testing the foregoing hypotheses tw' 
of parents, one possessing a malformed 
the other a normal one, were employed 2 
Both children and parents in the respective 5° sure 
were matched in as many ways as possible i 
that whatever effect parentage of a ma 
child might have on response to the 
would not be confounded with the effects ° 
possible determinants. who 

Twenty-six parents, 6 men and 20 women puta” 
children were victims of upper extremity veexpe 
tions or anomalies, were selected for me e rani e 
mental” group. Their children were in the EA ad 
from 5 to 12 years and in all but four Ca 
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congenital amputations or anomalies, while the re- 
maining four cases had been victims of disease or 
accidents before they were 6 years old. The children 
Were otherwise in good general health, possessed 
normal vision without correction, and were of at 
least average intelligence as measured by the 
Columbia Test of Mental Maturity (Burgermeister, 
lum, & Lorge, 1954). Each of these children was 
matched with a normal child in chronological age, 
Mental age, and sex, and the parents of the latter 
Were chosen as a “control” sample. Five of the 
Parents selected for the latter group, who had 
fe ally agreed to participate, later declined to 
is in the study, thus reducing the number 
ae ne final sample to 21. As finally constituted it 
Ree of 18 women and 3 men. The two parent 
es ups were to a high degree alike in most 
Certainable characteristics. 
Otis ne parent of each child with an amputation and 
ae paren of a child without one were asked to 
ie ee two forms of the Body-Cathexis Scale, 
od indicating their attitude toward their child’s 
eae and the other their feelings about their own 
es, The scale for child’s body was given first. 
é gion of the B-C scale employed consisted of 
Were ing of 46 parts or aspects of the body which 
ollo to be rated by subjects according to the 
wish ng 5-point scale: 1—strongly dislike and 
i change could somehow be made; 2—don’t like, 
ne By put up with; 3—have no particular feelings 
myself or the other; 4—am satisfied; 5—consider 
satis fortunate. Thus, if a person were very dis- 
encir ed with his nose, for example, he would 
cle a 1., 
body parts or aspects are: hair, facial 
Ver hy hands, distribution of hair 
i ody, nose, fingers, elimination, wrists, breath- 


Ng, i ; i 
ankles b energy level, back, ears, chin, exercise, 
, 


height 
he, estion, hips, skin texture, lips, legs, teeth, fore- 
» feet, sleep, voice, health, sex activities, knees, 
, face, weight, sex (male or female), back 

i te head, trunk. 
TedueiePing with Johnson’s recommendation for 
8g response sets the following paragraph was 
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added to the original instructions: “You may refer 
back to the scale as often as necessary to make 
your judgment of how you feel. Judge each item 
carefully. Do not use the same number for each 
item.” 

RESULTS AND DISCUSSION 


Frequency distributions of the total scores 
and of each of the five ratings on the B-C 
scale, Child form, were made for the amputee 
and nonamputee groups. No significant dif- 
ferences were found between the two with 
regard to their median scores. 

Since only four items in the scale pertained 
to the upper extremities, it was thought pos- 
sible that a differential treatment of these 
four items by the parents of amputees might 
be masked by the distribution of ratings of 
the remaining items. For this reason re- 
sponses to these four items, Hands, Fingers, 
Wrists, and Arms, were extracted from the 
scale, and subjected to separate chi square 
tests. Because of the small cell frequencies 
Rating Categories 1 and 2 were combined, 
as was 4 with 5. The results of these tests 
are presented in Table 1. They indicate 
strong dislike for those features of the 
amputee child’s body which are deformed, 
such as fingers, hands, wrists, and arms. 

No significant differences between the 
groups were found, however, on the remain- 
ing items. In other words, the parents of 
amputee children strongly dislike those 
features of their children’s bodies which are 
deformed, but this attitude does not extend 
to other parts or functions of the child’s 
body nor to the body as a whole. The results 
of this test, then, tend to support the hypothe- 
sis that parents of amputee children will have 


TABLE 1 
Cut SQUARE Tests OF RATINGS FoR ITEMS PERTAINING TO UPPER Extremities: CHILD FORM 
Hands Wrists Fingers Arms B 
Non- 
m A Non- Non- 
Rating Amputee Bons Amputee amputee Amputee amputee Amputee amputee 
3+2 9 0 3 0 6 0 9 0 
4 4 5 5 4 4 3 i k 
8 13 16 18 17 16 | 18 EE: 
x2 = 9.2* x? = 2.23 xX = 57 x? = 12.96 


* 
b S 
< 01; Yates correction for small frequencies. 


TABLE 2 


CORRELATIONS BETWEEN PARENT’S ATTITUDE 
TOWARD CHILD’S BODY AND ATTITUDES 
TOWARD Own Bopy 


Nonamputee 


Amputee 
Parent-Child Parent-Child 
Scale (N = 26) (N = 21) 
46-item 49* .09 
42-item .54* 07 
*p <.01. 


extreme attitudes of dislike toward their 
children’s bodies insofar as they reject the 
specific anomalous feature of the child. 

The prediction was also made that on 
the Body-Cathexis Scale, Parent form, there 
would be a substantial correlation between 
the attitudes the parents of amputee children 
expressed toward their children’s bodies and 
those they expressed about their own bodies. 
To test this prediction an analysis of the Par- 
ent form similar to that of the Child form was 
made. No significant differences were found 
between the groups of parents, either in their 
attitudes toward their bodies, all parts and 
functions, or toward their own upper extremi- 
ties. In effect, parents of amputee children 
are apparently just as satisfied with and 
accepting of their own bodies as are parents 
of nonamputee children. 

The extent to which parents’ attitudes 
toward their children’s bodies agree with 
attitudes toward their own bodies was ascer- 
tained by computing Spearman rank cor- 
relations. In each case the total score for 
responses to the Child form was paired with 
scores on the Parent form. Correlations were 
computed for total scores for both the 46- 
item scale and the 42-item scale and are 
shown in Table 2. It is apparent that there 
are substantial correlations (.49 and 54) 
between the attitudes expressed toward the 
bodies of amputee children by their parents 
and the attitudes they express toward their 
own bodies. However, there is very little 
relationship (r =.09 and .07) between the 
attitudes of parents of nonamputee children 
toward their children’s bodies and toward 
their own bodies. 

Although the hypothesis is supported that 
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the parents of amputee children have atti- 
tudes toward their own bodies similar to 
those they have toward their children, it ap- 
pears, from scrutiny of responses to indi- 
vidual items, that they do not reject their 
own upper extremities as they do. their 
children’s. Rather they distribute their dis- 
satisfactions among the various other parts 
and functions of their bodies. ; 4 
A possible interpretation of this ee. 
finding is that the strong emotional Spee 
ment and identification with the child ° 
the part of the parent results in a generalize 
criticalness and dissatisfaction with own bo y 
parts which, while normal, are not ideal, a 
the net result is a body cathexis for ya 
which bears an overall resemblance to t 
for the offspring. i 
It is strongly suggested, as a general a 
pothesis needing confirmation in further } 
vestigation, that for the parent of ei 
deformed child his offspring may repres% | 
such an intimate extension of self thats 
is on the whole much less able to different 
his evaluation of his child’s body from ev@ A 
ation of his own than is the parent ° ely 
normal child. Further, the findings sae 
suggest the need for more general exam m 
tion of the question of how perception the 
defect and deformity in others affects sdf. 
individual’s perception and evaluation of 
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The purpose of the study was to test the hypothesis that somatic complaints 
during pregnancy and difficulties in childbirth are related to anxiety and 
conflicts involving attitudes and feelings toward others. The Ss, 52 primiparous 
women, were given a series of tests and structured interviews. Significant 
positive relationships were found between somatic complaints during pregnancy 
and manifest anxiety, marital conflict, and indirect indicators of conflict in 
the interview, i.e., history of menstrual symptoms, birth, and baby dreams. 
49 of the Ss’ delivery room records were rated for duration of labor and 
amount of analgesic required. Only one significant relationship was found: 
anxiety, as measured by the Affect Adjective Check List given during 


pregnancy, was directly related to amount of analgesic required. 


Many gynecologists and obstetricians are 
convinced that anxiety, stemming from con- 
flicts about pregnancy and childbirth, is an 
important factor in the common somatic 
disturbances in pregnancy and the pain and 
difficulty in childbirth. These convictions 
have been largely supported by clinical 
evidence until recently. 

Two studies have found responses to 
anxiety and psychosomatic scales related to 
physical and emotional symptoms during 
pregnancy (Zemlick & Watson, 1953; Schaefer 
& Manheimer, 1960). Anxiety measures from 
the TAT (Zemlick & Watson, 1953) and 
MAS (Davids, DeVault, & Talmadge, 1961) 
have been related to “abnormal reactions” 
during childbirth. 

The present study represents another at- 
tempt to test the hypothesized relationships 
between anxiety and conflict on one side and 


1A note of thanks is owed to Eugene E. Levitt 
for his contributions in the planning stage of this 
study and to Dale E. Berkebile for his assistance 


in collecting the data. 


somatic reactions during pregnancy and a 
ficulty during childbirth on the other. be 
attempt was made to specify some O 
independent and dependent variables 
precisely than in previous studies by tic 
more specific measures. Anxiety and some r- 
complaints were assessed at monthly a 
vals during pregnancy rather than by depe” 
ing on only one time sample. The ar 
reported here was a pilot to a more exten, 
longitudinal study of pregnancy, childbir e 
and childrearing being conducted 1” 
Indiana University Medical Center. 


Stupy I 
Procedure 


: en A 

The subjects were 52 primiparous women 5e Med- 
the prenatal clinic of the Indiana University Je 
ical Center. The racial composition of this the 


$ 5 p o 
differs from previous studies. porty so gst sub⁄ 


group were Negro and 11 were white. adle- 
jects came from a lower or lower ™! 5.2): 
background. Their mean age was 21.1 6 113 


and the mean of their years of schooling W4 
(SD 2.1). 
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Most of the subjects were seen during the fourth 
or fifth month of their pregnancy. At that time, 
they were given the first of a series of structured 
interviews designed to assess their reactions to 
Pregnancy and significant family figures in addition 
to certain other medical information not relevant 
to this study. Descriptions of the structured inter- 
view technique will be elaborated later in this sec- 
tion. The subjects were also given a test battery 
consisting of the following scales: 

1. The Taylor (1953) Manifest Anxiety Scale 
(MAS). 

2. Zuckerman’s (1960) Affect Adjective Check 
List (AACL), a test which uses a check list of 
Words with affective connotations and an empirically 
developed anxiety key. The “Month” version of this 
test, used in this study, asks the subjects to check 
Words describing their feelings during the previous 
Month, 

a Three scales from Schaefer and Bell's (1958) 

‘arental Attitude Research Instrument (PARI), a 
Scale designed to measure attitudes toward child 
rearing and family life. The three scales used were: 

arital Conflict, Rejection of the Homemaking Role, 
and Irritability (with children). These scales were 
pi to comprise a “Hostility-Rejection” factor in 

€ test (Zuckerman, Barrett-Ribback, Monashkin, & 
Norton, 1958), 
sir Masculinity-Femininity scale from the 
a A form of Osgood’s (1957) Semantic Differ- 
C0 ial was given to 16 of the subjects. The concepts 
dee were: Myself, Mother, Most Women, 
ae Woman, Father, Husband, Most Men, and 
ic Man. Difference (D) scores were computed 

tween all of the female figure concepts and the 
deal Woman concept and between all of the male 
mee concepts and the Ideal Man concept. In- 
Gn sing D scores between any figure and the ideal 
aes for that figure was taken as a measure of 

Pons “acceptance” of the particular figure. 
strate subjects were available for a second 
View pier interview, 1 month after the first inter- 
comp]: Chis interview contained a list of 13 somatic 
ae faints: backache, pain in womb, headache, 
Urgene? nausea, vomiting, constipation, diarrhea, 
earth, to urinate, weakness, fatigue, change in 
Viewe eat, and difficulty in breathing. The inter- 
their s inquired as to their presence or absence and 
ntaa ey A symptom complaint score was com- 
Sympt from this list by scoring 1 for any mild 
Dlaing complaint and 2 for any severe com- 
again During this second session, the subject was 

' given the MAS and AACL. 
Vie irty subjects were available for a third inter- 
With Containing the Somatic Complaint Inquiry along 
the „Other areas of content. They were also given 
AS and AACL again. On all subsequent 
Somatic Occasions, the subjects were interviewed, 
Were © Complaints assessed; and MAS and AACL 
7 da Teadministered. The last interview was about 
of ine before expected delivery. Varying numbers 

erviews were obtained from different subjects 


Month] 
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due to dropouts, varying starting times for the 
first interview, and termination of the study before 
completion of the series on all subjects. Ten sub- 
jects had one interview, 12 subjects had two inter- 
views, 11 subjects had three interviews, and 19 sub- 
jects had four or more interviews. 

Since the 10 subjects with only one interview 
never received a somatic complaint inquiry, they 
could not be used in any comparison involving this 
score. Since different subjects had different numbers 
of somatic complaint inquiries, MASs, and AACLs, 
average scores on these variables were calculated for 
each subject. A more homogeneous group consisting 
of 14 subjects with a minimum of five interviews 
and testings between the fourth and ninth months 
of pregnancy was analyzed separately on certain 
measures (Group II in Table 1). 

Structured interviews. The structured interviews 
attempted to cover a wide range of topics which 
the authors felt might bear on psychological reac- 
tions to pregnancy. Questions ranged from very 
direct ones, such as “Have you been working during 
your pregnancy?” to very indirect or open-ended 
questions such as, “How did you feel when you 
really suspected you might be pregnant?” A number 
of interviewers were used ranging from medical 
students to psychiatrists. Interviewers were cautioned 
not to lead the subjects, and were asked to record 
responses verbatim. Some deficiencies were apparent 
when going over the interview transcripts. Since 
this was not the most verbal group of subjects, 
their responses were mostly bland or defensive in 
nature. Questions regarding feelings elicited very 
little. Since the interviewers had been warned against 
“leading” the subjects they tended to accept non- 
committal responses. A lack of rapport with the 
subjects may have also limited the material which 
was elicited from them. Despite the obvious 
deficiencies in the interview technique, an attempt 
was made to categorize responses in order to com- 
pare high and low symptom groups. It was hoped 
that leads might be obtained for devising a new 
interview schedule to be used in more extensive 
studies planned for the future. 


Results 


Marital status. Eight pregnant subjects in 
the group were single at the time of the 
interviews. Eleven subjects had been married 
between 1 and 3 months. Since they were 
presumably in their fourth month of preg- 
nancy, it was assumed that subjects in this 
group were single at time of conception. A 
third group of 33 subjects had been married 
4 months or more. The scores of these three 
groups on the MAS, AACL, Somatic Com- 
plaint inquiry, M-F Scale, and three PARI 
scales were compared, using simple analysis 
of variance. Although Irritability, MAS, and 
AACL mean scores were somewhat higher 
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TABLE 1 


CORRELATIONS OF MEAN SYMPTOM SCORE WITH TESTS 


Group I* Group II 


N 42 14 
Manifest Anxiety Scale 34* .59* 
Affect Adjective Check List .20 —.06 
Masculinity-Femininity irj .05 
Marital Conflict .32* 9° 
Rejection of Homemaking Role 10 22 
Irritability (with children) —.19 14 


aA minimum of two testings between fourth and ninth 


months of pregnancy. h 
bA minimum of five testings between fourth and ninth 


months of pregnancy. 
*p <.05, two-tailed test. 


in the “single” group, none of the F ratios 
reached the .05 level of significance. 

Correlations between tests and somatic 
scores. Table 1 presents the correlations 
between the objective test scores and the 
somatic complaint scores obtained from the 
interviews. Group I contains all subjects who 
received at least one symptom complaint 
inquiry. Group II contains only those sub- 
jects with a minimum of five interviews. 
Group I includes the subjects in Group II. 
In both the extended and restricted samples, 
the MAS and the Marital Conflict scale from 
the PARI correlated significantly with the 
somatic symptom complaint score. 

Semantic differential measures and symp- 
tom complaints. The correlations between the 
six D scores and the somatic symptom 
score are listed in Table 2. All of the corre- 
lations, with the exception of that one in- 
volving the Ideal Man-Father discrepancy, 
were positive. However, the only correlation 
which approached significance was that be- 
tween the Ideal Woman-Mother discrepancy 
and the Somatic Symptom Complaint score. 
The tendency was for less acceptance of 
“Mother” to be associated with higher 
somatic complaints during pregnancy. 

Somatic symptom complaints and the struc- 
tured interview. The highest 11 subjects on 
the average somatic symptom complaint score 
were compared with the lowest 11 subjects on 
their responses to the first interview. All single 
subjects were excluded from the comparison 
since many of the questions in the interview 
were irrelevant to their situation. Of the 44 
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questions contained in the first interview, 
many pertained to medical history, and others 
obviously elicited little in the way of cate 
gorizable responses. Ten questions Were 
selected for analysis. These were questions 
pertaining to feelings of the subjects and 
their husbands about the pregnancy, plans 
for method of feeding the baby, history af 
menstrual difficulty, source of, and ways ° 
handling hostility, and dreams. Two questions 
yielded significant differences between the 
two groups. Direct questions on feelings 
about being pregnant and having @ baby 
revealed no differences between the tw? 
groups. There was a significant difference 
between groups on reports of premarita 
menstrual difficulties (usually cramps). Seven 
of the 11 subjects in the high somatic symp 
tom group reported these difficulties while 
only 2 of the 11 subjects in the low gtouP 
reported them. A second significant difference 
was found on the reporting of dreams abon 
babies and birth with 6 subjects in the his” 
symptom complaint group and only 1 
ject in the low symptom complaint 
reporting such dreams. socts 

During the second interview, the subjec e 
received the first Somatic Symptom 
plaint Inventory. The selection of hig 


sub: 
group 


h and 


: n 
low somatic symptom groups was based ° 
average symptom scores, However; eve Ta 
this second interview, there was no ove ie 
between the two groups: the scores of re 
low group ranged from 0 to 2, and the se 
of the high group from 3 to 9. The beat 
questions in the second interview ©¥° 
TABLE 2 
MEAN D Scores AND CORRELATIONS wITH 
Symptom ScorES® 
r with 
ti 
mae Somoms 
0 
Ideal Woman versus Self 5.83 43 
Ideal Woman versus Most Women 7.67 46" 
Ideal Woman versus Mother 6.51 ‘09 
Ideal Man versus Father 5.80 7'34 
Ideal Man versus Husband 4.91 38 
Ideal Man versus Most Men 6.57 i 
band 


= i Hu 
REE de except Hn tts case of the Idal Man versus 
ree single subjects were excluded. 
*p <.07, two-tailed test. 
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a wide range of topics: feelings about the 
pregnancy, situational stresses, past and 
present sexual behavior, and so on. Nineteen 
questions were selected for analysis, five 
questions yielding significant differences be- 
tween groups. When asked how things had 
been going since their last visit, all 11 of 
the low symptom group said “fine” or “all 
right,” while four of the high symptom group 
mentioned difficulties. One subject mentioned 
lack of sleep and “things” getting her 
‘down”: another subject said she was “mixed 
up” and using her pregnancy to “pick 
arguments”; the third subject complained of 
gaining weight, and tiredness; and the fourth 
Subject said “I will get sick, but I’m still 
living.” When asked if things had been going 
as they expected, all 11 in the low symptom 
complaint group said “yes” while 4 in the 
high symptom complaint group said “no.” 

hen asked about changes in their sleep 
Pattern, 10 subjects in the high symptom 
group and 6 in the low reported changes. 
About half in each group reported sleeping 
less while half reported sleeping more. In 
questions about the content of their dreams, 
the result found in the first interview was 
found again. Seven subjects in the high symp- 
tom complaint group reported dreaming 
about babies and birth and only 2 in the 
low symptom complaint group reported such 
dreams. More will be said about the content 
of these dreams in the Discussion section. 
‘he last significant result found in this inter- 
View was in regard to changes in interest 
Since pregnancy. Seven of the high symptom 
Complaint group reported an increased inter- 
est in babies while only one of the low 
ns Mptom complaint group reported this par- 
icular interest, The interviews beyond the 
Second were not analyzed because of the 
OSs of subjects from the two symptom groups. 


Stupy II 
Procedure 


pinne subjects were 49 of the 52 women who had 

con used in the previous study on pregnancy. The 

est Parisons in this study utilized the psychological 

the data obtained in the initial testing session and 

obt: „average somatic symptom complaint score 
ained in the interviews. u 

cha, e senior obstetrician reviewed the subjects’ 
tts, without prior knowledge of psychological 
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test results, and delineated two variables: length 
of labor from the first contractions to the second 
stage of labor, and amount of analgesic required 
by the patient. The obstetrician believed that the 
latter was a rough objective index of the pain 
reactions of the patients during labor. The group 
was divided into a long labor group (15 hours or 
longer) of 13 subjects, an intermediate group (6-14 
hours) of 24 subjects and a short labor (5 hours 
or less) group of 12 subjects. On the second vari- 
able, 6 subjects required no analgesic, 25 required 
the average amount, and 18 required above average 
amounts to deal with their pain. 


Results 


Length of labor. The three duration of 
labor groups were compared on the two 
measures of anxiety (MAS and AACL) ob- 
tained during the fourth or fifth months of 
pregnancy, the three scores from the parental 
attitude scale (PARI) and the “Hostility- 
Rejection” factor score resulting from their 
combination, the Masculinity-Femininity test 
from the MMPI, and the average somatic 
symptom complaint score obtained during 
pregnancy. A one-way analysis of variance 
was computed between the three groups for 
each of the eight variables. None of the 
F values reached statistical significance at 
the .05 level. The only F approaching sig- 
nificance (F = 2.10, 2/46 df) was that for 
the AACL. The mean AACL anxiety scores 
were 8.77 for the long labor, 6.78 for the 
intermediate, and 5.58 for the short labor 
groups. 

Amount of analgesia. The three groups 
formed on the basis of amount of analgesic 
required were compared on the eight vari- 
ables mentioned above. Only one of the eight 
F tests yielded a significant F ratio. The 
F between groups on the AACL anxiety 
measure was 4.27 which for 2 and 46 degrees 
of freedom is significant below the .05 level. 
The six subjects requiring no analgesic had 
a mean AACL score of 3.17, the 25 subjects 
in the average analgesic group had a mean 
score of 6.96, and the 18 subjects requiring 
above average amounts of analgesic had a 


mean score of 8.39. 


Discussion 


The significant correlation between the 
MAS and somatic complaints during preg- 
nancy supports the idea that many of these 
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complaints reflect somatic expressions of anx- 
iety, and their relationship with the Marital 
Conflict scale suggests the origin of this 
anxiety. The Marital Conflict scale is an in- 
direct test, in that it asks the subject about 
marriages in general rather than about her 
own marriage. The more direct evaluations of 
the husband obtained in the Semantic Dif- 
ferential and the structured interview were 
mostly positive or neutral. In fact the D 
score on the Semantic Differential shows 
greater idealization of husband than of self, 
parents, or other persons. We would hypothe- 
size that a conflict between unconscious feel- 
ings of hostility toward the husband and 
dependency on him may be a primary source 
of anxiety in pregnancy. Rejecting attitudes 
toward the mother were more overt and 
tended to be directly related to somatic 
symptoms. 

A similar contrast exists in the interview 
where direct questions about the subject’s 
own feelings toward her pregnancy and ques- 
tions about worries elicited mainly positive 
feelings and denial of anxieties in both high 
and low symptom groups. Dream material, 
on the other hand, seemed to reflect a 
greater preoccupation with the forthcoming 
parturition in the subjects with many somatic 
symptoms. Some of these dreams have 
obvious conflictual content. For instance, one 
subject dreamed “that someone is taking 
something away from me, something about 
the baby.” Another subject dreamed that she 
had a huge baby and did not know how she 
did it. Several subjects dreamed about having 
twins. One subject dreamed she had a girl, 
and then added that her husband wanted a 
boy. One low symptom subject dreamed that 
her baby was too small and her husband 
said it was not big enough to be a baby. The 
dream report technique seems to be of some 
potential value in exploring fears in preg- 
nancy. One improvement in this technique 
might be to have subjects in future studies 
keep a pad and pencil by their night stands 
to record dreams immediately. 

On the conscious level, the high symptom 
group expressed a greater interest in babies 
than the low symptom group. In terms of 
Dollard and Miller’s (1950) conceptualiza- 
tion of conflict, given two equal avoidance 
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gradients, the situation with the higher 
approach gradient produces more conflict. 
Perhaps the motivations of this group r 
higher in regard to pregnancy, which woul 
increase the strength of existent conflicts. i 
The greater tendency to report premarita 
menstruation difficulties in the high symptom 
group can be interpreted in two ways: 
First, one could hypothesize that earlier 
conflicts over sex, previously expressed iM 
menstrual difficulties, have made this srouP 
particularly vulnerable to somatic stress E 
volved in pregnancy. Second, one could s 
pothesize that this group has a general ten 
ency to report somatic symptoms which others 
would forget about or ignore. The a 
interpretation is more parsimonious, p- 
going back to the question on physical pt? i 
lems before pregnancy and analyzing E 
sponses other than menstrual difficulties, k 
was found that the same number of S 
(three) in both high and low SYMP 
groups complained about other phys! 
symptoms before pregnancy. A 
The results aa nt the psychological 
test variables obtained during pregnancy ye 
the obstetrical variables did not yield pjes 
pressive findings. None of the test varia a 
was significantly related to the duration s 
labor and only one of the variables, tly 
AACL measure of anxiety, was significai 
related to the amount of analgesic require yas 
the patient during delivery. This result a A 
in the expected direction with those ag 
jects requiring more analgesic haves 
higher anxiety scores earlier in ‘i 
nancy. However, the lack of signific 
relationships between the MAS and At 
measures and the labor variables 
support the findings of Zemlick and wie 
(1953) and Davids et al. (1961). The 


. i noes 
ferences in results may be due to ane 
in the populations used or different CH ive 


used to evaluate childbirth difficulty. F" ob- 
studies should devote more attention cio 
servation of the subjects’ behaviors aver 
delivery rather than depending on á 
chart records. 
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RESPIRATION AND GSR AS FUNCTIONS OF WHITE 
SOUND IN SCHIZOPHRENIA * 
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The hypothesis was that white sound would reduce stress reaction. The 4 
groups were: 20 normals-white sound (N-WS), 20 normals-no white sound 
(N-NoWS), 20 schizophrenics-white sound (S-WS) and 20 schizophrenics-no 
white sound (S-NoWS). GSR conditioning and respiration 
measured for 20 trials, with a light as a CS and an electric shock as a UCS. 
The significant findings were: (a) S-WS had highest respiration, (b) changes 
in GSR conductance were greatest in N-WS, (c) basal GSR increased only 
in N-WS, (d) schizophrenics groups failed to condition. The conclusions were: 
(a) WS is a noxious stimulus (70 db.); (b) 2 groups responded with dis- 
tinguishable physiological patterns; (c) fear and anger constructs were in- 
ferred for schizophrenics and normal groups, respectively; (d) lower oxygen 
saturation in schizophrenics was inferred in explanation of findings. 


Psychogenic disorders, often manifested by 
tension, agitation, or anxiety, have been par- 
tially overcome by use of psychotropic drugs 
capable of producing toxic effects. Possible 
application of white sound (WS) as another 
therapeutic modality becomes obvious by 
virtue of its success as an audio analgesic in 
minor medical and dental surgical procedures 
(Cook, 1960). This study was undertaken in 
an attempt to evaluate the function of WS 
with schizophrenic and normal subjects under 
induced stress. The available information does 
not provide hints as to the nature of in- 
fluence that may be exerted by WS on 
schizophrenic and normal populations. How- 
ever, there is experimental evidence (Malmo, 
Shagass, & Davis, 1951) that schizophrenic 
patients respond to stress evoking situations 
with a greater degree of muscular tension 
than normal subjects. Whether WS would 
function to reduce induced anxiety can not be 
predicted, although there are data suggesting 
that it would be effective as an analgesic, 

1This study was supported, in part, by the 
Veterans Administration Medical Research Program 
(8200 Funds). 

Grateful acknowledgment is made to Harry F. 
Harlow, Primate Laboratory, University of Wis- 
consin, for his thoughtful suggestions with respect 
to this investigation. 

2 Also affiliated with the University of Oklahoma 
Medical Center. 


rates were 


s M 
Recently there has been some succes 


application of WS as an analgesic (Cate 
& Licklider, 1959) in dental operations, ~ 4 
as cavity preparation, sealing, erindings 600 
extraction. They found that for 637 ° tive 
patients, the WS was completely effec 
25% of the patients found it less eded, 
complete, but no other anesthetic was ?® 
and 12% were not influenced. n that all 
The WS is much like white noise 1n ut 
frequencies are simultaneously present, ach 
unlike white noise, the intensity ° 
frequency is not equal, but tailor vill 
conform more closely to frequency Sers! 
of the ear. According to Cook (1960); 


re 
White sound is derived from the very p o? 


tailoring of white noise as a raw materia 138 
new discipline of distribution of intensity ,sus 
frequency, which matches the sensitivity und 
frequency characteristic of the ear. - + * ies degt” 
is capable of producing uniform and Þig audit”, 
of stimulation of a large fraction of the th 


ng 
nerve endings as they are distributed alo 


basilar membrance of the cochlea [p. 3]- 


There have been no reports of We Faced 
used in a laboratory situation wi a 
stress. However, there is also evidenc”, 
noise in itself can produce stress- 5 
reports in closely related literature ibility 
effects of noise would indicate that porcludee 


of stressful influence can not be 
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Duncker (1937) found that a severe pain in 
humans resulting from pressure applied to 
one arm was reduced when the other arm was 
also stimulated painfully, and also the same 
effect was achieved when he used a loud 
noise in place of the second stimulus. There 
are other studies demonstrating the nox- 
lousness of loud noise. Some of the results 
that have been obtained are: an increase in 
trembling resulting from a 90-db., 8,000-cps 
tone (Miller, 1953); impaired performance 
on a reaction time task plus reports of fatigue 
and discomfort after exposure for a 3-hour 
Period to a 115-db. sound (Miles, 1953); 
decreased vigilance under a 100-db. noise 
Condition (Broadbent, 1951, 1953); reports 
Of irritation, distraction, and discomfort as 
a result of 30-minute periods of 111-db. noise 
(Miller, 1951). 
_Some physiological effects of noise are a 
se in blood pressure following a loud un- 
expected sound (Lovell, 1941), a decrease in 
Peristaltic contraction and flow of gastric 
Juices following 10-minute periods of noise 
Smith & Laird, 1930), and an increase in 
Pulse rate during periods of noise while taking 
an intelligence test (Corso, 1952). Increased 
Palmar sweating, EMG, and respiration 
amplitude were reported along with de- 
reused pulse, finger volume, and respiration 
be and initial increase followed by a de- 
Tease in pulse rate (Davis, Buchwald, & 
Tankman, 1955). Lovett Doust and Schnei- 
rea (1952) showed that oxygen saturation 
vels of the blood „varied consistently with 
Romie auditory stimuli in both hospital 
pee ts and normals. Some effects of de- 
pe oxygen saturation or anoxemla re- 
titans were poor concentration, tiredness, ir- 
lon, and dizziness. In another paper 
aad Doust, 1953) it was pointed out 
these findings of rhythmic effects upon 
Ygen saturation level might be applicable 
is ental illness where one of the character- 
1 € features is relative anoxemia. This prob- 
Was considered in another paper by 
tt Doust and Schneider (1954) where 
ae, data was presented on “Rhythmic 
ae. Bombardment Therapy.” They for- 
ig ed a hypothesis that relative anoxemia 
ne of the critical physiological spheres of 
“ntal illness, either in a resting state or in 


Love 
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stress situations. This approach stresses phys- 
iological effect of repetitive stimuli, in rela- 
tion to different sense modalities, upon treat- 
ment of mental illness. The degree of actual 
effectiveness of rhythmic stimulation as yet 
has not been determined. 

If the WS acts as an analgesic (Cook, 
1960), then it is plausible that the UCS 
should not produce as strong a reaction under 
the WS condition when compared to the con- 
dition where WS is absent (N-WS), since the 
GSR varies with strength of effectiveness of 
the UCS (Furer & Hardy, 1950; Hovland & 
Riesen, 1940). Also, conventional anesthetics 
tend to raise the resistance level of the skin 
(Lindsley, 1951). However, if the WS does 
not act as an analgesic but as any other 
noise, then one would expect a greater GSR 
from the WS group. This proposition is 
based on the evidences that when the noise 
is noxious it tends to lower basi¢ resistance 
level and it also functions to increase the 
response to shock by generating an increase in 
the activation level of the subject (Lacey, 
1959). Respiration, like the GSR, has been 
used as a measure of reaction to a stressful 
situation (Ax, 1953; Schachter, 1957). 

More specifically, the purpose of the 
present study was to determine the effect of 
WS upon the galvanic skin response to 
electric shock and upon frequency of respira- 
tion in normals and schizophrenics. 


METHOD 


Subjects. Subjects for the experiment were 40 
chronic undifferentiated schizophrenics who were pa- 
tients at the Veterans Administration Hospital, 
Tomah, Wisconsin. The schizophrenic group had a 
mean age of 36.52 and a mean length of hospitaliza- 
tion of 61.35 months. All patients receiving any 
type of psychotropic drugs were taken off their 
medication 30 days prior to the experimental 
procedure. Also, 40 employees at the same hospital 
were used as normal controls. Both groups ex- 
cluded the subjects under 25 or over 50 years of 
age. The mean age of normals was 35.78 years. 
These subjects were divided into four groups: 
schizophrenic-WS (Schiz-WS), schizophrenic-no WS 
(Schiz-NoWS), normal-WS (Nor-WS), and normal- 


no WS (Nor-NoWS). et 
Apparatus. The GSR and respiration were recorded 


with a multipurpose polygraph recorder. This ap- 
paratus recorded resistance to the nearest 1,000 
ohms. Two GSR zinc electrodes used were of the 
finger clamp type and were placed on the index 
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and ring fingers of the right hand. The fingers 
were cleaned with alcohol solution and EEG elec- 
trode jelly was applied prior to attachment of 
electrodes. The electrodes provided contact with 
both of the finger pads. The GSR current applied 
to electrodes was a constant 20 microamperes, dc, 
which was then amplified and recorded: on a pen 
polygraph. The current was regulated by an amplifier 
unit which had a self-contained regulated power 
supply for dc voltages. The subject resistance con- 
trol on a preamplifier was used to balance basal 
resistance and had a dial for reading the electrode 
basal resistance. The subject was instructed to 
get his right hand and arm in a comfortable 
position on the arm of the chair and then let it 
go limp and refrain from moving it. 

The UCS was an electric shock delivered by an 
Ensco Model PC-1 Portable Conditioner through 
zinc strap electrodes attached (electrode jelly) to 
the subject’s left wrist. The shock was regulated 
by dc voltage stimulus which could be varied 
from 0 to 150 volts and was current regulated. The 
time duration of pain stimulus was set manually 
by the use of variable cams. The CS was a 15-watt 
light bulb behind an 8 X 12-inch milk glass screen 
placed 7 feet in front of the subject at eye level. 
The CS was presented for 5 seconds and was fol- 
lowed immediately by the UCS for 2 seconds. The 
duration of the CS and the onset of the UCS were 
controlled automatically by use of a cascaded 
multiple-phase electronic timer. The duration of 
the UCS was controlled in the same manner. 

The WS and music that was used for attention 
was produced on the same record. The sound was 
reproduced with a stereophonic system consisting 
of a 40-watt amplifier and 2 sets of speakers. One 
of the speaker systems was placed 4 feet directly 
behind subject’s chair and the other speaker was 
placed 3 feet in front of the subject. By means of 
the balance control on the amplifier, the sound was 
channeled as much as possible to the speaker behind 
the subject. The intensity of the sound and music 
was at continuous 70 db. This was not a subjective 
level, but a standard intensity level at the point 
where subjects were seated. The experimental room 
was air conditioned and during the actual running 
of the subjects the room was darkened except for 
an experimenter’s lamp of low illumination (15 W). 

Procedure. The subject was seated in a com- 
fortable easy chair and the electrodes were attached. 
While the electrodes were connected, the subject 
was told in a conversational manner something of 
the nature of the experiment and what was expected 
of him. The following points were always covered: 

1. The purpose of the experiment was to de- 
termine the effect of small electric shocks on his 
wrist upon sweating. 

2. The only thing for the subject to do was to 
relax as much as possible and to refrain from 
moving about in the chair. 

3. The WS groups were also told that to help 
them relax, some specially designed noise and 
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music would be played. Also, to be sure they got 
the full benefit of the noise and music they were 
told that they must listen to it by trying to follow 
the music through the noise. 

The subjects were then 
shocks of ascending intensity beginning Fi 
they were instructed to tell the experimenter i 
which point the shock was “uncomfortable as 
not painful.” When this level was determined É 
lights were extinguished and for the WS grou 
the sound with music was turned on. There Eo 
no difference between the levels of shock appler 
the two groups; the mean shock value was als 
and 4.79 milliamperes for schizophrenics and norm: 
respectively. 

Five minutes were then allowed to elapse A 
the subject to become acclimated to the situalo” 
after which time the first base level of the sit 
was determined. As soon as the base level was $? 
the first adaptation trial began. 

There were 30 trials in all: à 
with the light alone, and 20 conditioning ai 
with light and shock paired. In order to mini 
anticipation, the intertrial interval was 40, 50 "ith 
seconds, each interval being chosen randomly “ 
the qualification that each interval was used 
9 or 10 times. 


given a series of 
at 0 and 


+ jals 
ation tria 
10 adapta trials 


RESULTS 


x f 
The means and standard deviations : 
respiration and GSR data are shown 
Table 1. Due to the nature of these er 
nonparametric Mann-Whitney U test nce: 
applied to test the significances of differe 
between the groups. d as 
GSR. The GSR scores were measure ne 
changes in conductance. The conditio vel 
response was the difference between the 
at the point where the light came an 
highest point for the duration of the 
The UCR was the difference between the oi 
at the onset of shock and the highest Py. 
during 5 seconds following the onset of F ot 
All independent between-group compat 


the 


ight: 
revel 


TABLE 1 


oN- 
RESPIRATION RATES AND MAGNITUDE OF : 
DUCTANCE CHANGES FOR SCHIZOPHREN 
AND NORMAL SUBJECTS 


Respiration UCR 
Subjects M SD M SD p 
À Tma GA 
Nor-WS 10.28 2.23 39.16 18.62 391 60 
Nor-NoWS 10.41 3.18 24.43 7.85 tyg Ge 
Schiz-WS 14.03 6.07 21.94 10.03 7,04 3 
Schiz-NoWS 10.69 4.09 21.56 12.68 è 
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TABLE 2 


GSR AND RESPIRATION Group COMPARISONS 


Respiration 


a Base Levels 


Ini- Termi- 


Conditions Adaptation CR 


Adaptation CR UCR tial nal 


7, 

Normals—white 
sound versus no 
white sound 


Overall ns ns 


Last 10 trials 


Last 5 trials 


Schizophrenics— 
White sound versus 
no white sound 

Overall E SaD 

p < 02 


U = 113.5 
p <02 


x, 
Normals versus 
Schizophrenics— 
White sound 
Overall U=47 
p <.02 


U =45 
p < .02 


Normals versus 
Schizophrenics— 
no white sound 


(0) 
verall ns ns 


ns ns ns ns ns 


ns U 


ns ns ns ns ns 


ns ns 


q 
1 
w 
= 
oS 
u 
=} 
Eg 


Al 
Sh 


Al 
> 
a 
i 


ns ns ns ns ns 


Wi 
at ae using the Mann-Whitney U test. 
e Wil in-group comparisons were made with 

d ilcoxon Signed Ranks test. 
toute The Nor-Ws group was con- 
y above the other three groups in 
be As can be seen from Table 2, this dif- 
wg ce Was significant only between the Nor- 
$ Ta Schiz-WS. The Nor-WS, however, 
kiena variance as compared with the 
nares group (F = 12.47, p< 01). 

otip itioned response. The CRs for the four 
i shown in Figure 1. Using the last 
i Rone the first 5 conditioning trials, the 
Cant ce for the normal groups was signifi- 
Pirin? < .01). Also, as can be seen from 
Breater 1, the normal groups gave much 
tients CRs than did the schizophrenic pa- 
Ence who did not condition. These differ- 
were significant at the .05 level of 


confidence. In addition, the CRs of the 
Nor-WS group were maintained at a higher 
level than those of Nor-NoWS throughout 
conditioning, even though both groups tended 
to diminish in CR. This difference was found 
to be significant for the last 5 trials (p < .05). 

Unconditioned responses. Magnitude of 
UCRs is shown in Figure 2 for the four 
e again, the Nor-WS group is 
above the other three groups. 
When this group is compared with the Nor- 
NoWS group, the difference is significant at 
the .05 level for the last 10 trials. The differ- 
ence over all trials between the Nor-WS group 
and the Schiz-WS group was also significant 


at the .02 level. 
Base levels. Two base levels were deter- 
mined for each subject. One just preceded the 


first adaptation trial and the second followed 


groups. Her 
consistently 
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BLOCKS OF TRIALS 


Fic. 1. Changes in conductance in CR for blocks of 2 trials each following 10 
adaptation trials. (Each point is based on the mean value for 20 subjects.) 


he a A iratio” 
the last conditioning trial by 30 seconds. The Respiration. The frequency of respirati® 


comparisons that were made can be seen in was determined by the number of peaks d- 
Table 2, and none were significant. First the respiration record for 20 seconds preca 
versus final base level GSR differences were ing the CS and 20 seconds following the r 
tested and the only significant difference of the CS for each trial. The results of a 


found was for the Nor-WS group with the measure are shown in Figure 3. Here, a 
final base level conductance being higher Schiz-WS group is consistently higher be: 
than the initial level (p < .05). the other three groups. The difference 


x 
o 
A 


s~s Norm. - WS 


mo Norm. - NoWS 
so @—e Schiz. - WS 
OmrmimsO Schiz. - NOWS 


UCR MAGNITUDE OF CONDUCTANCE CHANGE 
micro -mho - 


6 
BLOCKS OF TRIALS 


Fic. 2. Changes in conductance in UCR for blocks of 2 trials each following 10 
adaptation trials. (Each is based on the mean value for 20 subjects.) 
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Fic. 3. Respiration rate for blocks of 2 trials each following 10 adaptation 


trials. (Each point represents m 


ean frequency of respiration of 20 subjects 


for 20 seconds preceding and following CS.) 


pee the Schiz-WS and Schiz-NoWS groups 

henge (p < .02) for adaptation and 

itioning trials. Furthermore, when Schiz- 

en Nor-WS groups were compared, 

A cant differences were revealed for both 

( adaptation and the conditioning periods 
P< 02). 

Discussion 


rin Comparing the effects of WS in this 
chair with the effects observed in the dentist’s 
situati Several points of difference in the two 
intensity. must be recognized. First, only one 
ile A of WS was used in this experiment, 
ensity a dental patient selects his own in- 
eco evel and can change it at any time. 
contine the WS in this experiment was 
dlient us. In the applied situation, the 
n a manipulate the sound and let it 
desired Pind or in short bursts, if so 
instructi nother difference is that of the 
ertain| ons. The dental patients were most 
ie i tng: that the Ws would mask their 
ere et the subjects of this experiment 
them ‘pla told that the WS would help 
ental © relax. Also, the mere fact that the 
Rives Ree controls the sound not only 
ut Ae some control over the situation, 
Patien is little task may tend to occupy the 
t and serye as a distraction from the 


dentist’s operations. On the other hand, in 
the present experimental setting the subject 
had only to sit in a chair for approximately 
40 minutes without any activity. 

For the GSR data, the most significant 
result is that when there was a difference 
between WS and NoWS groups, the WS 
group always produced a greater response. 
This is the opposite of what would be pre- 
dicted if WS were to function as an analgesic. 
This finding seems to warrant the conclusion 
that, in this experiment, WS was a noxious 
stimulus much like any other continuous loud 
noise. This would indicate that any possible 
analgesic effects of WS might be due not to 
any somatic consequences of WS per se, 
but to the effects of another noxious stimulus 
which may serve to distract the patient from 
other noxious or painful stimuli. Equally im- 
portant was the failure of schizophrenics to 
show GSR conditioning, whereas there was 
rapid conditioning in the normal group which 
was followed by a swift adaptation or extinc- 
tion (Figure 1). This extinction may have 
been due to the fact that the UCS was 
energized immediately following termination 
of CS. Thus, this timing factor may have 
produced some fusion between UCR and CR. 
Although there are no adequate means to 
explain this specific difference between the 
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two populations, it is noteworthy that schizo- 
phrenics’ GSR responsivity was generally 
suppressed as compared to normals. 

Respiration data reveal some unexpected 
but interesting trends. Here again, the WS 
group gave a stronger response, but with this 
measure it was the schizophrenic group that 
showed significant increase and not the 
normal group, as was the case with the GSR 
data. An interesting hypothesis, following the 
ideas of Lovett Doust and Schneider (1952), 
that noise causes the oxygen saturation 
level of the blood to decrease and thus 
triggers the response of an increased respira- 
tion rate, would be that oxygen saturation 
was lowered in schizophrenics under WS 
condition. Respiration rate is, of course, only 
an indirect measure of oxygen level which 
was directly measurable by Lovett Doust and 
Schneider. 

The major finding of this study was the 
differential response of schizophrenics and 
normals to WS, as measured by GSR and 
respiration rate. Although there are some 
inconsistencies in other related data, there is 
some evidence of response patterns indicating 
distinguishable connection to experiences of 
either fear or anger. Ax (1953) found a 
significantly higher number of GSRs in anger 
than fear conditions. Although the present 
study utilized magnitude rather than number 
of GSRs, it was assumed that for the present 
short-term durations, magnitude would be a 
more appropriate measure in reflecting levels 
of activation. Hence fear-anger differentia- 
tion, as demonstrated by Ax, is inferred in 
explanation of the present findings. Secondly, 
respiration rate was found to increase sig- 
nificantly more in fear than in anger by 
both Ax (1953) and Schachter (1957). Un- 
fortunately, the present investigators did not 
elicit any subjective reports regarding the 
subjects’ experiences during the experiment. 
It was not anticipated in the original design 
that group differences would emerge with 
respect to respiration and GSR. Consequently, 
in this study, there is no clear-cut evidence 
of either fear or anger as a predominant 
experience. 

Generally, GSR conditioning rate is as- 
sociated with increase in severity of anxiety 
(Schiff, Dougan, & Welch, 1949). In ad- 
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dition, there is evidence that respiration rate 
is higher for schizophrenics than for normals 
under stress-like conditions (Jurko, Jost, : 
Hill, 1952). Kryter (1950) has pointed ou 
that different kinds of sounds may have m- 
fluence on feelings. He stressed that irregular 
sounds tend to be more annoying than steat p 
cues. Since WS waves are of varying af 
tensity, resembling waves in water, it is 2° 
surprising that a form of annoyance bee. 
result from it, Regardless of fear-ang f 
and/or oxygen saturation speculations, it W? 
evident that the physiological responses A 
white sound produced in this experiment we 
clearly distinguishable between schizophreme 
and normals. 

Since respiration is under greater voluntary 
control than GSR, which is a purer ae 
festation of autonomic response, it is plaus: j 
that normals exercised greater voluntary co ir 
trol to the WS stimulation although the 
response may have been characteristic rë- 
anger or annoyance while schizophrenics  ; 
sponses were channeled through fear. 
would be characteristic of generalized 
of schizophrenics’ emotional control. me 

In addition, the possibility that . in 
degree of relative anoxemia is involve 
mental illness may serve as an additio 
explanation for increased respiration sa 
schizophrenics. Lovett Doust and Schne! en 
(1954) postulated that decrease of O39 a 
saturation is likely to be one of the cite g 
characteristics of mental illness. In vie”, 1S 
the present findings, varying levels ° 


Jack 


nal 


. + i iow 
intensity and different types of instru 
suggesting anticipated effects of J audi 


be manipulated in clarifying presume f g 
analgesic property of WS. Relationship ted 
GSR and respiration rate should be oot 
under conditions of WS within situat er 
which could be clearly distinguished 45 “allow 
fear or anger producing in order to $0 ip- 
up present propositions. Although the ev- 
structions and the sensory input wee 
tical for both groups, either differentia sen 
ception or discriminable responsivity be 

schizophrenics and normals was demonst jon 
At this point the most plausible explani of 
for present results is conceived in ter” tio? 
fear-anger constructs and oxygen § aari ust 
levels. Within the foregoing analysis 1 
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be pointed out that the fear-anger distinc- 
tion is a tentative one, requiring caution 
against general acceptance without more 
thorough investigation of subtleties in auto- 
nomic responsivity associated with different 
states of arousal of schizophrenics and nor- 
mals. Oxygen saturation must also be ex- 
plored in relation to physiological correlates 
of white sound. 
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AFFECTIVE STIMULI AND DISTURBANCE OF 
THOUGHT PROCESSES * 


GAYLE DEERING 2 


Teachers College, Columbia University 


The study investigated the effects of affectively toned and neutral adjectives 
on the incidental memory functioning of matched groups of 40 schizophrenic 
and normal female Ss. The adjectives which had been matched for familiarity 
and rated by judges were presented in a word association test in which the 
number of different associations, association recall, and stimulus word recogni- 
tion scores were measured. Conclusions are: (a) schizophrenic Ss gave a larger 
number of different associations to affective words than normal Ss (.01 level); 
(b) there was no evidence that schizophrenic Ss, in comparison with normal 
Ss, recalled relatively more associations to neutral words than to affective 
words; (c) there was no evidence that schizophrenic Ss recognized stimulus 


words as well as normals. 


Schizophrenia has been characterized as a 
disturbance in thought processes which often 
results in noncommunicative language. Theo- 
retical formulations and clinical observations 
suggest that affective stimuli impair schizo- 
phrenic patients’ thinking (Arieti, 1955; 
Bleuler, 1950; Fromm-Reichmann, 1959). 
Rapaport, Gill, and Schafer (1946) have 
elaborated upon the theory that the thinking 
of schizophrenic patients is easily disturbed. 
According to them, “good adjustment is 
characterized by personal and/or traumatic 
material not being mobilized easily, either in 
life or in the testing situation; while in mal- 
adjustment, such material is often mobilized 
even in situations which are only remotely 
relevant [p. 18].” Their theoretical formula- 


1 The article is based upon a dissertation submit- 
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extended to members of the dissertation committee, 
Laurance F. Shaffer (Chairman), Joel R. Davitz, and 
William N. Thetford, and to Rosedith Sitgreaves. 
In the collection of data, the writer is indebted to 
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State Psychiatric Institute, New York, New York; 
St. Vincent’s Hospital, New York, New York. 

2 Research associate, Institute of International 
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s show 


E ee a 4 . ient 
tion implies that schizophrenic pati S es in 


a greater disturbance in thought proces*" 
dealing with affective material than W 
neutral material. Jated 

Considerable research has accumU i 
supporting the hypothesis that affective “2 i 
uli are particularly disturbing to schizop* 
patients. Much of the research investie on} 
complex conceptual thinking Ta The 
1957; Wexler, 1955; Whiteman, 1954): pe 
main criticism of these studies has bee” sg- 
lack of equating the affective stimulus 
terial with the neutral stimulus materik or 
difficulty. In previous studies, the P re 
performance of schizophrenic patients ney 
sponding to affective stimulus material Jevel 
have been due to its greater difficulty con 
rather than to its affective content. ee g 
tinuing importance of the hypothesis, t08° zst 
with the conflicting interpretations ° ith 2 
research, suggests further exploration { itive 
research design that avoids complex c08” 
tasks. rela” 

Incidental memory appears to be a ot 
tively simple cognitive function that a ge 
place schizophrenic patients at a disa H ew 
and was selected for exploration. SO et 
research studies have investigated © |. 
Tuptive influence of affective sti coe 
schizophrenic incidental memory ots a 
The present study explored the effet pion 
affective stimuli on schizophrenic as er 
recall, and recognition responses to 2” 


mental word association test. 
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DISTURBANCE OF THOUGHT PROCESSES 


Disturbance Indicators in Word Association 
and Related Research 


The word association test was selected as 
the research instrument because it elicits 
information about memory functioning while 
avoiding complex memory tasks. Evidence of 
association and recall disturbance was based 
on the following formulation of disturbance 
on word association tests. In associating to 
a stimulus word, an individual without emo- 
tional maladjustment generally responds to a 
word association test with conventional asso- 
ciations conceptually related to the stimulus 
word (Rapaport, Gill, & Schafer, 1946). In 
the emotionally disturbed individual, devia- 
tions from the conventional-conceptual mode 
of responding are to be expected and are 
indicators of association disturbance. 

Good recall indicates stable thought or- 
ganization which, according to Rapaport, 
Gill, and Schafer (1946), is characteristic of 
Normal individuals; whereas poor recall sug- 
gests instability of thought organization 
Which is characteristic of severely disturbed 
individuals and indicates a disturbance in 
recall, 

In a study which investigated association 
and recall disturbances in schizophrenic, 
neurotic, and normal individuals, Rapaport, 
Gill, and Schafer found evidence of disturb- 
ance in the schizophrenic, neurotic, and 
normal individuals in responding to affective 
stimulus words. Their findings are incon- 
clusive because the stimulus words had not 
een equated for familiarity and the patient 
groups had not been matched with the 
Control group. Jones (1957) who exercised 
8teater care in matching his groups and con- 
trolled for word familiarity found schizo- 
Phrenic patients gave more unique associa- 
tions to affective stimulus words than normals. 

Recognition is another aspect of memory. 

cause recognition of stimulus words does 
Rot require the subject to recall his own 
1 Ponses, it was assumed in the study that 
cognition of stimulus words, in contra- 
Istinction to recall of associations, measured 

lesser degree of personal involvement and 
active participation. Recognition was con- 
Sidered an easier task than recall of associa- 
Ìons, No reported word association studies 
ave investigated the recognition of stimulus 
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words by schizophrenic subjects. However, 
several tachistoscopic studies have investi- 
gated the recognition of stimulus words. 
McGinnies and Adornetto’s (1952) and 
Ressner’s (1950) findings suggest that schizo- 
phrenic and normal subjects do not recognize 
affective stimulus words as well as neutral 
words. However, the authors did not match 
the affective words with the neutral words 
for familiarity. In Newberry’s study (1954), 
in which stimulus words equated for fami- 
liarity were presented tachistoscopically, 
neither speed of recognition nor recall of 
affectively toned words differentiated between 
the schizophrenic and normal subjects. Fail- 
ure to find a difference in recognition of 
stimulus words between schizophrenic and 
normal subjects suggests that schizophrenic 
subjects perform as well as normal subjects 
in a recognition task in which personal, active 
involvement is at a minimum. 

Theoretical formulations and research find- 
ings suggest that schizophrenic subjects give 
more different associations and recall fewer 
associations to affective stimulus words than 
to neutral words than do normal subjects; 
schizophrenic subjects recognize affective and 
neutral stimulus words as well as nonschizo- 
phrenic subjects when the stimulus words are 
controlled for familiarity. 


Hypotheses 

1. In word association, schizophrenic pa- 
tients, in comparison with nonpatients, give 
a greater number of different associations to 


affective stimulus words. y : 
2. In recall of associations, schizophrenic 


patients, in comparison with nonpatients, 
recall relatively more associations to neutral 


words than to affective words. x 
3. In recognition of stimulus words, schizo- 


phrenic and nonpatient individuals show no 
difference in recognition of affective and 


neutral stimulus words. 
Although no specific hypotheses were stated 
regarding educational level, the effects of 


education were investigated. 


METHOD 
Subjects 


Forty first- 
pitalized less t 


admission schizophrenic patients hos- 
han 11 months with a medical diag- 
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nosis of schizophrenia formed the patient group. 
None of the schizophrenic subjects had received 
shock within the previous 6 months or had been 
diagnosed as having organic impairment. 

Forty nonpatient subjects who appeared able to 
function adequately in their vocational role and who 
were not receiving psychotherapy comprised the 
normal group. All subjects were United States born 
white females between the ages of 21 and 35 years; 
one half were high school graduates, the remain- 
ing one half had some college education. Non- 
patient subjects’ vocations were roughly matched 
with schizophrenic subjects’ vocations prior to hos- 
pitalization. The mean ages for the high school 
groups were 28.05 for schizophrenic and 27.25 for 
normal; for the college groups, 25.65 for schizo- 
phrenic and 25.20 for normal. The vocabulary raw 
scores on the Shipley Institute of Living Test for 
the high school groups were 31.70 for schizophrenic 
and 32.05 for normal; for college groups, 34.45 for 
schizophrenic and 35.50 for normal. The differences 
between the diagnostic categories for the high school 
and college groups separately on these variables 
appear to be nonsignificant. 


Procedure 


Each subject was seen individually by the same 
experimenter. The experimental word association test 
was administered according to standard procedure, 
Upon completion of the test, each subject was asked 
to recall her original associations as the experimenter 
repeated the stimulus words which had been ran- 
domized a second time. Following recall, the recogni- 
tion test of the stimulus words was administered. 
Each subject was asked to encircle those words 
which had been read to her by the experimenter, 
The vocabulary part of the Shipley Institute of 
Living Scale was administered last. 


Experimental Word List 


Adjectives equated for frequency from 1 to 25 
times per million words (Thorndike & Lorge, 1944) 
had been rated by 20 inpatient and 20 normal 
judges to determine the affective and neutral values 
of the words. The judges, who were matched on 
age, sex, and education with the patient and normal 
groups, rated 175 adjectives on a 7-point scale 
ranging from very pleasant to very unpleasant. The 
rating scale was administered individually twice over 
a 1-week interval. Eighty adjectives 3 judged initially 


3 GIFTED, GRACEFUL, LOVING, SKILLFUL, YOUTHFUL, 
ALERT, JOYFUL, RESTFUL, SINCERE, LOYAL, TACTFUL, 
THANKFUL, WITTY, FRUITFUL, HOPEFUL, TRUSTING, 
TRUTHFUL, GALLANT, HELPFUL, CORDIAL, 
ROTTEN, UNJUST, HARMFUL, LONESOME, 
SELFISH, STUPID, THOUGHTLESS, DREARY, 
HATEFUL, JEALOUS, WORTHLESS, BARREN, BLOODY, 
CALLOUS, FRIENDLESS, SCORNFUL, VULGAR, CONCAVE, 
EARTHEN, FRECKLED, LANKY, METRIC, RANDOM, NEU- 
TRAL, OPTIC, OVAL, POROUS, CUBIC, BUSHY, COUNTLESS, 
FISCAL, FOCAL, FLEETING, MASSIVE, NIGHTLY, POLAR, 
SPIRAL, SPOTTED, GRAPHIC, HUSKY, LENGTHY, PLURAL, 


BRUTAL, 
PAINFUL, 
FILTHY, 
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as pleasant, unpleasant, or neutral by at least 65% 
of all judges comprised the experimental word i 
An overall reliability coefficient of 89 based on ee 
subjects’ test-retest ratings suggests that the punia 
had a high degree of stability. The final list con 
tained five dummy adjectives at the beginning aM 
end of the list. 


Experimental Recognition Test 


For the recognition phase of the experimenty E 
affective and 20 neutral stimulus words were eee 
domly selected and embedded in a list of 470 rei 
adjectives. A measure of test reliability was piste 
lated by the split-half method. With the Spearman 
Brown correction, a reliability coefficient of .82 
obtained on the embedded adjectives. 


RESULTS AND CONCLUSIONS 


The results are presented in terms of ee 
syneratic association, association recall, a 
stimulus word recognition scores. All so 
were group scores measured across WOY to 
The variances of these scores were found 
be homogeneous. Certain trends in the ney 
which appeared significant, even though t 
had not been hypothesized, are reported. 


Idiosyncratic Associations 
n scores 


socia” 
tral 


The mean idiosyncratic associatio 
refer to the mean number of different as 
tions given to the 40 affective and 40 neu nic 
words by the subjects in the sco Pog: 
and normal groups. High idiosyncratic 4 n- 
ciation scores for a group indicate an 4 ives 
dance of different associations. Table 1 8" ie 
the means and standard deviations © tive 
idiosyncratic association scores for affec 
and neutral stimulus words. first 

Tn terms of quantitative scores, the ju 
hypothesis predicted that affective SO n 
words elicit a greater number of die 
associations from subjects in schizop ars. 
groups than from subjects in normal 8" iye 


An analysis of variance revealed an jeve} 
of 24.47, significant beyond the - sis 
between diagnostic categories. Hypoth®* at 


was supported. There was no evidenc? of 

educational level influenced the num hizo” 

different associations for either the $° 

phrenic or normal subjects. ans S 
In analyzing the data on association 


SPINAL, SPONGY, SLANTING, WOOLY, BULKY: 
COSMIC, ELDEST, FLAKY, FOAMY, GRAINED, 
OLDEN, TRIBAL, YEARLY. 
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TABLE 1 


Means AND STANDARD DEVIATIONS OF IDIOSYNCRATIC 
ASSOCIATION SCORES FOR AFFECTIVE AND 
NEUTRAL STIMULUS WORDS 


Affective words Neutral words 


Group and 

education M SD M SD 
Schizophrenic 

High school 13.42 3.10 10.98 3.54 
Normal 

High school 10.78 330 10.42 3.30 
Schizophrenic 

College 13.82 2.88 10.90 3.20 
Normal 

College 11.85 248 1032 292 


ie Note.—There are 40 words in each category and 20 subjects 
each group. 


neutral words by analysis of variance, there 
Were no differences found for idiosyncratic 
association scores for neutral words between 
Schizophrenic and normal groups nor between 
educational levels. 


Association Recall 


t The mean association recall scores refer to 
ae mean number of correctly recalled associa- 
tions given to the 40 affective and 40 neutral 
Words by the subjects in the schizophrenic 
and normal groups. Table 2 gives the means 

standard deviations of association recall 
Scores, 


TABLE 2 


M 
TEANS AND STANDARD DEVIATIONS OF ASSOCIATION 
RECALL SCORES FOR AFFECTIVE AND 
NEUTRAL STIMULUS WORDS 


iv Neutra Combined 
Group md Affective Neutral ‘o 
“cation M SD M SD M SD 
Schizophrenic 
igh school 10.90 2.96 15.58 2.75 
sae 12.15 296 16.45 2.21 
Combined 27.54 6.56 
i 
ermal 
Hi 
“igh school , 238 
3. 3.49 16.78 x 
“ollege 1308 3001747 2.21 
Combined 30.21 6.85 
in Note — There are 40 words in each category and 20 subjects 
ch group, 
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The second hypothesis predicted that the 
subjects in the schizophrenic groups recall 
relatively more associations to neutral words 
than to affective words in comparison with 
the subjects in normal groups. To test the 
second hypothesis, it was necessary to investi- 
gate the interaction between type of word 
and diagnostic category. The results of an 
analysis of variance found this interaction to 
be nonsignificant. Hypothesis 2 was not 
upheld. 

Differences at the .01 level of significance 
were found between types of words, 
diagnostic categories, and educational levels. 
These differences show that more associations 
to neutral words were recalled than associa- 


TABLE 3 


MEANS AND STANDARD DEVIATIONS OF RECOGNITION 
SCORES FOR AFFECTIVE AND NEUTRAL 
StmuLus Worps 


Affective Neutral Combined 
Group and 
education M SD M SD M SD 
Schizophrenic 
High school 13.60 2.58 15.90 2.84 
College 15.20 2.02 16.70 2.79 
Combined 30.70 4.95 
Normal 
High school 14.95 2.95 17.55 1.76 
College 16.10 3.40 17.95 2.01 
Combined 33.28 5.12 


Note.—There are 20 words in each category and 20 subjects 
in each group. 


tions to affective words by subjects in all 
groups; more associations were recalled by 
subjects in normal groups than by subjects 
in schizophrenic groups; and more associa- 
tions were recalled by subjects in college 
groups than by subjects in high school groups. 


Stimulus Word Recognition 


The mean recognition scores refer to the 
mean number of times the 20 affective and 20 
neutral stimulus words were recognized by 
subjects within the schizophrenic and normal 
groups. The means and standard deviations 
are given in Table 3. ‘ ; 

The third hypothesis predicted that schizo- 
phrenic individuals recognize affective and 
neutral stimulus words as well as normal 
individuals. To test Hypothesis 3, it was 
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necessary to investigate the effect of diag- 
nostic categories on recognition scores. An 
analysis of variance resulted in an F value 
of 22.25, significant beyond the .01 level, 
for diagnostic categories. Subjects in schizo- 
phrenic groups did not recognize affective and 
neutral stimulus words as well as subjects in 
normal groups. Hypothesis 3 was not upheld. 

Differences at the .01 level of significance 
were found between types of words and edu- 
cational levels. The findings show that neutral 
stimulus words were recognized more fre- 
quently than affective stimulus words by the 
subjects in all groups; and more stimulus 
words were recognized by the subjects in 
the college groups than by the subjects in the 
high school groups. 


DISCUSSION 


The present finding showing that schizo- 
phrenic subjects, but not normal subjects, 
increased their idiosyncratic associations to 
affective stimulus words is in agreement with 
Jones’ (1957) findings. This increase in 
schizophrenic idiosyncratic associations to 
affective stimulus words appears to define 
affectively toned material as a variable which 
influences schizophrenic associations. 

Similar to the results reported by Rapaport, 
Gill, and Schafer (1946), the present study 
demonstrated that schizophrenic and normal 
subjects had poorer recall of associations to 
affective stimulus words than to neutral 
stimulus words. These findings suggest that 
Rapaport, Gill, and Schafer’s theoretical 
formulation upon which the first two hypothe- 
ses were based should be re-evaluated. 
Rapaport, Gill, and Schafer based their 
formulation regarding the disruptive influence 
of sexual, anal, oral, familial, and aggressive 
stimulus words on the following assumption. 


It appears then that stimulus-words related to 
those emotional areas which according to clinical 
psychology are the likely areas of conflict, do call 
into action the internally existing conflicts, as evi- 
denced by the disruption of the association and 
reproduction processes [p. 83]. 


Can it be assumed that the affective stimu- 
lus words in the present study touched on 
areas of conflict because these stimulus words 
elicited more idiosyncratic associations from 
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the subjects in the schizophrenic groups than 
from the subjects in the normal groups 4” 
resulted in poorer recall for the subjects ™ 
all groups? This assumption appears to be 
based on circular reasoning which mitigates 
the usefulness of Rapaport, Gill, and Schafer $ 
(1946) formulation. An alternative explana 
tion is the possibility that affective stimulus 
words, even though they are equated fo 
familiarity with neutral words, are capable © 
eliciting a greater number of associations than 
neutral words. This interpretation is further 
strengthened by the present finding of poorer 
recall of associations to affective words 
normal subjects as well as schizophren'’ 
subjects. 
To support Rapaport, Gill, and Sch E 
(1946) assumption, it would appear po 
sary to demonstrate that normal individua F 
presumably free from emotional maladjus i 
ment and internally existing conflicts, rea 
their associations to affective stimulus WO" 
as well as those to neutral stimulus WO" Z 
whereas schizophrenic individuals, WPO a 
emplify severe maladjustment, show ue 
disruption in recall of associations to affect 5 
stimulus words than to neutral stimu 
words. That normal and schizophrenic “eit 
jects demonstrated poorer recall of a 
tions to affective stimulus words tha? 
neutral stimulus words seems to indicate 
the nature of the stimulus words rather 
the nature of the subjects is responsible 
the difference in recall scores. ti 
Whether or not recall of associations 
the affective stimulus words is related t° to 
disruptive effects of affective content oa 
a plethora of possible associations tO ding 
stimulus words raises a question rega! sti- 
experimental control. Laffal (1955) E i 
gated response faults on words equate rre 
familiarity. He found recall failures © ot 
lated .646 with the number of aise In 
associations given to the stimulus Wo" rated 
the present study, recall failures cor? jon 
-334 with the number of different asso®!? cor 
given to stimulus words. Although t 
relation is considerably lower than D 
it still demonstrates a positive relau 
between recall failures and idiosyncrat!® 
ciations. Laffal suggests that in oF 
evaluate the effect of emotional fa 


afer’s 


DISTURBANCE OF THOUGHT PROCESSES 


words, stimulus words with few associations 
should be selected. His suggestion seems 
relevant to the present study. Thus far, the 
only available control on words measures the 
frequency of occurrence in printed reading 
matter. It would appear desirable to construct 
an association frequency measure for words 
equated for familiarity along the lines set 
up by Laffal. 

The present results on recognition are in 
disagreement with the tachistoscopic study in 
which familiarity was controlled (Newberry, 
1954). The tachistoscopic study showed no 
difference in recognition between the affective 
and neutral stimulus words. The disagree- 
ment between these findings may be due in 
Part to the tasks which the subjects were 
asked to perform. In the present study, the 
Subject was not asked to attend carefully to 
the stimulus words, whereas in the tachisto- 
Scopic studies, the subject was requested to 
Concentrate on the words. This difference in 
degree of attention may explain in part the 
findings. The present findings do not support 
the assumption that recognition ability is 
a relatively easy task which does not dif- 
ferentiate between schizophrenic and normal 
Individuals. 
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THE D 48 TEST AS A MEASURE OF GENERAL ABILITY 
AMONG GRADE SCHOOL CHILDREN 
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The D 48 (or “Dominoes”) test of intellectual 
but is almost unknown in the United States. 

on g, its ease of administration, and its potential: 
in cross-cultural comparisons, the test 


candidate for study by 


take evaluations of the test. 


Although widely used in Europe as a 
measure of intellectual ability, the D 48 or 
“Dominoes” test is more or less unknown in 
the United States. For example, two recent 
textbooks on testing (Anastasi, 1961; Cron- 
bach, 1960) make no mention of the D 48, 
and the instrument is also uncited in 
Buros’ Tests in Print (1961), Fourth Mental 
Measurements Vearbook (1953), and Fifth 
Mental Measurements Yearbook (1959). In 
Europe, however, the D 48 is a standard item 
in texts on measurement (cf. Callonghi, 1956; 
Pichot, 1949), and carries a lengthy research 
bibliography (e.g., Ferracuti & Rizzo, 1959; 
Pasquasy & Doutrepont, 1956; Pichot, 
Rennes, & Taver, 1953). 

Beyond the need of psychologists in the 
United States to be informed about current 
practices in Europe, there are additional 
reasons for paying attention to the D 48 test. 
One of these is the potentiality of the D 48 
for cross-cultural measurement of ability. The 
items in the test consist of domino sequences, 
and hence draw on a background of experi- 
ence found in nearly all literate societies, 
The game of dominoes is apparently as 
ancient in China as playing cards, for which 
specific references dating back at least 1,000 
years may be found. The first record of the 
game in Europe was in Italy and France in 
the middle of the eighteenth century, and 
from there the game was transmitted to 
England. Anthropologists have also recorded 


American psychologists. 

California school children it was found that D 48 
scores were highly similar to data previously availabl 
showed satisfactory predictive validity 
scholastic achievement in the 5th and 
validity of the D 48, and warrant the ho: 


ability is widely used in Europe, 
Because of its high loading 
ity as an instrument for use 
would appear to be an important 
In an initial study of 86 
item difficulties and mean 
e in Europe. The test also 
(coefficients of +.58 and +.45) for 
6th grades. The results uphold the 
pe that other researchers will under- 


r nuit 
evidence of the game among the In 


Eskimos of the Hudson Strait region. may 
In the game of dominoes, therefore, * 
be found kinds of stimuli and modes © 
ceptualizing which are universal or well 
universal. The ordinary approach 1n a 
cultural (or “culture free”) testing ? un 
search for stimuli which are new and/0" Is 
known to all, so as to minimize differe? å 
of past experience. An equally ee ic 
proach would be to utilize test stimuli ie 
are known to everyone, and which eee 
a considerable fund of childhood exper! É 
so that familiarity with the stimuli an pm 
conceptual modes employed in handling sed 
may be equated. The D 48 test, being "he 
on the game of dominoes, would seem 
a worthy candidate for cross-cultural t 
under this second principle. D 48 
A second reason for attending to the pal. 
test is that it is almost entirely non Vuc 
The problems are nonverbal, and the ins" g, 
tions are so simple as to make almost 1° ny 
mand upon verbal ability. There are, tru- 
testing situations in which nonverbal ee 
ments are to be desired, and such instrum ues 
are by no means abundant in our catalo; 
of tests. it iS 
A third virtue of the D 48 test is that pP 
highly saturated with g (Vernon, 19>% 


enigh 
ross" 


esting 


on £ 
23). In fact, the loading of the D 48 the 


(.87) was even higher than that Be 
Progressive Matrices (.79) in Vernon S 
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ae Saga te the D 48 test was entirely 
a the nuisance loading on the spatial 
K which is a problem with the Matrices. 
a Sourti advantage of the D 48 test is 
d ca K easy to administer and to score, 
(30 an be given in a brief period of time 
minutes) either individually or to groups. 


DEVELOPMENT OF THE TEST 


1 oma testing method was first developed in 
rate Aad Anstey and Illing,? for the Directo- 
Var oe of Personnel of the British 
aae N The first test, called SP 11, 
as 48 items. A revised 48-item version 
n co by Anstey, following World 
Th nee w use by the National Institute of 
es i Psychology (NIIP) in London. 
25 ha ae prepared two other versions, of 
vil s 6 items each, for use by the British 
graduat ervice Commission with university 
Use in o All four editions are in current 
version england. A new experimental 42-item 
a in color has just been completed by 
ey, 

die rench adaptation of the NIIP edition 
1940s aden test was prepared in the late 
lose 7 Binois (cf. Pasquasy & Doutrepont, 
sychon 22), and published by the Centre de 
name ae Appliquée in Paris under the 
ih Ital 48.” This version is also published 
Stee by the Organizzazioni Speciali of 
ates : and was brought out in the United 
Rists pa 1962 by the Consulting Psycholo- 
C ie (CPP) of Palo Alto. It is the 
study, sion which was used in the present 
IAN 48 test contains 44 problems, each 
tinci rg of a series of dominoes defining a 
i an e of progression. The last domino in 
equi T is left blank, and the subject is 
half P to enter the proper numbers on each 
i a the domino. The principles involved 
Proble, progressions vary from problem to 
i em, and include simple addition, identi- 
sio 2 identities with reversals, double progres- 
ibn subtraction, subtraction with progres- 
eral etc. The sequence of items 1s In gen- 
from less to more difficult, with the 


BY 
b Edgar Anstey, personal communication, Novem- 
er 17, 1961. 
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exception of five or six quite difficult items 
in the middle of the series. 

Instructions and sample problems are given 
on the cover of the test booklet, and can be 
presented, in 4 or 5 minutes. Testing time is 
set at 25 minutes, and most of the studies 
with the D 48 have observed this limit. 
However, the D 48 can be used as a power 
test and given without restriction on time 
(cf. Ferracuti & Rizzo, 1959). 


PROCEDURE 


as given to the 86 fifth and sixth 
‘alifornia elementary school.? 
re envisaged for the study: 
(a) to establish item difficulties and to relate these 
to similar data from studies in Europe, (b) to 
relate total scores to those from comparable Euro- 
pean samples, and (c) to evaluate the validity of 
the D 48 as a predictor of scholastic achievement. 
The percentage of students answering each item 
correctly is given in Table 1, along with the rank 
order for each item. The progression of difficulty 
is in general agreement with the item position, but 
there are exceptions. For example, Item 9 is 
twentieth in the ranking based on difficulty, and 
Item 33 occupies the rank of 19 on difficulty. 


The D 48 test w: 
grade students in a C: 
Three specific goals we 


RESULTS 


How do these percentages and these ranks 
compare with those observed in European 
studies? Ferracuti and Rizzo (1959) pub- 
lished item percentages for a sample of 200 
university students in Italy, tested under 
power conditions. The correlation between 
these percentages and those offered in Table 
1 was +.83. Considering the difference in 
samples and testing conditions, this would 
appear to be a rather high degree of con- 
sistency in the ranking of items by difficulty. 

Rankings for the items based on a sample 
of 78 students, ages 11:6 to 13:6, were given 
by Vautrin (1954). The rank-order correla- 
tion between Vautrin’s list and that in 
Table 1 was +.89- 

Finally, Cloet (cited by Pasquasy & 
Doutrepont, 1956) classified the 44 items 
into three levels of difficulty on the basis of 
testing in a sample of 285 Flemish children. 
Arbitrary scores of 3, 2, and 1 were assigned 

2 We are indebted to Robert Barney for his kind- 


ness in authorizing the testing, and for his help in 
assembling the scholastic records and other test 


scores. 
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TABLE 1 


PERCENTAGE OF STUDENTS (N 


= 86) ANSWERING E, 


RANK ORDER PLACEMENTS 


ND 
ACH ĪTEM CORRECTLY ON THE D 48 TEST, AY 
FoR Eacu ITEM 


% % % 
answer- answer- answer- ak 
Item ingcor- Rank Item ing cor- Rank Item ing cor- Per 
number rectly order number rectly order number rectly O" 
1. 97 1 16. 74 11 31. 19 K 
2; 97 1 17. 16 30 32. 14 ô 
3: 94 5 18. 14 31 33. 51 a 
4. 95 3 19, 65 16 34. 8 37 
5; 86 7 20. 63 17 35. 5 26 
6. 95 3 21. 49 21 36. 33 27 
Te 70 13 22, 7 36 37. 27 0 
8. 70 13 23. 35 25 38. 2 39 
9. 50 20 24, 10 33 39, 3 40 
10. 85 8 25. 9 34 40. 2 42 
11. 76 10 26. 22 28 41. 0 37 
12. 84 9 27. 42 23 42. 5 32 
13. 91 ó 28. 48 22 43 0 2 
14, 72 12 29. 42 23 aa 0 $ 
15. 70 13 30. 52 18 ` 


to Cloet’s three categories, and then these 
scores were correlated with the percentages in 
Table 1, giving a coefficient of +.91. 

It may therefore be concluded that the 
relative difficulty level of the 44 items is 
rather constant for different age groups, and 
for testing in different countries and in dif- 
ferent languages. 

The comparison of average scores for a 
number of samples is found in Table 2. Data 
from Cusin (1959) show highest scores for 
engineering college graduates in Italy, pro- 
ceeding through science graduates, economics 
and commerce, and lowest values for grad- 
uates in jurisprudence, letters, and philos- 
ophy. Cusin interpreted this progression as 
evidence for the validity of the D 48. 

Maury’s (1954) study in France shows a 
steady upward progression of D 48 scores for 
both sexes as samples are drawn from succes- 
sively higher levels of the educational system. 
In samples of Belgian and Swiss school 
children, Pasquasy and Doutrepont (1956) 
demonstrated a similar hierarchy of D 48 
means for successively older groups. Our 
data, although deriving from only two classes, 

: in showing a higher average score for 
ny raders than for fifth grade students. 
ota tata in Table 2 may also be reviewed 


a ent 
+ differ 
for comparison of absolute scores in pjecť 


u 
countries. Our fifth and sixth grade 5 


com 
showed modal ages of 10 and Pt 1 
parable European samples are the he 


13-year-old students of Vautrin, Pas u 
10:6- to 11:5-year-old sample of d mea 
and Doutrepont. Our subjects ha cts it 
of 18.68 and 20.02; Vautrin’s subje oy 9 
a mean of 20.76, and those of gaa ne 
Doutrepont a mean of 20,00. The om tha! 
ence here is close enough to su pan re ant 
average scores for subjects of similar ill b° 
educational level in different countries 
about the same. 

A word might also be added about 
differences indicated in Table 2- 
thorough consideration to date of 
is that offered by Pasquasy and D 3 
who concluded (p. 31) that sex 
were not sitions ta could therefo" test, 
disregarded in providing norms on pere w 
In the four instances in Table 2 WP® ighe 


e$ 
e 5! 
th ost 
cpl 
this a 
utreP ce 


n en! 
iffer o be 


ig” e 
differences are shown, males score ar 
three times and females once. Our A d 5 
sample (W = 34) included 15 boy: 


: ed, 
girls, and the 52 sixth graders inclu He 
boys and 24 girls. In both grades Ai 
had a slightly higher mean a ee 
boys. If these findings are added 


than 
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oe 2, the differences would favor males 
he times and females three times; the 
a conclusion would appear to be that 
a wn by Pasquasy and Doutrepont. How- 
er, this question should not be considered 
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closed, as more data and studies of larger 
samples are needed before a definitive answer 
can be proposed. 

The predictive validity for the D 48 test 
in the fifth grade sample is shown in Table 3. 


TABLE 2 
ILLUSTRATIVE Norms oN THE D 48 TEST 
Samples N M SD 
1. From Cusin (1959), Italy 
a. Graduates in engineering 160 32.46 4.67 
b. Graduates in science 28 30.71 5.10 
c. Graduates in economics and commerce 47 30.04 4.79 
d. Graduates in jurisprudence, letters, and 28 27.96 5.98 
philosophy 
2. From Ferracuti and Rizzo (1959), Italy 
a. University students, male 100 26.70 5.60 
b. University students, female 100 24.80 6.20 
3. From Maury (1954), France 
a. Primary school graduates 
1. Males 307 14.25 6.06 
2. Females 246 13.15 6.82 
b. Students, quatrième classe (ninth grade) 
1. Males 144 22.87 5.40 
2. Females 191 23.73 6.94 
c. Students, deuxième classe (eleventh grade) 
1. Males 60 27.70 4.68 
d. Students, Baccalauréat (twelfth grade) 
1. Males 73 30.70 5.04 
2. Females 118 27.82 5.12 
4. From Pasquasy and Doutrepont (1956), France 
a. Young men, ages 20-25 522 19.78 8.61 
b. Engineers and military officers, males 118 30.34 5.11 
5. From Pasquasy and Doutrepont (1956), Belgium 
and Switzerland 
a. Students, male and female, ages 10:6 to 11:5. 34 20.00 5.75 
b. Students, both sexes, ages 11:6 to 12:5 127 19.07 6.34 
c. Students, both sexes, ages 12:6 to 13:5 199 19.33 6.45 
d. Students, both sexes, ages 13:6 to 14:5 253 19.93 6.62 
e. Students, both sexes, ages 14:6 to 15:5 181 20.21 7.14 
f. Students, both sexes, ages 15:6 to 16:5 124 22.84 7.05 
g. Students, both sexes, age 16:6 and over 87 25.57 6.72 
6. From Vautrin (1954), France 
a. Students, both sexes, ages 11 to 13 78 20.76 5.36 
7. Present study, United States 
a. Fifth grade students, both sexes 34 18.68 5.38 
both sexes 52 20.02 5.86 


b. Sixth grade students, 
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TABLE 3 


CORRELATION MATRIX AMONG THE VARIABLES INDICATED, 
For A Firtu Grape Crass (N = 34) 


Variables D 48 Arithmetic Language Reading Spelling 
1. Grade point average 58 42 52 1 26 
2. D 48 test = 43 43 St 25 
3. Stanford Achievement Ea 51 z4 .05 
Test: arithmetic 
4. Stanford Achievement a 42 02 
Test: language 
5. Stanford Achievement = 18 
Test: reading 
6. Stanford Achievement = 
Test: spelling 
34 
m r F se 
The D 48 test was given in the fall, as were (fourth grade) were available for eet) | 
the four Stanford Achievement Test subtests. Fi 


Grades covering 12 aspects of school per- 
formance, viz., arithmetic, art, citizenship, 
music, oral language, physical education, 
reading, science, social studies, spelling and 
writing, and written language, were obtained 
the following June, and an average computed 
for each child. 

The D 48 score correlated +.58 with this 
average, a higher value than those observed 
for any of the four parts of the Stanford 
Achievement Test. If the ‘“nonacademic” 
subjects of art, citizenship, music, and physi- 
cal education are removed from the grade 
point average the correlations are essentially 
unchanged. It should also be noted that D 48 
correlates more highly with grades than with 
any of the four parts of the Stanford Achieve- 
ment Test. Grades from the previous grade 


students. The correlation between the lig 
and fifth grade GPAs was +.60, only i 
higher than the +.58 between D 48 4? 
grade GPA. are 
Similar data for the sixth grade on 
offered in Table 4. Once again, testi in the 
done in the fall, and then the grades t gers 
same 12 categories listed for the fifth g and 
were collected in the following ia 48 
averaged. The predictive validity of : value? 
test was +.45, again higher than the 
observed for the achievement test- 
If the GPA for sixth grade 1s 
only the eight “academic” subjects, 
relation with the D 48 test remains # G 


was 


the previous year (fifth grade) “tedeh 


available, and this prior GPA correl 


TABLE 4 


CORRELATION MATRIX AMONG THE VARIABLES INDICATED, 
FOR A SrxTH Grape Crass (N = 52) 


va D48 Comprehension Reasoning Total 
1. Grade point average 45 25 37 35 
2. D 48 test E 27 s1 AS 
3. Stanford Achievement Test: 
a. Arithmetic comprehension TA ot 86 
b. Arithmetic reasoning ot D5 


c. Total score 


hi 
fifth | 


| 
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eg the sixth grade GPA; for this sample 
e D.48 test was a better predictor of sixth 
po GPA than were fifth grade GPAs. The 
pe iusion appears justified that the D 48 is 
A adequate predictor of scholastic perform- 
nce at the grade school level. 


SUMMARY 


$ The D 48 test of general ability is widely 
nown and used in Europe, but is little 
‘am the United States and is unmen- 
ru, in standard references on psychological 
and ng. This state of affairs is unfortunate, 
pe be rectified, as the test has a 
ge attractive features, including 
easy t it is highly saturated with g, it is 
h ie and to score, it is almost entirely 
Potenti al, and it possesses an intrinsic 
: iality for cross-cultural usage. 3 
un ion therefore deemed appropriate to 
ty ake a study of the D 48 in this coun- 
diem the intent of evaluating (a) item 
a ties, (b) comparability of American 
vali a pean norms, and (c) the predictive 
Fo y of the instrument. t 
ten r this investigation, 86 grade school chil- 
has tested. Item difficulties were calcu- 
With and were found to correlate very highly 
ae data from European samples. 
S and standard deviations for the Ameri- 
Rite ples were likewise very close to the 
Euro observed for similar samples tested in 
Bra T Finally, the predictive validity for 
58 averages in fifth and sixth grades were 
being on +-.45, respectively, both coefficients 
tests ‘a than any observed for the sub- 
these the Standard Achievement Test on 
t ame samples. 
appli Was concluded that the D 48 test is 
cable to school children, that its results 
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are meaningful and interpretable according 
to norms generally available, and that the 
test possesses adequate predictive validity. 
It is hoped that these results and the at- 
tendant discussion will prompt other investi- 
gators to initiate additional studies of the 
D 48. The test would appear to merit a 
thorough and widespread consideration by 
psychologists in the United States. 
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SCALING THE SCALES: 


USE OF EXPERT JUDGMENT IN IMPROVING THE VALIDITY 


OF QUESTIONNAIRE SCALES + 


K. WARNER SCHAIE 
University of Nebraska 
It is hypothesized that the concurrent validity of 


creased by the use of item weights obtained by 
using the conventional unit weights. Items from 11 


questionnaires can be in- 
expert scaling instead of 
scales of the High School 


Personality Quiz (HSPQ) were rewritten in a form permitting application 
of the constant sum method to the judgment of item weights. The scales were 
rated by 30 psychologists and ratio scales constructed for each factor scale. 
The HSPQ records of 43 delinquent girls were correlated with behavior 
ratings on 42 traits from Cattell’s “normal trait sphere” as judged by the Ss’ 
cottage parents. Validity coefficients are given for the HSPQ unit weight 


scores and for scores weighted by expert- 
low magnitude increments in validity. 


derived ratio weights. Results show 
Effects of scaling on the problem of 


agreement among questionnaire and rating data were also evaluated with 


similar results. 


The traditional procedure in constructing 
questionnaire scales consists of defining an 
item pool and using empirical methods to 
derive subsets of items which are relevant to 
some observed or postulated dimension. Unit 
weights are usually assigned to all items 
because there is no information on relative 
item weights. Even when factor weights of- 
fering greater precision are available they are 
generally ignored for practical reasons or 
because it is argued that increments in pre- 
cision due to differential item weights will 
be trivial. 

The items selected for a given question- 
naire scale on empirical grounds may in fact 
be the best items available but there is no 
assurance that the resulting unit weight scale 
has certain properties which are likely to 
maximize validity. An important property 
for this purpose would be the requirement 
that the questionnaire scale be a ratio scale. 
Factor analytic or other empirical procedures 
of scale construction can obviously provide 
no assurance of this property. 

imi version of this paper was pre- 

Poe ae ae meeting of the Society for 
Multivariate Psychology, November 1962, at Chi- 
Illinois. The author is grateful for the co- 

Ca ee of his colleagues and graduate students 
oF parformaed the expert scaling, and to the staff 
w the State Training School for Girls, Geneva, 


i i i testing and 
for their cooperation in the g 
eg E of the questionnaire respondents. 
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It may furthermore be argued mari 


0 
ultimate criterion of the utility of 4 ques ait 
naire will be found in its ability t° ple bY 
traits or behaviors which are confirm@’ pis 
direct observation, The psychologist P may 
role as an expert on human behav ation 
have relevant information on the waits: It 
between self-report and observable a be 
is therefore proposed that an oe and 
made to utilize the intuitive process istic” 
capitalize upon the training and SOP’ judg 
tion of the psychologist by utilizing oa dat? 
ment to improve the weights assigne ee? 
derived from actuarial methods. AS ao not 
pointed out by Meehl (1954), this ae er io! 
mean that the psychologist is & 5 pro” 
computer. It means that he has bee? 
gramed with rules which are not yet Ae 
and which are therefore not availabe ould 
direct implementation. Such rules, gr? 
include that of rejecting data, baa 
patently absurd although possible, anti? 
assigning different weights or ae jon! 
weights for data which for comp" were 
efficiency have been handled as if they alt 
of equal weight even though no such © 
exists. ing pat 

In spite of the quite popular meer 
there is a wide gulf between psych? may 
and psychophysical procedures, ont ii 
utilize the latter to advantage to mæ 


ka 


dy, large number of indirect 
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desirable scale properties. The constant sum 
method (Metfessel, 1947) provides a prom- 
Ising approach for obtaining ratio scales on 
More or less subjective psychological vari- 
ables. The method has been applied mostly 
Where physical standards were also available, 
but it has been found useful and reliable in 
experiments scaling such matters as prefer- 
€nce for neckties (Dudek & Baker, 1956) 
and the association between colors and mood 
tones (Schaie, 1961). 

It is now proposed to apply this constant 
Sum method to the scaling of empirically 
ioe a personality scale items and to ex- 
Sim the effect of such scaling on the con- 
fn rent validity of the questionnaire scales 

Predicting trait ratings and factor scores 
ased on such trait ratings. 


PROCEDURE 


tent High School Personality Quiz (HSPQ) was 
Bromma by Cattell, Beloff, and Coan (1958) to 
asia e a comprehensive personality description of 
escents by means of factored scales. The 
Sistin, contains 14 questionnaire scales each con- 
to rid of 10 items with unit weights. According 
hi e authors, 11 of the 14 scales assess factors 
aaa have also been identified in the behavior 
ing domain. This study is therefore limited to 

€ 11 scales for which validity data are available. 
e items on the 11 HSPQ scales on Form A 

the questionnaire (Factors, A, B, C, D, E, F, 
Wheth I, J, and O) were examined to determine 
Whi her they were in unidirectional form. The items 
ich did not meet this criterion were then re- 
the item “At a picnic, would 
loring the woods 
fire with the 


an For example, 
alone> net spend some time: a. exp 
b. playing around the campfir th 
Wi was rewritten in the form At a aa 
the CUld rather spend some time playing around 
Woods rate with the crowd than exploring 
alone.” 
€ rephrased items were arranged in a booklet 
Pearj the 10 items of each scale on one Past is 
the @8 in the order in which the items anpes a 
to ,Westionnaire. These booklets were then ee 
hoy y Psychologists (their qualifications beinn UE 
who as eligibility for election as APA as a 
Cnst, Were asked to scale items by mean 
ant sum hod: ‘ i 
indies, E method requires the judge to 


i imuli by 
ivi de of two stimu 

te the relative zo enn, Although the pro- 
he constant sum method 
of paired comparison 
mol Jar model. In fact 


With 


die 
ssi $ 
Similarity of the stimul 
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pared. For this reason, as well as to keep the 
judgmental task within reasonable bounds, a modi- 
fied technique using the method of the constant 
stimulus as a model was applied. This modification 
was originally applied to the scaling of line lengths 
(Baker & Dudek, 1957), where physical dimensions 
are available, but has also been found useful and 
reliable in the scaling of the association of colors 
and mood tones (Schaie, 1961). 

The task of the expert judges was to compare 
Item 1 on each scale with each of the other nine 
items and to divide 100 points between each item pair 
assigning the larger number to the item rated more 
appropriate to the positive end of the factor dimen- 
sion. As an example the instructions for Factor A 
(Cyclothymia versus Schizothymia) were as follows: 


Compare statement 1 with each of the following 
statements. Divide 100 points between each pair 
of statements to judge their relative position 
on the dimension “warm, sociable vs. stiff, aloof.” 
Record only the number of points assigned to 
statement 1. The larger number of points should 
be assigned to the statement judged to be more 


“warm, sociable.” 


Scale values were obtained by totalling the points 
assigned to Item 1 in comparison with all the 
other items and the ratio of the sums of all possible 
points to the total for Item 1 was calculated for 
each of the compared items. To permit comparison 
with the unit weights of the original questionnaire 
scale values were next transformed to a 10-point 
scale. On practically all scales some item was rated 
as half as relevant or important as some other 
item with respect to the dimension on which it 
had been placed. For example, the item weights 
obtained for the scale for Factor A were 1.4, 1.0, 
0.9, 0.9, 0.8, 0.7, 1.6, 1.1, 0.7, 0.9. 

The new item weights were next applied to the 
rescoring of a set of HSPQ protocols obtained from 
a group of 43 girls who were residents of a train- 
ing school for delinquent adolescents. These girls 
were also rated by their cottage parents on the 42 
traits described by Cattell (1957) as providing an 
adequate sample of what he calls the “normal trait 
sphere.” Furthermore, factor scores were computed 
from these trait ratings using Cattell’s weights which 
yielded measures for 15 rating factors. Intercorre- 
lations of the HSPQ measures, validity coefficients 
describing the relationship between the HSPQ and 
the trait ratings and rating factor scores, as well as 
multiple correlations between the questionnaire and 
criterion data, were then obtained to compare the 
effect of the new scores using the weights provided 
by the experts as contrasted to the original unit 
weights. The following section will discuss the results 
of these analyses in some detail.? 


2 Tables giving the ratio item weights for each 
of the 11 scales, means, standard deviations, scale 
intercorrelations, and multiple correlations with the 
criterion variables have been deposited with the 
American Documentation Institute. Order Document 
No. 7557 from ADI Auxiliary Publications Project, 
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TABLE 1 | 


MEAN DIFFERENCES, CORRELATIONS, AND PROPORTION OF INDEPENDENT VARIANCE 
BETWEEN Unit WEIGHTED AND RATIO WEIGHTED ScoRES 


Factors | 
a re e DESEG a E TIN 
—.24 
Mean difference —02 08 26 —18 18 05 —.07 —.13 —.14 —.18 08 
Correlation 97 95 94 96 96 97 98 9 98 o% 4% 
Proportion of independent variance 6% 10% 12% 8% 8% 6% 4% 2% 4% 10% 


RESULTS 


The first question to be examined is the 
effect of the scaling upon the descriptive 
statistics of the HSPQ performance of our 
sample of questionnaire respondents. In 
evaluating differences between means the 
proper alternative to the null hypothesis is 
that the means for the scaled scores should 
be lower than the means for unit weight 
scores since unit weight scores should tend 
to overestimate scores of individuals rating 
low on a given scale. A difference significant 
at the .05 level of confidence appears only 
for Factor O. However, keeping in mind the 
relatively small sample and the accompany- 
ing danger of Type II error, it should be 
reported that differences in the expected 
direction which were significant at the .10 
level of confidence were found also for Factors 
D, H, I, and J. An inspection of score vari- 
ability further indicates that variability 
increases as a result of scaling for all scales 
except Factors H and I. 

Intercorrelations were also computed for 
the two score matrices and the Kolmogorov- 
Smirnow test of the difference between two 
correlation matrices (Schaie, 1958) was ap- 
plied. It was found that the two matrices did 
not show more than chance deviation from 
one another and that the scaling therefore 
apparently had no effect upon the general 
structure of scale intercorrelations. This is 
desirable since the factor-analytic solution 
presumably has obtained the best set of 
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items with respect to factorial structure and 
optimal estimation of factor scores. a 
Table 1 lists the mean differences betta 
unit weight and ratio scaled scores, as ¥ 4 
as the correlations and the propor e 
independent variance available for e repré 
The latter, ranging from 2% to 12%, ccon 
sent the limit of the increment 1M 
which may be expected as a result 
scaling procedure. The magnitude ° 
increment is small, but it may be arti 
restricted by requiring adherence 
original 10-point scale. the 
Our basie. premise for this study Me jen 
hypothesis that the concurrent vali ect tbe 
the statements which can be made 2? terio” 
ability of the HSPQ to predict 5 pe! 
behavior, would be increase by 1 teP | 
weighting of items. The next logica, | 
therefore is an examination of change 
validity coefficients. These are repor 
Table 2. It may first of all m 
number of validity coefficients which x 
interpreted as differing significantly fr 


cant by the above criterion were 
the unit-weight scores, while 77 ug sc 
ficients appeared after applying ping | 
weights. Of the 81 coefficients Te@°™ coih 
.05 level of significance by e g h 
method, 45 showed increase 12 m: itd, 
after scaling, 18 retained the same mar pare 
while 18 showed some decrease: >` -at i 
in magnitude, however, exceed 1 po 4 
first decimal place. j terp” d 
Questionnaire results are rarely 1? iy pe 5 
scale by scale. More frequently US® ygt" 
of the scale profile or regression e 
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TABLE 2 


BISERIAL CORRELATIONS BETWEEN HSPQ Form A anp Trait RATINGS 


HSPQ Factors 


Traits A B e D E F & ff I J o 
1. Considerate: 11 
Considerate- = — 36" 16 — 24 12 = —05 15-07 -20 06 —08 
inconsiderate —06 41* 14 —17 12 —03 20 —06 —18 16 —12 
2. 2 
r -20 -12 -233 —16 o5 —19 -17 —02 09 33% —04 
f e —19 —17 -22 -l6 o 25 e = —05 10 4i —12 
; etic. 08 it 10 —06 00 34* —06 235 -25 -21 -23 
j 06 13 07 —12 -04 33 00 24 —2 25 —20 
; a -03 13 -a 19 03 43 14 —39% 13 28 08 
ie —02 12-36 16 -0 = 13 —38* 17 34* 05 
- Patient- 16 22 00 16 
Tae = 10 00 19 05 28 06 —25 
i a 13 21 04 —16 w —05 22 #08 -2 10 —27 
ae ol wa o -12 16 41% —24 23 —19 —25 —07 
oe 2 = = «= 13 27 40* —28 21 =22 30 WO0 
i aces -03 —28 32* —24 00 55* 03 17 =14 40" —25 
; c —04 —20 35* —23 —09 50* 06 16-17 —42* —24 
. Meditative- 2 9 03 -0 —28 =30 <4* -07 7 ties 
unquestioning 22 24 04-16 —31* —33* 45* —0l i6 220 m3 
Coprates -0 3 40% e 09 24 © 30% 05 27 = —O0NaagtS 
obstructive —03 28 44 —02 12 25 4i* 03 —27 —04 —22 
10. Happy- 03 02 00 —3i* B 23-04 08 —-06 03 —10 
sad —24 o —04 —36* if 25 ` —01 09  —07 o0 —09 
1U. Sensitive- 06 33" 12s 13 =10 4o* —08 —23 26 = —12 
tough 15 32 21 —04 05 —15 44* —09 —21 32" —15* 
12, Intelligent- —02 33* —07 42 —O1 04 31* =05 —29 —29 09 
Stupid Oo 2 06 46 0 Ób 3s% —04 20 =28 ME 
13. Poised. —26 o5 —25 ga 0 —21 = 03 16 10 E03 
J flustered —35 02 —24 —O1 00 —21 —10 —O1 21 10 —02 
6 Tolerant- 17 04 08 00 15 —07 21 22 —28 22 —28 
Jealous 19 07 22 08 13 —10 23 20. =26 2 —sit 
1 h 
5. Dominant- —10 —04 00 —07 —29 26 ~—25 00 17 46" 9 
f Submissive =12 00 —07 —06 —19 27 —30* 00 13 —44* 12 
6: Relaxed- aay Sue. 358: aea toa so) 0S o 121) Me treegUS 
ù tense mi S 225.7 A , 00) ~—01 o2 Gf owe | OS 
* Conventional- —16 22 19 17 4 -21 25 05 17 l4 —15 
; unconventional —12 19 26 18 00  —20 25 AROS 19 17 O 
8. Sociable- 10 —30* 07 —19 23 3 6 —-ll =i -03 29 
i self-contained 13 —46* 03 —17 26 Tesh =k ete 31* 
9. Trustful- —10 20 19 11 1s 07 R io Sy 02 —06 
5 Suspicious =12 18 32 14 19 —05 —12 130 14 09 10 
0. Self-effacing- 16 16 11 10 12). 04 0a 0 19 01 
5 egotistical 14 12 18 04 08 —06 —03 —18 01 30* 00 
1. Conscientious- 18 —18 16 —08 24 04 26 —22 —18 09 —16 
Unscrupulous Mi c 2115 22 03 Ame 22 ‘18 14 —20 


354 


K. WARNER ScHAIE 


TABLE 2—Continued 


HSPQ Factors 


Traits A B c 5 E F G H I J 
22. Adventurous- 12 —12 48* —10 —12 41* 14 54* —38* 8 
timid i =i 46* —02 00 43* 15 54+ ae -85 
23. Stable- —02 34* —08 -03 -03 —06 28 13 —29 = 
unstable —07 34* —02 —04 —05 —01 37 13 -235 -1 
24, Persevering- 09 04 01 —34* —21 01 28 03 —02 a 
quitting 13 03 04 —39% —28 —01 30* 05 —05 i 
25. Modest- -22 3% -27 -02 03 -30 og -483 17 r 
attention- —21 34* —31 —09 —04 —26 09 —42* 20 
seeking 
* 
26. Open- 23-01 -15 03 26 -03 -16 —48* —36* oe 
defensive 26 B = —o1 26-01 -10 —a9r =3 3 
27. Refined- —18 15-24 08 29 -20 —05  —42* —11 o 
crude —17 o6 —21 —01 23 —22 —03 —40* —06 j 
28. Imaginative- —01 —39%* 43* 00 -13 06 —09 30 —07 et 
practical —07 —44* 48* 05 07 o -17 33-06 =! 
29. Obedient- —01 08 38* —27 03 02 15 — 30" E 
disobedient 01 04 46* —28 07 —02 12 05 -29 
30. Adaptable- —19 a if 1 15 —21 18 16 —33* A 
inflexible -17 34* 20 —09 2i =i? 21 15 —28 
31. Responsible- 09 20 03 —15 —05 —10 59* —20 —16 E7 
irresponsible 16 16 o6 —18 —06 -12 61* —16 —19 p 
32. Curious- —07 —17 47* 00-04 30 o6 17 —21 -i 
incurious -09 —12 37* —03 09 36* 02 17 -20 7 e 
33. Talkative- o9 o3 15 06 06 37 —o7 16 36 -ig 
silent 08 01 11 09 24 39* —09 16 -37 s 
34. Carefree- —06 —15 30* 05 —06 44* —34* 95 —ió A 
anxious —09 13 32* 05 06 48* —33* 295 -20 7 s 
35. Tasteful- 18 —24 20 -02 -09 27 18 —o0g  —05 s 
inartistic 22 —28 a -0 =07 20 16 -06 -07 7 
36. Resourceful- —15 -i1 -12 39* 23 —02 —05 —02 —42* E 
baffled —17 -11 —09 44* 8B 05 o3 =o —42* 7 
37. Independent- 15 09 o8 —02 08 07 08 o9 —37* n 
dependent 17 07 12 ol 09 08 12 12 —38* i 
38. Adult- 03 17 03 33 —10 0 2 13 -29 -ip 
naive 00 14 04 27 —07 13 35* 133 -2 7 z 
39. Orderly- —06 02-23 -—29 —01 —33* 15 —30* —02 T 
disorderly 05 —05 —21 —30* —08 —35* 12 -29 —04 3 
40. Easygoing- -15 —15 30* —25 24 oo  —04 oz —45* is 
irritable —08 —09 35* 16 28 01 00 o = —S1* i 
41. Expressive- 03 —23 04 ~16 10 16 o5 —06 ob A 
secretive oi —20 08 —15 11 12 04 —05 02 i 
b 23 —04 27 05 10 2 —06 —08 
42. Brave 0 19 ó 
complaining 29: —02 32* o 08 18 18 —06 —08 : 


š " n a P ad scores. 
Note.—The top figure in each row is for the unit weighted scores, the bottom figure is for the ratio weighted s¢ 


5. 


in bold face indicate correlations with the factor on which a given trait is hypothesized to load (Cattell, 1957). 
*p <.05 
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based on known relations between the scales 
and criterion measures. It was therefore felt 
to be appropriate to compute multiple corre- 
lations between the HSPQ and each of the 42 
trait rating criterion variables. To avoid 
maximizing chance and artifactual relation- 
ships in the sample use was made of a 
technique described by Horst and Smith 
(1950) as programed for the Burroughs 205 
Computer by Schaie (1963). This is an 
Iterative method which selects the largest 
Validity coefficient as the first estimate of 
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R, subtracts out the product of this value 
with the appropriate vector of the predictor 
matrix, and then selects the next largest 
remaining validity coefficient to be added. 
After each cycle the multiple R is corrected 
for shrinkage by means of Wherry’s formula. 
Whenever the corrected R no longer increases 
or when it decreases, then the multiple R 
obtained at the end of the previous cycle is 
accepted as the best estimate of R and 
further iteration is ceased. 

Of the 42 regression equations computed, 


TABLE 3 
CORRELATIONS BETWEEN HSPQ Form A AND Factor Scores DERIVED FROM Trait RATINGS 


HSPQ Factors 


D E F G H I J o 


Trait Rating Factors A B ic 
A Cyclothymia versus —07 —02 15 —17 17 06 —08 03 —24 05 —03 
schizothymia —06 —01 21 —14 22 05 —06 02 —23 09 —06 
B Intelligence 06 22 —12 03 —22 —03 38* —11 02 —13 —07 
09 20 —10 00 —23 —03 40* —09 02 —08 —10 
c Ego strength versus 07 00 —04 —10 06 00 09 00 —11 15 —12 
Proneness to neuroticism 07 —02 02 —13 04 —01 12 00 —10 18 —17 
D Excitability versus 07 —06 —04 14 —05 04 —ii 04 08 -25 15 
insecurity —09 —06 —09 16 —01 05 —ii 03 07 —31* 18 
E Dominance versus —01 —28 05 03 —08 23 -15 06 O1 -23 12 
Submissiveness —02 —26 01 04 —02 25 —16 05 —02 —29 13 
F Surgency versus -12 11 16 -08 17 16 -08 28 —27 -07 -13 
esurgency —12 08 19 —06 24 19 —05 27 -25 -10 —-14 
G Super ego strength 02 11 —04 -21 03 -13 26 -14 —08 14 -08 
08 07 —01 —22 00 —15 27 —12 —09 22 —12 
H Parmia versus threctia —06 —23 30* —14 06 41* —06 22 —31* —26 —07 
i —06 —17 36% —09 19 43* —04 21 —35* —33* —07 
Premsia versus harria —19 —08 13 03 —12 —14 05 08 25 —02 —03 
= =12 12 07 -10 -16 00 06 26 —01 —O1 
J Coasthenia o1 —04 —29 -14 -11 —44* 06 —17 33* 46* 10 
02 —06 —29 —12 —17 —47* 02 —15 32* 49* 08 
K Contention versus 10 —11 08 = 09 12 09 —11 -12 —04 —15 
abcultion 11-14 10 -06 10 10 08 —10 —11 -02 —17 
L Protensi 02 —04 —30* 10 —09 -19 -12 —19 
í nsion versus = 18 03 24 
inner relaxation =01 —08 —39* 06 —13 —18 -14 -18 1g —01 26 
M Auti 04 —20 —i4 —01 -15 05 —04 07 00 —05 03 
oe 08 24 12 0i 12 03 03 07 0i 06 00 
N Shrew, 14 17 —08 24 20 ó 09 =z -25 ül 14 
aoe 17 10 —09 2 19 gi 13 —23 —21 0s 10 
Ò Gis 4 o1 —03 05 12 —25 —18 11 —15 03 03 
m is fi h it wei; R 
by ene ehe op, Seure ig, Figures in ola face Waite eo ee earem ie b matching factors in the D aO Neighteq 


q r D 
luestionnaire domains. 
b <.0S. 
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28 resulted in increased and 14 in decreased 
corrected multiple Rs. On the average, scaling 
raised multiple Rs by .037 and corrected 
multiple Rs by .052. Twenty of the multiple 
correlations based on ratio weighted scores 
showed an increase in prediction significant 
at or beyond the .05 level of confidence. 
Carrying out the iterative process until all 
predictors had been assigned a weight changed 
individual coefficients but did not affect over- 
all results. The reported shifts are not large, 
but they do seem to indicate a noteworthy 
trend. 


EFFECTS OF SCALING ON CRoss-Mrprum 
Factor MATCHING 


An interesting extension of our experiment 
leads to the question whether it can con- 
tribute some clarification to the controversy 
regarding the matching of L (behavior rating) 
and Q (questionnaire) data (Becker, 1960; 
Cattell, 1961). Specifically we are concerned 
with the question of what if any effect the 
scaling will have in reducing the effects of 
such variables as the instrument and per- 
turbance factors specified by Cattell to ac- 
count for the lack of congruence between L 
and Q data. A more thorough approach to 
this problem would require factor analytic 
studies, which for the unscaled data have 
been reported elsewhere (Schaie, 1962), but 
which were not attempted in the present 
context. We shall be concerned here prima- 
rily with an examination of the effects of 
scaling on the zero-order correlations. If 
instrument and perturbance factors in fact 
conceal similarities in factor structure, then 
the increment in validity obtained by expert 
scaling may be a function of the decrease 
of effects of these semiartifactual components, 

To study these matters, validity coefficients 
were computed between both scaled and un- 
scaled HSPQ scores and Rating Factor scores 
which were obtained by combining trait 
ratings according to the factor structure re- 
ported by Cattell (1957). These coefficients 
are reported in Table 3. Matching between 
factor scores from the R and Q media is 
obtained at the .05 level of significance only 
for Factor J (Coasthenia). Although none of 
the coefficients for Factor G (Ego strength) 
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reach significance, the matching loadings are 
highest. In terms of our significance criterion, 
mismatchings occur between C, and Factors 
Hı and Lı; between Factor F, and Factors 
Hı and Ji; between Factor G, and Facto? 
Bı; between Factor I, and Factors Hı and Ji; 
and between Factor J, and Factors Di a 
Hı. What is the effect of the scaling on these 
relationships? In our sample it is restricted t0 
raising most of the significant coefficients very 
slightly and in introducing two addition 
mismatches. 


CONCLUSIONS 


It is apparent from these results that thet? 
seems to be a consistent trend for an inoa 
in the concurrent validity of the HSPQ sca g 
as well as their combination after rescon 
with expert-derived ratio weights. It ÍS A 
apparent that the magnitude of the increas e 
for our sample is not very great am. 
legitimate question arises whether the in It 
ment is worth the extra labor involve: 
should be kept in mind, however, tha ite 
sample of questionnaire respondent was ra 
limited and that the criterion variables "hat 
certainly fallible. Also, we do not know We 
effect the rewriting of the questionnaire t 
had in modifying the judges’ decisions: tor 
results with respect to the cross-medium fa? 
matching question are further limited 
fact that we used Cattell’s factor weigh “phe 
adults in computing factor rating scores: nts 
factor structure of rating data on adolesce? 
may, however, be somewhat different * s 
that for adults, resulting in possible ett" 
conclusions from our data. It woul 
desirable, therefore, to conduct 
studies similar to the one described jt 
under more favorable conditions, sin caeed 
appears that expert judgment can 1 pe 
Increase the validity of questionnaires: “i 
task then remains to investigate the OP oo! 
mental conditions under which the gain “ to 
expert judgment will be sufficiently 12"8 
merit practical application. 


out 
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COUNSELOR ANXIETY IN RELATION TO AMOUNT OF 
CLINICAL EXPERIENCE AND QUALITY OF 
AFFECT DEMONSTRATED BY CLIENTS* 


“PETER D. RUSSELL 


Buffalo Veterans Administration Hospital 


anD WILLIAM U. SNYDER 


Ohio University 


A counseling paradigm employed 2 actors role playing as clients, and 20 
counselors. Independent variables were hostile or friendly client behavior and 
amount of counselor experience. There were 4 measures of the dependent 
variable of counselor anxiety: palmar sweating, eyeblink rate, client-actor 
estimates of counselor anxiety, and independent judgments of verbal anxiety 
of counselors’ protocols. Results revealed that hostile client behavior led to 
significantly greater anxiety than friendly behavior. Amount of graduate 
training and counseling experience had little effect on the degree of counselor 
anxiety in either hostile or friendly interviews. Modification of some of the 
measures, lengthening of interviews, increasing group sizes, or finding more 
discriminate groups might have changed this last finding. Palmar sweating 
was of questionable utility as a measure of anxiety. 


In recent years increasing attention has 
been given to the behavior of the counselor 
in interview situations. Studies have been 
made investigating counselor personality, 
counselor techniques, and the relationship 
between counselor and client. These questions 
have practical significance for graduate 
programs in clinical psychology since they 
are relevant to the teaching of counseling 
methods. 

In the present study the investigators at- 
tempted to examine the effect of a hostile 
or a friendly client demeanor upon counselor 
anxiety from the standpoint of relatively 
experienced and relatively inexperienced 
graduate student clinical psychology coun- 
selors. Two hypotheses were developed. First, 
it was proposed that student counselors, 
regardless of amount of experience, display 
greater anxiety in interviews with hostile 
clients than in interviews with friendly 
clients. The second hypothesis was that more 


1 This study is based on a PhD dissertation con- 
ducted by Peter D. Russell at the Pennsylvania State 

iversity under the direction of William U. Snyder, 
Uni id H. Ford, and Deno G. Thevaos. One of the 
Dona used in the study was derived from scales 
tests d by William U. Snyder, which had been 
a in part on previous work by Donald H. Ford 


and Oliver H. Bown. 


5 
experienced student counselors display ra 
anxiety than less experienced student © 
selors in interviews with both hostile ex 
friendly clients. These hypotheses wer the 
amined by means of four measures almat 
dependent variable of anxiety, i-€~ Fates 
sweating, eyeblink rate, client-actor a pal 
of counselor anxiety, and judgments 0 
anxiety made by independent judges- 

A review of the literature revealed 
was a relatively new area of cons! 
Although there has been some th 
only a few experimental studies We jjent 
which explored the effects of hostie “om? 
affect on counselor anxiety. There Were vical 
investigations which utilized phy: siol jn 
measures, but only one report was foui des 
which the counselors’ palmar-sweating ol 
had been employed, and this W#? _jiry 
tangentially related to the current e 
(Light, 1951). Using actors in the TO") oes 
clients, as in the present study, Bee vest 
undertaken only once before to the * jddler 
gators’ knowledge (Kounin, Polansky, f that 
Coburn, & Fenn, 1956). However, 1°", rolê 
study the actor did not take an pat of 
in the evaluation nor was the behav! die? 
the counselors under scrutiny. Also, ; 
were reported which were concerne 


t this 
d era y 

. ři 
eorizi? ‘i 
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speech patterns as an index of anxiety, but 
none of these inquired into counselor be- 
havior. No investigations were found which 
directly related the counselor’s level of 
experience to the anxiety produced in certain 
types of interviews. 

Two studies have reported results relevant 
to those cited in the present investigation. 
Bandura, Lipsher, and Miller (1960), using 
analyses of actual therapy interviews (but 
not simulated interviews), found that thera- 
pists who typically expressed their own hos- 
tility in direct forms, and who displayed low 
need for approval, were more likely to permit 
and encourage their patients’ hostility than 
Were therapists who expressed little direct 
hostility and who showed high approval seek- 
ing behavior. Psychotherapists were more 
Inclined to avoid hostility when it was 
directed toward themselves than when the 
Patients directed their hostility toward 
Others. The patients were more likely to drop 
the hostile topic, or to change the object 
of their hostility following therapists’ avoid- 
ance reactions, than they were following 
the therapists’ approach reactions. 

Heller, Myers, and Kline (1963), in a 
Subsequent study, copied in part after the 
Present investigation, and employing client- 
actors in simulated therapy situations, have 
demonstrated that dominant client behavior 
will evoke dependent interviewer behavior, 
and dependent client behavior will evoke 
dominant interviewer behavior. They also 
Showed that hostile client behavior will evoke 

Ostile interviewer behavior, and friendly 
Client behavior will evoke friendly interviewer 
ehavior. They were not able to demonstrate, 
as was the present study, that hostile client 
ehavior will evoke anxiety on the part of 
a therapist. 
METHOD 


Design and Procedure 


The design of the study randomized the order in 
Which counselors participated. Two client-actors 
Were each seen by half of a group of 10 more 
experienced counselors in their first interviews, and 

Y the other half of the group in the second. The 
Same procedure was followed for 10 relatively in- 
€xperienced counselors. Moreover, each of the two 
Client-actors was seen in a friendly role 20 times 
and in a hostile role an equal number of times. 

The two client-actors were chosen from a group 
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of undergraduate males enrolled in a theatre arts 
course. It was felt that actors could assume the 
roles required most meaningfully, While having 
demonstrated acting proficiency, they had not ap- 
peared extensively in public roles where they might 
have been observed by the student clinicians. The 
actors were chosen on the basis of lack of neurotic 
trends, similarity of physical appearance and age, 
and ability to take the required roles and play them 
spontaneously in a hostile and in a friendly manner. 
They were paid for their work, in order to maintain 
their interest in the project. 

The actors underwent 10 hours of training to 
respond in interviews in two different ways: in a 
very friendly manner which was positive and helpful, 
and in a hostile and negative manner. They were 
quickly able to fill in the missing elements of their 
“background” stories when this seemed called for 
in the simulated counseling situation. While the spon- 
taneity of the situation was maintained, their ways 
of presenting problems, information, and affect 
became quite constant. In a pilot study which was 
carried out, it was discovered that the actors were 
presenting somewhat extreme characterizations of 
the typical friendly or hostile client roles, so that 
the counselors of the pilot study were able to 
detect that the situations were somewhat exag- 
gerated, and therefore not real counseling situations. 
Consequently the roles were made less deviant from 
the mean level of affect, and in the actual experi- 
ment itself, it was ascertained that none of the 
counselor-subjects detected the true character of the 
study, nor realized that the clients were actors. 

The utility of the counseling paradigm which 
made use of actors who took the roles of clients 
was explored at some length. It was felt that this 
design had unique advantages for the study of 
counselor anxiety. It offers a compromise between 
the reality, but lack of control, of the study of real 
counseling, and the fine control, but lack of reality, 
of other analogues to a counseling situation, i.e. 
films or tape recordings. The use of actors role 
playing in what seems to be a normal counseling 
situation to the subjects, permits the manipulation 
of an independent variable which is uncontaminated 
by the fact that the counselor knows he is being 
studied or by other extraneous factors which are 
difficult to control. The counselor not only thinks 
that the client is real, but also that he is helping 
him. For the present investigation each subject 
was able to act as his own control for the study 
of the effects of hostility and friendliness due to 
the counterbalanced design, so the need for matching 
was eliminated. 


Dependent Measures 


Palmar sweating. Employing the revised method 
of Mowrer (1953), the fingerprint stain test of 
palmar sweating was administered before and after 
each interview following thorough handwashing. 
The densities of the stains were measured by means 
of an adaptation of Mowrer’s apparatus which 
passes light through the stain onto a Photoelectric 
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cell. The counselors were told that this measure 
was being used on them in order to determine 
congruence of counselor-client feeling, a measure 
which was being considered for predictive ability 
regarding the extent to which marginally motivated 
clients would apply themselves in the counseling 
process. eyes 

In order to take into account possible influences 
on the amount of sweating, several new controls 
were employed. At the outset, stains were made 
for both index fingers. Where the stains appeared 
equally dark, the right index finger was used for 
subsequent measurements. In the cases where one 
print was perceptibly darker than the other, the 
finger producing the darker stain was used. This 
controlled for the possibility of peripheral nerve 
damage to an extremity (Cross, Dodds, & Knights, 
1960) and consequent potential reduced sensitivity 
to tension changes. 

Secondly, the counselors were allowed to rest 
for at least 15 minutes after coming to the experi- 
mental room so that they might become accom- 
modated to the situation. This time was used 
to give the instructions. Also approximately 15 
minutes of rest was given between post- and pre- 
measures of the first and second interviews, respec- 
tively, in order to facilitate dissipation of any 
anxiety and concomitant physiological reactions. 
Temperature and relative humidity were measured 
continuously by means of hygrothermographs in 
both the counseling room and the monitor room 
where the palmar-sweating measurements were 
taken. Large changes in either might have affected 
the amount of sweating, but no significant variations 
were noted. Also, it seemed necessary to make sure 
that the counselors were not ill, since it is likely 
that fever might cause an increase in palmar sweat- 
ing. In order to guarantee that there were no cases 
of fever or illness, this topic was brought up during 
the period before the first interview, and ruled out 
before the counseling began. Finally, hyperthyroidism 
can lead to an increase in palmar sweating (Walmer, 
1961).? However this factor was adequately con- 
trolled by checking the pulse rate. 

Eyeblink rate. By means of a concealed closed- 
circuit television system, blink rates of the counselors 
were observed and recorded during the second and 
fifth 5-minute portions of the half-hour interviews, 
A check on the accuracy of the investigator’s count- 
ing of the number of blinks showed a very high 
degree of reliability of measurement, 

“Client’s” Scale of Counselor Anxiety. Following 
the completion of each interview while the counselor 
was having his postinterview stain test, the client- 
actor completed a scale rating the anxiety which 
the counselor had revealed in the interview. This 
scale consisted of 60 items selected from three 
sources: items modified from Snyder’s (1961) 
Client’s Postinterview Attitudes Toward Therapy 
scale and Therapist’s Personal Reaction Question- 


2J. D. Walmer, personal communication, July 
15, 1961. 
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naire, and a group of 10 items prepared especially 
for the present study. EM 
Judgments of verbal anxiety. This scale of one 
of verbal anxiety was developed from statem the 
concerning counselor anxiety which appeared a ed 
literature,? in addition to several criteria bap 
for the present study. These criteria were ae 
by being submitted to a group of five exper! full- 
and carefully trained judges who were aome RG 
time vocational counseling and psychotherapy “state 
Division of Counseling at the Pennsylvania ia 
University. Those criteria which were 4 zeae 
“significant measures” by three of the five e 
and for which the judges agreed upon the nb into 
ate examples submitted to them, were comp! Further 
a scale of 14 items. However, this scale was of six 
modified by being submitted to a group ho 
other experienced judges, who were the ones Two 
later did the actual rating of the interviews. hy 
new criteria which appeared relatively fred aded 
enough in the pilot-study interviews a of 1 
to the original scale, thus making a OH could 
criteria for judging counselor anxiety whic tocol: 
be observed in the interview situation or PFO were 
The sixteen criteria of counselor ansiely ing 
the following: 4 direct statement of aniei ting! 
questions or changing the subject; * nee - 
impersonal or premature interpretations; u ani 
sary reassurance; disapproval; intellectualizini 
not responding to client’s feelings; introjech * une 
references to the counselor’s own experiences: asesi 
finished sentences; * repeating words Or P remu- 
* stuttering; * blocking; poor voice auai © s 
lousness, “cracking” voice, etc.; apologizing ughteri 
fault in counseling technique; inappropriate 1% 
and other signs of anxiety. ot com 
Several of these signs of anxiety were M ly igh 
mon among all counselors, but had a meat ar 
frequency of occurrence in the protocols in al! 
ticular ones. The starred items were commo” 
interviews. 


Establishing Interjudge Agreement 


The six judges spent approximately 12 ho 
training as a group, During this period the 
of each criterion and its application Mg r 
interviews was discussed and clarified. d on coe 
quired that an agreement of 88% be reache nsider® 
unit in the test sample before training was ©° 
adequate. of 


Jem 
Some question arises regarding the e 


i ar 
controlling for interview identity with resets 
knowledge on the part of the judge as t° t wou 


the interview was a friendly or hostile one- 


d 

an! 

®See studies by Dibner (1956), pollar gA9) 
Mowrer (1947), Kauffman and Rai amen 
Lasswell (1935), Sanford (1942), 2 Good? 
(1950), Mahl (1956, 1959), Boomer an jel 
(1961) and Krause (1961). nxt? 


3 ie? 
*The starred items are those signs “mly 
occurring more frequently and more unifo 
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be impossible, of course, to disguise this fact from 
the judges (who might also have guessed the 
identity of a few of the counselors). Some of the 
specific criteria of counselor anxiety which might 
be more influenced by this knowledge would be 
‘direct statement of anxiety,” “premature interpreta- 
tions,” “disapproval,” and “apologizing for some 
fault in counselor technique.” However, these were 
Fee the signs of counselor anxiety which occurred 
fat infrequently, and therefore had relatively 

tle effect on the score of counselor anxiety 
at which the judges arrived in each case. 


Selection of Student Counselors 


Fok groups of 10 male student clinicians were 
esi from the total population of clinical psy- 
ag gy graduate students. The 22 eligible students 
Mae evaluated for clinical experience in order to 
Es ce them in the “more experienced” or “less 
Ea enced group. Using a modification of Tirn- 
Einn (1959) questionnaire, 13 types of clinical 
ae ence were taken into consideration in arriving 
Aon omposite “clinical experience” rankings. The 
for ps as finally constituted, were also evaluated 
RE na ability (using the Miller’s Analogy Test 
ae and age, since these factors might lead to 
aa ape results if the experienced counselors 
aan asically more capable and/or older. Ability, 
facta a by MAT scores, was not found to be a 
Gan ifferentiating the two groups; the experi- 
over Payee was, however, something a little 
hi a half year older than the inexperienced group. 

S difference was not considered to be significant. 


Judgments of Client-Actor Affect 


nt interview typescripts were presented in 
ae eas randomized sequence to three non- 
into e Ogist judges. They were asked to sort them 
a tized piles in terms of client hostility or 
without The three judges performed this task 
ad ad error, thus indicating that the client-actors 
the ex opted clearly differentiable roles throughout 
periment, 


RESULTS 


can pothesis concerned with the dif- 
a a effects of hostility and friendliness 
ae oth the experienced and inexperienced 
of TE of Counselors was analyzed by means 
CA Nt ages matched-pairs signed-ranks 
plied a the Mann-Whitney U test was ap- 
anxiet oh the data in comparing counselor 
ty in terms of the levels of counselor 

erience, 
ates far as the comparison of types of 
ink Was concerned, it was found that eye- 
eee the “Client’s” Scale of Counselor 
Y, and the scores derived from the 
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Judgments of Verbal Anxiety Scale revealed 
that hostile behavior on the part of a client- 
actor created more anxiety in the two groups 
of counselors than did friendly behavior. The 
levels of significance obtained were much 
greater than the level set in advance (.05 
level) for rejection of the null hypothesis. 
Significance levels obtained were all at the 
001 level for the more experienced coun- 
selors and better than the .005 level for the 
less experienced counselors. For the relatively 
inexperienced counselors, one individual from 
the group had results in the nonpredicted 
direction in each case. For both groups of 
counselors the trends for palmar sweating 
were in the expected direction, but were not 
statistically significant. Thus it could be said 
that the first hypothesis was supported. 

The comparison between levels of coun- 
selor experience revealed that only one 
dependent measure significantly displayed 
the expected difference between groups of 
counselors. For the friendly interview, eye- 
blink rate indicated that the more experi- 
enced counselors were made less anxious by 
the client-actors’ behavior than were the less 
experienced counselors. The total anxiety 
score from the Judgments of Verbal Anxiety 
Scale almost reached the .05 level of signifi- 
cance for this same type of interview. The 
results of the other dependent measures 
showed trends both in the predicted and non- 
predicted directions for the two types of 
interviews. It may be concluded that gradu- 
ate training and experiences did not seem 
to affect the degree of anxiety manifested 
by the counselors. 

Paired relationships between the measures 
were computed by means of the Spearman 
“tho” correlation. Combinations of scales 
were also evaluated for interrelationship by 
means of the Kendall coefficient of concord- 
ance. For these correlations the two groups 
of counselors were combined. For the inter- 
views with the hostile client-actor, eyeblink 
was found to be significantly correlated with 
the “Client’s” Scale of Counselor Anxiety 
and with the total anxiety score of the index 
of judgments of verbal anxiety. None of the 
other paired comparisons of measures for 
either type of interview produced correla- 
tions indicative of a statistically significant 
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relationship, nor did the group comparison of 
measures reveal significant intercorrelations. 


DISCUSSION 


The hypothesis stating that , student 
counselors, regardless of experience, display 
greater anxiety in interviews with hostile 
clients than in interviews with friendly clients 
seems to be supported, at least as far as the 
present counselors were concerned. Regarding 
the failure to find significant differences in 
the predicted direction between the experi- 
enced and inexperienced counselors for both 
friendly and hostile interviews, it is felt that 
the indices might not have been sensitive 
enough to show existing differences. The sup- 
position that there should be differences in 
reactions to both types of interviews, depend- 
ing on the amount of clinical experience, 
still seems reasonable because the idea has at 
least face validity, and there were more 
trends in the expected direction than the 
converse. Perhaps a more appropriate 
measure of experience would have been the 
amount of such experience which constituted 
work with hostile clients, specifically. 

Increasing the size of the groups or extend- 
ing the length of the interviews might help 
to distinguish between such groups from a 
graduate student population. Of even greater 
importance would be the selecting of thera- 
pist groups with a minimum of overlap 
between those with experience and those who 
were inexperienced. 

Consideration of the relative value of the 
dependent measures as research instruments 
suggests that the palmar-sweating index is 
open to serious question as a meaningful 
research tool. 

Eyeblink rate appeared to be quite useful 
for investigations of counselor anxiety. It 
not only discriminated very highly between 
types of client-actor affect for both coun- 
selor groups, but also reflected differences in 
direction for a particular type of interview. 

The “Client’s” Scale of Counselor Anxiety 
was also found useful, but certain modifica- 
tions of the instrument appear to be neces- 
sary. The statement might be made that the 

differences obtained on this scale were due 
to biases of the actors. However, an evalua- 
tion of interactor agreement in the applica- 
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tion of the scale suggests that the countet- 
balanced design should serve to randomize 
any consistent bias on the part of the client- 
actors. Additional items need to be com 
structed which would allow finer discrimina- 
tions to be made at the nonanxious end 0 
the scale. 

Considering the Judgments of Verbal Ant 
iety Scale, there were high discriminating 
ability of the instrument between types A 
interviews, and high interjudge agree 
achieved in using this scale. Perhaps certa h 
individual criteria would prove to be mor 
useful in specific contexts; however, removing 
some of these criteria might cause the sc in 
to lose its differentiating ability. Increases E, 
numbers of counselors or the length of Be 
views would be possible methods of improv 
the discrimination of this scale even furt E 

As far as the lack of demonstrated Te% 
tionships among the indices is concerne oe 
can be maintained that this did not ee 
sarily indicate that they were too pao 
to be of value. It is suggested that 15 ng 
of the four apparently independent me 
sions of anxiety having separate antece aen 
they might be related to a single antec? 
viz., the mediating anxiety response. 
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CHILDHOOD AND INTERCURRENT INTELLECTUAL 
PERFORMANCE OF ADULT SCHIZOPHRENICS? 


GEORGE W. ALBEE, ELLEN A. LANE, CLARE CORCORAN, anv ANN WERNEKE 


Western Reserve University 


To determine whether intellectual deficit is a concomitant of adult schizo- 
phrenia, premorbid childhood IQs of 112 adult schizophrenics were com- 
pared with IQs obtained during the psychosis. Scores on childhood Binets 
and childhood group tests and on adult Wechsler-Bellevues were treated 
statistically to increase comparability. No significant difference was found 
between early premorbid IQs and scores during the psychosis. Correlations 
of IQs from these 2 periods of time were .55 and higher. These results 
seriously challenge the belief that intellectual loss occurs as a consequence of 
adult schizophrenia. Although hospitalized schizophrenics test below average, 
these same people are found to have made similar low scores on intelligence 
tests during childhood, long before the acute onset of schizophrenia. 


Studies of intellectual deficit in schizophrenia 
have most often compared hospitalized schizo- 
phrenics with normal controls. In a review of 
these studies, Hunt and Cofer (1944) concluded 
that ideally “efficiency should be measured be- 
fore and after the onset of the condition.” Eight 
years later Howard Hunt (1952) wrote, “The 
direct approach to measuring deterioration— 
comparison of premorbid test results with results 
at the time of diagnostic study—is generally 
impossible.” 

Rappaport and Webb (1950), however, had 
reported that, in a sample of 10 schizophrenics, 
there was a mean decline of 33 IQ points be- 
tween premorbid high school intelligence tests 
and tests administered sometime after hospitali- 
zation. Recently Lubin, Gieseking, and Williams 
(1962), in a more definitive study, compared 
the intellectual performance of 159 schizo- 
phrenics, tested with the Army Classification 
Battery shortly after hospitalization, with their 
premorbid performance on the same test at 
the time of induction. They found a statistically 
significant, but much smaller, deficit on the 
second test than that reported by Rappaport and 
Webb. They accounted for this difference by the 
fact that some of the subjects in the earlier 
study were tested after long hospitalization. 

It is generally agreed that motivational decre- 
ment is a common characteristic of schizophrenia 
and should be considered in studies of intel- 


1 The research reported in this paper is supported 
by Grant M-5186 from the National Institutes of 
Health, United States Department of Health, Edu- 
cation, and Welfare. 


lectual deterioration, The use of rigidly tes 
tests, such as the Army Battery, that places 
premium on persistence and focused attention 
might be expected to result in reduced pco S 
for the schizophrenic. Most of the evidon 
supports Hunt and Cofer’s (1944) statemen 
that “deficit becomes greater as the complex 
of the task increases” and “the findings pie 
that the deficit in schizophrenics is one of ™ a 
vation rather than one of capacity.” The eel 
tests that do not penalize seriously occas! ake 
lapses of attention and motivation would ™ 
this factor less important. 

The present study is concerned wi 
parison of premorbid intelligence test PO ijze 
ance, not from shortly before the recog? the 
adult disorder but from childhood, with 
performance on the Wechsler-Bellevue 
administered during hospitalization 
phrenia, The tests from childhood wer 
the 1916 Stanford-Binet or elementary $° the 
group intelligence tests. In the case ° 
former, when several Binet scores Were i 
able for one subject, the highest one was Ki vel 
as the best estimate of optimal premorbi 


me 
th a c0 
erfor 


avail- 


METHOD 
Subjects were 112 adult schizophrenics unde a 
years of age diagnosed at any of several a the 
Veterans Administration hospitals and clini) wh? 
Cleveland area, All schizophrenics were incluc eit 
had been given a Wechsler-Bellevue test ann 
psychosis and who had been given one ? 
intelligence tests in the Cleveland Public 
during childhood. 
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NOTES AND 


Since 1937 all Cleveland school children in regular 
second-grade classes have been given the Kuhlmann- 
Anderson Test. Another intelligence test, the Cleve- 
land Classification Test, has been given since the 
early 1930’s to children in regular sixth-grade classes. 
The 1916 Stanford-Binet has been given since its 
Inception to all children in special classes, usually 
pea 3 or 4 years, and to a large number of other 
children for a variety of reasons, such as: kinder- 
ae entrance, admission from another school sys- 
RA The se and academic problems, and candidacy 
= Sean ment programs, Most of the subjects under 
id ae of age, approximately half the total sample, 
waite F given both group and individual intelli- 

T ests and are included in the results of both. 
the Sree! premorbid IQs and scores during 
childhend osis for those subjects who had had a 
Res A Stanford-Binet, Wechsler-Bellevue 1Q 
10 panies converted to an equivalent 1916 Binet 
paring ie to Wechsler’s (1944) table. In com- 
ebeatned pear group tests with adult Wechslers 
Cuvata uring psychosis, childhood scores were 
divin at to deviation IQs based on the mean and 
are et Cleveland school children in a random 
tion IQ e particular test. The mean of the devia- 

S was set at 100 and the standard deviation 


TABLE 1 


CHILDHOOD Tests COMPARED WITH TESTS 
DURING PSYCHOSIS 


Hospital 
Wechsler-Bellevue 


Highest converted to 
childhood equivalent 
Binet IQ Binet IQ 
98 98 
A 87.1 87.6 
E 11.8 15.8 
r 50 
Second-grade 
Kuhlmann- Hospital 
Anderson Wechsler-Bellevue 
deviation IQ IQ 
y 32 32 
4 92.6 39.4 
; 15.6 14.9 
68 
Sixth-grade 
Cleveland Hospital 
Classification  Wechsler-Bellevue 
deviation IQ IQ 
A 39 39 
88.1 91.5 
if 19.9 17.9 
55 
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at 15 in order to make these tests comparable to 
the Wechsler-Bellevue. 

Means and standard deviations were computed for 
the premorbid measures and for the tests during 
psychosis. Product-moment correlations were com- 
puted between childhood and adult psychotic scores. 


RESULTS 


As can be seen in Table 1, there was practi- 
cally no difference between the mean of the 
highest childhood Stanford-Binet tests and the 
mean of the Wechsler-Bellevues administered 
during the psychosis for 104 subjects. The 
former was 87.1; the latter 87.6. The correlation 
between these tests, taken many years apart, 
was .56. This correlation compares not unfavor- 
ably with correlations of .70 to .80 that 
Anastasi (1958) considers usual for such meas- 
ures over years. 

Similarly, the mean of the Kuhlmann-Anderson 
deviation IQs from second grade was 92.6 while 
the mean hospital Wechsler-Bellevue IQ was 
89.4, a decrease that was not statistically sig- 
nificant, The correlation between these measures 
was .68. The mean of the scores of the sixth- 
grade group intelligence test was 88.1; the mean 
for the same people during hospitalization was 
91.5. The correlation between these two measures 
was .55.? 

It is clear that no deficit was found between 
premorbid childhood intelligence tests and tests 
administered during the psychosis. The corre- 
lation coefficients between these two estimates 
of intelligence were surprisingly high, both when 
the premorbid individual tests and the group 
tests were compared with the hospital test. 


DISCUSSION 


The present study fails to confirm the two 
previous studies in the literature which reported 
a decline in measured intelligence when tests 
administered during the psychosis were com- 
pared with premorbid measures. It also contra- 
dicts the persistent clinical contention that there 
is intellectual deterioration in schizophrenia. 
Because the highest childhood Stanford-Binet 
Test was used in comparison with the Wechsler- 
Bellevue, deterioration had more than usual op- 
portunity to be demonstrated, if it existed. 
Indeed, when Stanford-Binet scores obtained 
between ages 11 to 14 were compared to hospi- 
tal scores, an actual increase in measured intel- 


2 Separate analyses were made for males and 
females and for the subtypes of schizophrenia. No 
significant changes in IQ between childhood tests 
and adult tests were found in these analyses. 
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ligence during the psychosis was found (Lane, 
1958). 

While additional research is required to 


reconcile these findings with the others in the 
literature, it seems probable that the discrepancy 
is related to the nature of the tests during 
psychosis. The Wechsler-Bellevue Test, used in 
this study, is not as rigidly a speed-timed test, 
half the subtests not being timed at all, as the 
Army test on which different findings were 
reported. The length of time subjects had been 
hospitalized when tests are administered, the 
adequacy of the premorbid testing, the attitudes 
and motivation during the hospital testing—all 
of these are factors that should be considered 
in future research. 

Earlier studies reported intellectual deficit 
when schizophrenics are compared with normals. 
Now it seems likely that premorbid measures 
on future schizophrenics would show that intel- 
lectual performance has already been depressed 
as a consequence of long-term personality 
characteristics associated with incipient schizo- 
phrenia. Schizophrenics score low in the hospi- 
tal but these same people always scored low, 
long before the recognized onset and diagnosis 
of the disorder. Birren (1944) found a mean 
IQ of 88.7 in a group of boys who later became 
schizophrenic and a mean IQ of 97.7 for a 
normal control group of boys from the same 
schools. Mason (1956) found premorbid AGCT 
scores of 368 inductees who later became schizo- 
phrenic to be significantly below the norms made 
by more than a quarter of a million inductees. 
Since schizophrenia has been found to be much 
more prevalent in the lowest socioeconomic levels 
(Hollingshead & Redlich, 1958) these results 
might have been influenced by those factors 
related to a poor environment that are known to 
affect intelligence test scores. 

A recent study (Reuter, 1962), however, re- 
ported a significant difference between second- 
grade intelligence test scores of children who 
later became adult schizophrenics and control 
children from the same elementary school classes 
and neighborhoods. Another study (Lane & 
Albee, 1963) found that children destined to 
be adult schizophrenics were significantly below 
their own siblings in intelligence tests taken 
during the second grade. 

Intellectual deficit as a characteristic of 
schizophrenia then does not appear to be a 
concomitant of the severe or acute stages of the 
disorder but rather as a characteristic of the 


NOTES AND COMMENTS 


ildhood 


; P chi 
total life picture, at least from early ntually 


on, of those individuals who are eve 
admitted to public mental hospitals. 

Although the group averages were peen 
mately the same in these comparisons ee 
premorbid and intercurrent tests, some ers 
viduals, of course, did go down while me j- 
stayed the same or went up in IQ after he ie 
talization. A study is now under way to cet a 
whether two kinds of schizophrenics contr can 
to these results; that is, whether those La 
be identified as “process schizophrenics ; cive 
distinguished from those considered “1e? erm 
schizophrenics” on the basis of this longt 
poor performance on intelligence tests an i 
on the basis of the relationship betwee? ring 
morbid tests and intelligence test scores 
the psychosis. 


approx 
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FAILURE TO VALIDATE THE Cr AND Sm SCALES OF THE MMPI? 


JOHN L. VOGEL? 
Baldwin-Wallace College 


Conversion reactions and somatization have 
much current interest. Such terms give a dy- 
namic twist to the more nominal linkage of 
Psyche and soma. They speak of the patient 
who presents himself with somatic symptoms 
and complaints which are considered expressive 
of or dynamically related to emotional conflict. 
In the most dramatic case, such individuals 
Present symptoms which are obviously incapaci- 
tating but for which no organic basis is found. 

The Cr and Sm scales of the MMPI, de- 
veloped by Rosen and described by Dahlstrom 
and Welsh (1960), were intended to differentiate 
individuals in whom conversion reaction or 
eet zation appears to be operative. In an 
s Ort to validate these scales, MMPI profiles 
vere selected from two discrete populations: 
a. Psychosomatic (PS): Patients hospitalized 
fi a medical service with somatic complaints 
pt which no organic basis had been found at 
he time of observation by the psychiatric con- 
sultant, This group included 9 males and 16 
females, ranging in age from 18 to 56, with a 
Mean age of 40.6 years. 

Somatic (S): Patients hospitalized on a 
Medical Service whose somatic complaints were 
Consistent with findings of organic disease. This 
Stoup included 13 males and 12 females, ranging 
sais An extended report of this study may be 

aned without charge from John L. Vogel 
(Baldwin-Wallace College; Berea, Ohio) or for a 
e from the American Documentation Institute. 
par Document No. 7505 from ADI Auxiliary 

Ublications Project, Photoduplication Service, 
trary of Congress; Washington 25, D. C. Remit 
@ advance $1.25 for microfilm or $1.25 for photo- 
oe and make checks payable to: Chief, Photo- 

Plication vice, Library of Congress. 

? This nae oe pre while the author was 
at the University of Washington School of Medi- 
“Me, Department of Psychiatry. 
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in age from 20 to 67, with a mean age of 


45.7 years. 
Distribution of scores, means, and standard 


deviations on the two scales for the two groups 
is shown in Table 1. Using the median as a 
cutting score the Cr scale correctly identifies 
43% of the psychosomatic cases. By a similar 
method, the Si scale correctly identifies 52% 


TABLE 1 


CONVERSION REACTION AND SOMATIZATION SCALE 
SCORES IN Two GROUPS oF PATIENTS 


Cr Sm 
PS S PS S 
65-69 ó 3 31-32 1 1 
60-64 4 10 29-30 2 2 
55-59 5 4 27-28 5 3 
50-54 5 4 25-26 7 8 
45-49 2 1 23-24 3 5 
40—44 2 21-22 5 3 
35-39 1 19-20 1 
30-34 2 17-18 1 i 
25-29 15-16 1 
20-24 1 13-14 
11-12 1 
M 56.35 55.40 24.88 24.00 
SD 8.89 11.03 3.37 4.10 


of the psychosomatic cases. It is apparent that 
the Cr and Sm scales fail to discriminate 
effectively in the present sample. 
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BRIEF REPORTS 


SPONTANEOUS GSR AND ANXIETY LEVEL IN 
SOCIOPATHIC DELIN QUENTS ' 


ROBERT FOX 


Longview State Hospital, Cincinnati 


Recent research on autonomic function has 
revealed that frequency of spontaneous changes 
in skin resistance and in other autonomic sys- 
tems is a reliable characteristic of the individual, 
which is relatively independent of magnitude 
measures (Wilson & Dykman, 1960). 

The mechanism underlying emission of these 
responses is not completely known, nor have 
the behavioral correlates of differences in fre- 
quency been fully explored. However, some sets 
of data (Burch & Greiner, 1960) suggest the 
hypothesis that rate of emission is some positive 
function of an individual’s state of “arousal” or 
“activation.” 

As a preliminary test of this Proposition, this 
experiment measured the spontaneous activity 
(SA) and basal skin conductance (BSC) levels 
of two groups which differed in their clinically 
assessed level of manifest anxiety. The assump- 
tion was made that group differences in manifest 
anxiety level can be interpreted as indicating 
correlative differences in arousal intensity, 

Ten male inmates of the Juvenile Court 
Detention Center who had received the diagnosis 
Sociopathic Personality Disturbance, Antisocial 
Reaction, formed-one group (SP group), while 
10 comparable subjects who had received the 
diagnosis Personality Pattern Disturbance, In- 
adequate Personality, formed a second group 
(IP group). Psychiatric and psychological evalu- 
ations of the manifest anxiety displayed by 


1An extended report of this study may be ob- 
tained without charge from Robert Fox, Psychology 
Department, Longview State Hospital, Cincinnati 
16, Ohio or for a fee from the American Documen- 
tation Institute. Order Document No. 7508 from 
ADI Auxiliary Publications Project, Photoduplica- 
tion Service, Library of Congress; Washington 25, 
D. C. Remit in advance $1.25 for microfilm or 
$1.25 for photocopies and make checks payable 
to: Chief, Photoduplication Service, Library of 


Congress. 


AND WALTER LIPPERT 


Hamilton County Juvenile Court, Ohio 


these subjects indicated low anxiety levels E 
the SP group and high anxiety levels for ace, 
IP group. Differences between age, 12, cate 
and duration of internment were not significa 
Conductance (bi-palmer) was continuen 
measured via a Fels Dermohmeter and Ea oa 
Angus recorder. Each subject, under instruc ss 
to relax, reclined on a couch located in 4 Mit: 
low ambient noise level room. After aordi 
minute adaptation period, a 10-minute Biri: 
ing was taken. To analyze SA, 20-secon ere 
ments of each subject’s 10-minute recor ding 
examined for decrements in resistance exce? 
an arbitrary criterion; the total number 
segments containing one or more change 
stituted the score. To determine BSC were 
the mean square root conductance values jod. 
taken at each minute of the 10-minute Pen” g 
Results were: (a) the IP group exhibita 7) 
significantly greater frequency of SA ™ (b) a 
than the SP group (M = 8.5), p= 02; 
trend analysis of variance of the BSC 
revealed no significant differences 
groups and no significant trend. Thi 
tion between SA and BSC for the S Eed 
was +.37 (p >.05); for the IP group 
(p < .05). was 
Absence of differences in the BSC dat® ‘pe 
not unexpected, under the conditions OŚ 4 js 
experiment. The significant difference i”. phe- 
consistent with the hypothesis that this pect 
nomenon is positively related to some ® 
of arousal. 
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SOCIAL DESIRABILITY RESPONSE SET IN THE INDIVIDUAL? 


ERLING E. BOE 
Victoria College 


A number of studies have demonstrated high 
Correlations (r’s in the .80s) between average 
Social Desirability (SD) scale values and the 
Probability of item endorsement. However, re- 
Sults based on grouped data may not represent 
the magnitude of the SD response set in the 
Individual subject. The results of Taylor (1959) 
Suggest that the usual practice of grouping data 
Serves to obscure a large amount of individual 
Variance, although, as Taylor (1959) observed, 
this interpretation requires qualification in the 
absence of reliability data and the lengthy time 
interval between self-descriptions and SD ratings 
ìn his study. The problem of the present study 
1S to investigate the magnitude of the SD 
response set (as measured by a point-biserial 
portelation) in the individual without the limi- 
ations of the Taylor study. 
in is distribution of SD scale values of items 
eae Uded in several personality inventories was 
ound to be bimodal. Because heterogeneous 
Samples tend to produce spurious correlation, 
a scale of 100 MMPI items was constructed 
So that their SD scale values were normally 


ssttibuted about a mean representing neutral 


Three groups of college freshmen were formed 
follows: Group I, N=112; Group I, 
self: and Group III, N = 36. Group I gave 
ee scriptions (yes/no) on the items; 1 week 
ith they made SD ratings on the same 
Meek Group II gave two self-descriptions, 1 
apart. Group III made two SD ratings, 
Week apart. Point-biserial correlations between 
e self-descriptions of each individual in 


as 


an An extended report of this study may be ob- 
m ned without charge from Erling E. Boe (Depart- 
cnt of Psychology, Victoria College, Victoria, 
ritish Columbia, Canada) or for a fee from the 
merican Documentation Institute. Order Document 
9. 7509 from ADI Auxiliary Publications Project, 


hotoduplication Service, Library of Congress; 
ashington 25, D. C. Remit in advance $1.25 for 
: hotocopies and make 


Microfilm 
or $1.25 for pI sod as F 
Checks payable oe Chief, Photoduplication Service, 


ibrar i * 
2 ‘Alco oh OnE ae the University of Washington 
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AND 


WILLIAM S. KOGAN 2 


Veterans Administration Hospital, Seattle 


Group I and (a) their own SD ratings and 
(b) mean SD ratings yielded measures of indi- 
vidual SD response set. Reliabilities of indi- 
vidual performance, based on the subjects in 
Groups II and III, were used to compute a 
correction for attenuation. 

Correlations were computed across the 100 
items for each subject and averaged by using 
Fisher’s 2’. For Group I, the uncorrected mean 
Typ between self-descriptions and group SD rat- 
ings was .27, while corrected it was .32; the un- 
corrected mean 7p between self-descriptions and 
individual SD ratings was .40, while corrected 
it was .54; and the uncorrected mean r between 
group SD ratings and individual SD ratings 
was .49. For Group II, the mean ry reliability 
coefficient was .77. For Group III, the mean r 
reliability coefficient was .72. The mean corre- 
lation of self-descriptions with individual SD 
was found to be significantly higher than the 
mean correlation of self-descriptions with group 
SD at the .001 level of confidence. Finally, the 
correlation between the probability of endorse- 
ment and the group SD scale values was .53. 

The results suggest that correlations between 
probability of endorsement and group SD scale 
values do not accurately represent the magnitude 
of correlations between individual self-descrip- 
tions and group SD scale values. However, cor- 
relations between an individual’s self-descriptions 
and his own SD ratings were significantly higher 
than with group SD scale values. Hence, a rele- 
vant consideration in establishing a measure of 
the SD response set in the individual would seem 
to be the basis, individual or group, for deter- 
mining the SD attribute of test items. Due to 
the special nature of the scale used in the 
present study, it would be reasonable to expect 
the magnitude of the SD response set in the 
individual to be somewhat higher for some 
personality inventories in general use. 


REFERENCE 
TAYLOR, J. B. Social desirability and MMPI per- 
formance: The individual case, J. consult, 


Psychol., 1959, 23, 514-517. 
(Received November 1, 1962) 


Journal of Consulting Psychology 
1963, Vol. 27, No. 4, 370 


TAT AND MMPI PSYCHOPATH DEVIANT SCALE DIFFERENCES 
BETWEEN DELINQUENT AND NONDELINQUENT 
ADOLESCENTS? 


ALBERT W. SILVER 
Wayne County Clinic for Child Study, Detroit 


This study sought to determine whether the 
performance of male adolescent delinquents 
selected for having characteristics associated 
with psychopathy differed significantly from 
three control groups on the MMPI Psychopath 
Deviant Scale (Pd) and the TAT. 

Twenty reform school subjects were selected 
on the basis of a history of more than 18 
months of offenses, generally unsatisfactory re- 
lationships, absence of intrapsychic tension such 
as anxiety, guilt, or depression, as well as of 
other neurotic or psychotic symptoms. Prima- 
rily environmental delinquents were excluded, 
This group was matched for age and intelli- 
gence with a group of mild offenders from the 
same institution to control for environmental 
factors, with a group of orphan home residents 
to control for the high incidence of broken 
homes in the psychopathic group, and with a 
group of high school subjects. Six TAT pictures 
and the Pd Scale were administered in small 
groups. 

On the Pd Scale psychopaths had the highest 
number of deviant responses (M = 26), reform 
school control subjects were next (M = 22.4) 
followed by orphan home subjects (M = 22.2), 
and by high school subjects (M = 18.1), All 
possible comparisons between psychopaths and 
the other groups were statistically significant, 

TAT protocols were analyzed by Stein and 
Aron’s (Heymann, 1951) analysis of press and 
need variables. Orphan home and high school 
subjects told longer stories with more needs and 
presses than reform school subjects. Five of 33 


1An extended report of this study may be ob- 
tained without charge from Albert Silver, Wayne 
County Clinic for Child Study, 1025 East Forest 
Avenue, Detroit 7, Michigan, or for a fee from 
the American Documentation Institute. Order Docu- 
ment No. 7510 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C. Remit in advance 
$2.50 for microfilm or $1.75 for photocopies and 
make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 


need variables and 3 of 25 press gi 
discriminated significantly between at = nifi- 
groups. High school subjects expressed a 
cantly more achievement, recognition, and 17 
punitive needs than both reform school yale 
High school subjects also expressed more Ye pen 
tion need and dominance press than any ° 
group. ig- 
Reform school control subjects expressed ings 
nificantly fewer rejection press and sex stu ant 
and more affiliation press to a nonsign a, 
degree than any other group. On the e 
psychopaths emerged as distinct from other ifi- 
form school inmates only in expressing K 
cantly more need sex and press rejection. (Pd 
In conclusion, on an objective test ctive 
Scale), psychopaths emerged as a iste fort 
group even in comparison with other Ne ex- 
school inmates. On the TAT, psychopath pjes 
pressed sex needs more openly but their, : feel- 
were short and achievement strivings, guilt were 
ings, recognition, and retentive strivings 
expressed less often than in the high Sdent 
group. More hostile, destructive, Or depar i 
needs did not occur in the TAT stor cat 
psychopaths. While the high school group $2 im 
to be the most psychologically healthy, n com 


school control subjects seemed to be ed and 
stricted, relying on repressive, withdraw oe 
avoidant defenses for maintaining some chop” 


of social conformity, The concept of PSY whe? 
athy seems to have some validity eve? A de- 
applied to adolescents with relatively Þ”® rce 
linquent histories, The limited fantasy TS? use 
of psychopaths, however, argue against t A res 
of an instrument which calls for fantasy & 

sion in understanding their psychodynamics- 


REFERENCE f 
om 


P si 
Heymann, G. M. A TAT investigation a mas” 
Psychological factors in Acne. Unpublish 
ter’s thesis, University of Wisconsin, 1954- 


(Received November 8, 1962) 


370 


Journal of Consulting P. 
1963, Vol. 27, No. 4, ae 


DEVELOPMENT OF POTENTIALLY THERAPEUTIC INTERPERSONAL 
RELATIONS IN SCHIZOPHRENICS THROUGH COMPETITION + 


ALBERT E. MYERS? 


Educational Testing Service, Princeton, New Jersey 


This study investigated the effects of team 
Competition on the social behavior of schizo- 
Phrenic patients, Earlier, Myers (1962) found 
that interteam competition led to better adjust- 
Ment of team members than did an otherwise 
Comparable noncompetitive condition. This find- 
Mg suggested the possibility of using team com- 
Petition as a therapeutic device in the treatment 
Of people with mental disorders. 
ea trer-tiett chronic schizophrenic male pa- 

ts were selected to participate in a 6-week 
ae The patients were easily grouped into 
ree levels of psychological incapacitation. 
ion subjects were combined into 24 two-man 
ek Each team participated 1 day every 
In a recreational golf program. 

ee first 4 weeks were devoted to practice. 
Be teta were combined to form a foursome 
Š mee, ayed an average of six holes per day in 
rine ee fashion. The teams were then 
cot omly assigned to competitive and non- 

a Nea conditions for the final 2 weeks. 
whil al interactions of the patients were recorded 

Ue they played. 
reaa blishing a criterion in this kind of 
me en by no means, a simple task. The 
a: Hospital Adjustment Scale” was use- 

> Interview criterion data proved fruitless; 
1 
fine ended report of this study may be ob- 
cational 5 ou charge from Albert E. Myers, Edu- 
or a fee Fa ing Service, Princeton, New Jersey, or 
tute, On rom the American Documentation Insti- 
iliary Publi Document No. 7511 from ADI Aux- 
Library oe ae Project, Photoduplication Service, 
i adraia Looe Washington 25, D. C. Remit 
Copies M 23 for microfilm or $1.25 for photo- 
duplicati make checks payable to: Chief, Photo- 

2 Ca, lon Service, Library of Congress. 

Effectiv a out with the assistance of the Group 
tincipal Te Research Laboratory (Erea D 
-2060 vestigator) under contrac = 5 
ment of of the Office of Surgeon General, Depart. 
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paper-and-pencil measures were out of the 


question. 

Consequently, a whole new approach to 
criterion assessment was called for. The method 
which was chosen was a unique one and, hence, 
represented somewhat of an experimental 
gamble. It seems that this approach perhaps 
deserves serious consideration by researchers in 
this area. 

The basic assumption was made that patients 
with a relatively less severe degree of in- 
capacitation enjoyed better interpersonal rela- 
tions. That is, there should be a strong associa- 
tion between the level of pathology of a patient 
and his ability to function adequately in a 
group. A positive change in interpersonal ade- 
quacy or adjustment is, then, operationally de- 
fined as a team behaving more similarly to the 
better adjusted patients at the end of the 
experience than it did at the beginning. 

With these assumptions, multiple correlations 
were run with adjustment ratings as the cri- 
terion. This provided weights for the best pre- 
dictors so as to permit the compilation of a 
single score. This score represented the par- 
ticular weighting of those behaviors which were 
maximally effective in discriminating across the 
range of pathology. A spontaneous change in 
the appropriate behaviors was, once again, 
assumed to index a change in the level of inter- 
personal adequacy. The results suggested that 
competition facilitated adjustment among the 
better patients but that it had an unfavorable 
effect among the poorer patients. 

Our apparent success with verbal behavior 
suggests that certain inhibitions toward using 
this type of data with schizophrenics is unwar- 
ranted. 
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KENT E-G-Y: 
DIFFERENTIAL SCORING AND CORRELATION WITH THE WAIS? 


GERALD L. CLORE, Jr.2 


University of Texas 


The Kent E-G-Y Test has been widely used 
to estimate the level of intellectual functioning 
of hospital patients. The test consists of only 
10 items, and the author of the test gives the 
individual examiner a wide latitude in the inter- 
pretation and scoring of the items. The present 
study was carried out to determine the effect of 
different degrees of strictness in scoring the 
Kent on final Kent scores and their relation to 
WAIS scores. 

From a general medical and surgical Veterans 
Hospital population a representative sample of 
30 psychological referrals was tested. Two 
scorings of the Kent were made. An easy score 
was obtained by interpreting questions liberally 
and by crediting ambiguous answers, highest 
order abstractions, and correct answers that were 
spoiled by incorrect additional answers. A 
strict score resulted from interpreting questions 
strictly and from crediting only clearly correct 
answers, lowest order abstractions, and only 
unspoiled correct answers. 


1An extended report of this study may be ob- 
tained without charge from Gerald L. Clore, Jr. 
(Department of Psychology, University of Texas, 
Austin 12, Texas) or for a fee from the American 
Documentation Institute. Order Document No. 
7512 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress; 
Washington 25, D. C. Remit in advance $1.25 for 
microfilm or $1.25 for photocopies and make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. » 

2 This study was conducted at the Veterans Ad- 
ministration Hospital at Dallas, Texas. The author 
is grateful for helpful suggestions from As, J, 
Jernigan, Chief Psychologist. å 


The mean score of the Kents which were 


scored easily was 26.6 and the mean score a 
the strictly scored Kents was 24.1. A £ aa 
for difference between correlated means yielde 
a ¢ value of 6.63 which was significant at en 
-001 level with 29 df. Pearson product-me is 
correlations between easy Kent scores and V Per- 
scores were: Full Scale .68, Verbal .69, nt 
formance .61. Correlations between strict Ke 
scores and the WAIS were: Full Scale - a 
Verbal .69, Performance .63. All six coeficiep 
were statistically significant at the .01 eae 
determine if the three easy Kent corre 
differed from the three strict Kent correlator” 
a test for differences between correlations 
the same sample was used. There were nO 
nificant differences between the correlations: tly 
Since the mean Kent scores were significan 

different, but the correlations of easily were 
strictly scored Kents with WAIS scales “ to 
not significantly different, it is importa that 
Score the Kent using the same procedure ular 
was used when the norms for the parle 
population tested were obtained. Howeve?» the 
ferent scoring procedures do not alec” ions 
Kent’s validity. The size of the correla ato 
indicate that the Kent is a useful est ihe 
of WAIS scores. It should be noted the als 
Kent has no correction for age, while "nis 
IQs are corrected for age differences. work 
factor would, however, be expecte to ighet 
against a high correlation, so an even | pte 
correlation might be expected if WAIS vak 
scores were used rather than IQs. 
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EXPERIMENTAL PREDICTION OF THE LONG-TERM EFFECT 
OF PSYCHIATRIC THERAPY * 


GUDMUND J. W. SMITH asn GUNNAR JOHNSON 
Psychological Laboratory, Lund, Sweden 


pe eriou investigations (see Smith & John- 
i 962) the present authors have described 
Gone Serial (microgenetic) tachistoscopic tech- 
ae utilizing pairs of stimuli (one being in- 
eal with or threatening to the other) was 
ara for the study of the effects of psy- 
tied therapy. Reactions to this stimulation 
ae in three main categories of pre- 
ice (defensive) organization: isolation- 
main n, repression, and projection. A fourth 
of e A referred to a general syndrome 
mee as revealed in the perceptual 
ing dein fifth to signs of discontinuity signify- 
version urbances of a psychotic order. Parallel 
a a ae used before and after therapy. 
Were in ts implied that isolation and repression 
eae affected by therapy whereas many 
ten ae With depression or slight projective 
Were “a seemed to recover. Psychotic patients 
eco en registered as improved but seldom 
ob: ieee The present authors also wanted 
Scorn ore, 2 years later, whether the original 
ear, categories could predict the subsequent 
tes of the patients. 
Wo € original 48 patients could be divided into 
Proven aaps. In one group (U) either no im- 
ey fle: was reported for the subjects, or 
Btoup e within half a year. The other 
as o m) consisted of patients described 
Dlaints ae and not reporting any serious com- 
= ited the first 6 months, most of them 
Hig in 2 years. This differentiation resulted 
istinguish ve of 24 patients each. We only 
and a th here between presence of a sign, 
Proved ave Since isolation and repression had 
If the © be quite stable, they were scored even 
Th 4 appeared only in one of the experiments. 
Jection case of the more unstable signs of pro- 
thera and depression, results obtained before 
ae Were used for making predictions. 

to aes 19 cases with isolation 13 belonged 
DA P Im. The result certainly does not mean 
Posite extended report of this study has been de- 


teq 
Orde, „Vith the American Documentation Institute. 


Pup Scument No. 7513 from ADI Auxiliary 


tied: 
brary Bane Project, Photoduplication Service, Li- 
aWance Ongress, Washington 25, D. C. Remit in 
Die, 5 for microfilm or $1.25 for photo- 


s 
Publicatia make checks payable to: Chief, Photo- 
n Service, Libtary of Congress. 


that their compulsive symptoms disappeared or 
even improved substantially. But the compulsive 
character is probably likely to accept his status 
after therapy as a conclusive sign of how much 
he is able to improve. The effects of ego sup- 
port in a hysteroid person, on the contrary, 
are often known to be no more lasting than 
traces in sand. An exception was made for 
patients with simultaneous signs of isolation- 
negation. Thus, 8 of 9 patients with repres- 
sion but no isolation belonged to Group U, 
but only 1 of 6 patients with isolation 
plus repression. In the case of projection we 
picked out one category previously marked as 
grave and predicted that the probability of a 
stable improvement would be lower than in 
other subjects with projection. None of the 
mild cases showed simultaneous signs of repres- 
sion. Group Im, then, included 11 of 16 mild 
cases but only 1 of 8 severe ones. If signs of 
depression before therapy appeared together 
with signs of psychotic discontinuity, the prob- 
ability of a cyclic recurrence of the symptoms 
would be quite high. One category of 11 pa- 
tients with depressive signs plus signs of dis- 
continuity (6 patients) or repression (5 pa- 
tients) but no signs of isolation was compared 
with another category of the 16 remaining cases. 
Among 13 individuals in Im, 12 belonged to the 
latter category. 

An overall prediction for Group U, those who 
relapse easily or do not even improve in the 
clinic, would utilize the presence of severe signs 
of depression and projection, signs of repression 
but absence of signs of isolation. The rest of 
the patients would be more likely to belong to 
Group Im. There were 17 patients with nega- 
tive signs in Group U and 2 in Group Im 
(2 = 19.60, p < .001). If severe signs of pro- 
jection or depression are allowed to dominate 
simultaneous signs of isolation the numbers are 


but little affected. 
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ERRATA 


In the article by Edmund S. Howe and Benjamin Pope, “Therapist Verbal ae 
Level and Diagnostic Utility of Patient Verbal Responses” (J. consult. Psyc a 
1962, 26, 149-155), two corrections are required in the right column of page i z 
In line 5 the word “positive” should read “negative”; in line 6 the word “negatively 
should read “positively.” 


In the article by Irving Steingart, “Conditions of Personality Organization Related 
to Empathic Ability Among Normal and Schizophrenic Adults” (J. consult. Psych ihe 
1962, 26, 416-421), the second half of the sentence on page 418 beginning with aie 
words “The schizophrenic ‘high empathizer’ group” should read: “the schizophre! 


o 
‘low empathy’ group consisted of eight subjects who obtained empathy scores 
0-4 (the lower 17.4% ).” 


In the article by John E. Exner, Jr., Eugene McDowell, Joan Pabst, bile 
Stackman, and Lynn Kirk, “On the Detection of Willful Falsification in the MM oie 
(J. consult. Psychol., 1963, 27, 91-94), the statement, “Gynther (1961) has demon 
strated that a high F score is found in almost all cases of behavioral disorders swiss jes 
should have read as follows: “Gynther (1961) found that in a group of white ma a 
referred by the court for psychiatric examination, almost all the high F scores 3 
tained were given by cases of behavioral disorders.” 
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Brief Reports 


The Journal of Consulting Psychology will 
accept Brief Reports of research studies in 
clinical psychology for early publication with- 
out expense to the author. The procedure is 
intended to permit the publication of soundly 
designed studies of specialized interest or lim- 
ited importance which cannot now be ac- 
cepted because of lack of space. Several pages 
in each issue will be devoted to Brief Reports, 
published in the order of their receipt with- 
out respect to the dates of receipt of the regu- 
lar articles. Most Brief Reports appear in the 
first or second issue to go to press following 
their final acceptance. 


An author who wishes to submit a Brief 
Report: 


1. Sends the Brief Report, limited to one printed 
page and prepared according to the specifications 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the American 
Documentation Institute to insure indefinite avail- 
ability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 


4, Agrees not to submit the full report to another 
journal of general circulation. 
Specifications 


Brief Report. The Brief Report should give 
a clear, condensed summary of the procedure 
of the study and as full an account of the re- 
sults as space permits. 

To insure that the Brief Report will be no 
longer than one printed page, its typescript, 
including all matter except the title and the 


; ‘nes av- 
author’s lines, must not exceed 85 lin 


A b. 
eraging 42 characters and spaces Ee 
Set the typewriter margins for short it a 
42 characters, which are 3.5 inches ica 
elite typing, and 4.2 inches long 9 ae must 

The manuscript of the Brief ot for its 
be double spaced throughout. Except 1 of 


Manual. Headings, tables, and reference in 
avoided or, if essential, must b t be 
the 85 lines. Each Brief Report = pelo 
companied by a footnote in the st¥ ig no! 


os an 
which is typed on a separate sheet 
counted in the 85-line quota: pe ob- 

1An extended report of this study ivin ia 
tained without charge from John ee a fe ie 
author’s full name and address) oF fo ord Doe 
the American Documentation Institute. 0 


ment No. from ADI Auxiliary A of CF 
Project, Photoduplication Service, Librevance Sigs 
gress; Washington 25, D. C. Remit in 2 an Bi, 
for microfilm or $—— for photocopies, , Serv’ 
checks payable to: Chief, Photoduplica 


Library of Congress. re 


not copy to be sent to a piine, that s 
should differ in several ways f° de repor 
other manuscripts: (a) The exten% og f 


should be typed with single SP°7nq fË; 
economy in duplication. (4) Tables the w 
ures should be placed adjacent to pou! 
which refers to them. A caption ‘ei p 
typed below each figure. (c) Foot? on wr 
be typed at the bottom of the seat respeti, 
reference is made to them. In othe 
the full report is prepared in the s 
fied by the Publication Manual. 
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A STUDY OF CERTAIN SCHIZOPHRENIC DIMENSIONS 
AND THEIR RELATIONSHIP TO DOUBLE 
ALTERNATION LEARNING 


WALTER J. JOHANNSEN, SAMUEL H. FRIEDMAN, 
THOMAS H. LEITSCHUH, ann HELEN AMMONS 1 


Veterans Administration Center, Wood, Wisconsin? 
and Marquette University School of Medicine 


In order to clarify relationships among four dimensions thought to underlie 
schizophrenia, 52 patients were rated on a scale of premorbid personality and 

on a scale measuring the process-reactive dimension. They were further cate- 

gorized with respect to chronicity and the presence of paranoia. Results indi- m 
cated that the process-reactive, good-poor premorbid personality, and acute- 

chronic dimensions are essentially similar; and that the paranoid-nonparanoid 
dimension is independent of the others. Testing on a double alternation 

learning task demonstrated differences between paranoids and nonparanoids, 

but not between the poles of the other dimensions. 


i An recent years, several researchers have 
Vocated a change in the present nosology 
Ki schizophrenia, Suggestions have included 
anal ont based systems derived from factor 
a (Guertin, 1952; Lorr, McNair, 
» & Lasky, 1962; Wittenborn, 1951), 
R: E of these methods has found wide 
which here is another empirical approach, in 
explici assumptions have never been made 
consig but which has nonetheless generated 
erable research. This method consists 
whig ating classificatory dimensions along 
Ea ral, hizophrenics can be placed and 
to É e ating the scores on these dimensions 
Do fa correlates. The utility of the 
. mension then depends on the range 
it, o MPortance of behavior correlated with 
i fant these dimensions are firmly grounded 
ia € observable events, etiological and 
Ostic correlates can be elicited. 
t ha search has already demonstrated that 
Meng ensions derived from the older no- 
One pe have genuine descriptive value. 
Se ae these is the paranoid-nonparanoid 
phrenic dimension. Raters using inter- 


] 


l 

Mi hannsen is now at, the Curative Workshop, 

O Wis *Ukee, Wisconsin; Friedman is at Milwaukee, 

\ Adame sin; and Helen. Ammons is at the Veterans 
A nistration Hospital Palo Alto, California. 

Sisteg atitude is due to Douglas Holmes, who as- 

| Breh in the running of the subjects, and to Shirley 


data m, Who performed the statistical analysis of the 


view data can reliably differentiate these two 
groups even when other categorizations are 
difficult. Paranoids have been differentiated 
from nonparanoids on a variety of psycho- 
logical functions and abilities: tapping speed 
(Shakow & Huston, 1936); steadiness 
(Shakow & Huston, 1946); perception of 
Lissajou figures (Philip, 1953); Rorschach 
genetic level (Siegel, 1953); degree of sug- 
gestibility (Williams, 1932); conditioning 
(O'Connor & Rawnsley, 1959); response to 
incentives (Topping & O’Connor, 1960); 
alternation learning under social feedback 
conditions (Johannsen, 1961); degree of in- 
telligence test decrement (Mason, 1956); and 
perception of self (Havener & Izard, 1962). 
Goldstein and Carr (1956) suggest that dif- 
ferences in parental child rearing attitudes 
differentiate the groups as well. 

A similar picture emerges when schizo- 
phrenics are divided into acute (in the sense 
of recent onset) and chronic groups. Stable 
differences have been found between them on: 
reaction time (Huston & Senf, 1952); gestalt 
perception (Cohen, Senf, & Huston, 1956); 
response to praise and censure (Long, 1961); 
intelligence test performance (Feinberg & 
Garman, 1961); vocabulary (Rabin, King, & 
Ehrman, 1955); and appreciation of humor 
(Senf, Huston, & Cohen, 1956). 

During the last decade considerable quanti- 
tative work has developed about two other 
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dimensions: process-reactive and good-poor 
premorbid personality. The history of the 
former category extends back to Bleuler 
(1950), but research in the area was given 
impetus by the study of Kantor, Wallner, 
and Winder (1953) which systematized the 
elements that defined the dimension. In es- 
sence, reactive schizophrenia differs from 
process schizophrenia in being of more sud- 
den onset and involving more adequate inter- 
personal relationships, normality of affect, 
presence of precipitating cause and a better 
response to treatment. The literature reveals 
that process patients could be distinguished 
from reactive patients on: physiological re- 
sponse to mecholyl (King, 1958); flicker 
fusion threshold (McDonough, 1960); re- 
sponse to incentives (Reisman, 1960); Ror- 
schach genetic level (Fine & Zimet, 1959); 
Rorschach organic signs (Brackbill & Fine, 
1956); and performance on Benjamin 
Proverbs (Becker, 1956). 

A fourth posited dimension, good-poor 
premorbid personality, is defined in terms of 
a scale developed by Phillips (1953), which 
deals with the sex life and interpersonal rela- 
tionships of the patient. Due primarily to the 
interest engendered in this scale by research- 
ers at Duke University, an extensive body 
of literature has developed. Differences be- 
tween the two groups have been established 
in many areas: size estimation (Harris, 
1957); perception of pictures with inter- 
personal themes (Zahn, 1959); retention of 
hostile themes (Mallet, 1956); and remi- 
niscence (Bleke, 1955). Regarding etiology, 
differences have been found in parental child 
rearing attitudes (Garmezy, Clarke, & Stock- 
man, 1961); patterns of parental dominance 
(Baxter & Becker, 1962); and rural versus 
urban residence (Query, 1961). Finally, 
Farina and Webb (1956) have demonstrated 
prognostic implications for Phillips Scale 
scores. 

Examining the range of behavior touched 
upon, one notices that differences occur on 
similar tasks for different categorizations. 
Perceptual tests differentiate acutes and 
chronics, process and reactive, paranoids and 
nonparanoids, good and poor premorbids. 
Are these dimensions independent or are we 
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referring to the same underlying dimension 
in every case? Is a paranoid soem s 
one who has a good premorbid personality’ 
Is an acute schizophrenic always zenci 
Becker (1959), referring to earlier esar 
states, “These studies have led to a number 
of terms, all describing the same distinction: 


p Ë ess- 

malignant-benign, chronic-acute, ia 

reactive [p. 442].” Moreover, recent wo a 
1961; 


(Chapman, Day, & Burstein, 


refer to the Phillips Pre 


Wolfe, 1962) ess-reactive 


morbid Scale as a measure of proc 
schizophrenia. 

However, the issue is not so clear-cut Oy 
item content of the Phillips - win er 
(1953) alternatives. Those who clair 
acute schizophrenia is equivalent to ee 
schizophrenia ignore the facts that the por’ 
chronic dimension refers to @ cen 
sequence, while the process-reactive dme em 
is structural, and that many long-terns with 
tients begin their schizophrenic are Taote 
a reactive-like break with reality. Mo pizo- 
one might predict that a paranoid À act”) 
phrenic (usually considered “more ‘aed a 
would be more likely to be class natives 
reactive; yet one of the scale altemesenc® 
defining process schizophrenia 1s the P 
of massive paranoia. , 

The major purpose of this 
examine the relationships among 
dimensions and to determine wh 
jects classified as members of one 
within a dimension (e.g., reactive) 5 
be classified as members of an anaha 
gory within another dimension oi pur 
premorbid personality). Its secon’ dit 5 
pose is to extend the construct a jo 
these schemes by relating thes¢ 
to a learning task which is believe" avi 


+. tO 
1S 
study | four 


ether A 


wi 


R jo? 
relevance to an important aspect nfor as al 
namely, the ability to utilize se exter 
about performance derived from 
human source. 

METHOD Pr 
Subjects ii oie y 
P u jé 

The sample consisted of 52 Conse ar Dh 


vi 
i issi otic f 
phrenic admissions to the psy¢ Center pays 


Wood Veterans Administration wari 


service. Initial screening by the 


N 
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TABLE 1 


ACTUARIAL CHARACTERISTICS OF THE SAMPLE 


Time 
Total since 
hospital onset Education * Vocabulary 
Group (mos.) (yrs.) (yrs.) 1Q score Age N 
Total: 
M m 
D 21.33 7.79 11.20 105.65 28.28 (9) 36.96 52 
32.77 6.24 2.86 18.76 6.31 7.17 
Acute: 
M n " 
SD 2.30 2.70 12.22 110.00(2) 27.87 (2) 35.40 10 
2.05 5.10 2.39 14.80 $13 6.07 - 
Chronic H 
M ai 
SD 25.74 9.00 10.98 104.66(7) 28.37 (7) 37.61 42 
34.97 5.86 2.91 19.41 6.55 7.28 
Paranoid A 
M 
SD 23.14 9.00 11.32 113.13 (6) 30.53 (6) 38.75 21 
30.20 6.82 2.96 18.84 6.54 6.44 
Nonparanoid: 
M 
SD 19.93 6.97 11.13 101.64(3) 27.07 (3) 36.16 31 
34.33 5.66 2.80 17.44 5.84 7.34 
Pr cess 
m 
SD 32.08 10.65 10.80 106,22 (3) 29.56(3) 38.88 26 
40.58 6.16 3.18 19.29 6.76 7.38 
Reactive 
M ee 
SD 10.38 4.92 11.60 105.00(6) 26.80(6) 35.54 26 
16.29 4.84 2.50 18.11 5.39 6.44 
Goog 
M 
SD 21.23 6.18 10.86 100.76(5) 25.53(5) 36.90 22 
43.83 5.95 2.82 16.97 4.78 6.97 
Poor: 
M 
SD 21.23 8.97 11.45 108.85 (4) 30.08 (4) 37.37 30 
21.27 6.18 2.87 19.18 6.54 n21 
A total of nine subjects were dis- 


mber of subjects missing from that cell. 
nie ratings were obtained; three did not perform 
Ily identical to the larger sample. 


ont Sten, 
the double alternation task. 


> N i 
Ay Bed beforectals in parentheses above refer t 
hen Completion of intelligence testing, 5 
acteristics of the reduced samples were virtual 


imi 
h; n i s ; 
SUbje sie individuals unsuitable for the study. All would be comprised of relatively early saiko- 
rear Were between the ages of 20 and 55, were phrenics, as compared to the bulk of Veterans Ad- 
Vere win OMPlicating neurological conditions, and ministration psychotics, although they would appear 
Detign thheld from somatic therapy until the com- moderately chronic from the viewpoint of time since 
"isticg S testing. The nerinent actuarial charac- the initial diagnosis of schizophrenia. 

OF the t i 1. 
this aluation otal sample are presented in Table Procedur 
typ PPulati of the clinical data indicated that . j dads 
Re, ltinig lon is characterized by frequent relapses, The paranoid-nonparanoid an acute-cl 'hronic di- 
to dogs ,SPitalizations, and comparatively brief mensions were evaluated from psychiatric material, 
inpatient an F the viewpoint of the process-reactive and good-poor premorbid per- 
Eee pao i jons from interview ratings, 


tay len i 
SE gë hospitalization, the overall sample sonality dimensi 
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Paranoid-nonparanoid. Diagnosis established by 
the ward staff on admission was accepted as the 
criterion of the paranoid-nonparanoid dimension. 
Since it appears that this discrimination can be 
made reliably (e.g., Orgel, 1957), a more refined 
method of categorization was deemed unnecessary. 
Three of the four raters who participated in the 
study were unaware of the subject’s diagnosis at 
the time of rating. The fourth, a psychiatrist, 
participated in the admission staff meeting where 
diagnosis was established. Any bias resulting from 
the knowledge of the diagnosis was deemed minimal, 
since agreement on the Phillips Scale ratings between 
him and his fellow rater (who was unaware of the 
subject’s diagnosis) was excellent. 

Acute-chronic. Subjects were assigned to the acute 
group if total hospitalization was less than 6 months 
and the earliest schizophrenic diagnosis occurred 
within the previous year. One subject had been diag- 
nosed schizophrenic 18 years before, but had since 
made a good extramural adjustment, and had re- 
mained out of the hospital during that period. Since 
he bore a current acute schizophrenic diagnosis, 
he was included in the acute group for purposes of 
this study. All other subjects, a total of 42, were 
considered chronic, 

Process-reactive. The 24 items of the Kantor, 
Wallner, and Winder (1953) alternatives were ar- 
ranged in two columns, one referring to the process, 
the other to the reactive dimension. The rater 
checked the alternatives that best described the 
subject. After all items were checked, he then rated 
the subject on a 5-point bipolar scale which ranged 
from “classic process picture” to “classic reactive 
picture,” with a midpoint classification for indi- 
viduals who showed an equal mixture of process 
and reactive characteristics. The rating was to be a 
clinical judgment, in which the clinician was not 
necessarily bound by the number of characteristics 
checked in one or the other column. This expedient 
was deemed preferable to an arithmetic total because 
it could not be assumed that alternatives were of 
equal discriminating power. Where disagreements 
between raters existed, discrepancies were resolved 
by totaling the number of process attributes checked 
by the raters, comparing this total with the number 
of reactive attributes checked, and assigning the 
subject to the category in which the majority of 
checks appeared. Justification for this procedure lies 
in the fact that in only 7 of 104 protocols was there 
a discrepancy between the column with the majority 
of check marks and the final classification assigned. 

Good-poor premorbid personality. Section 1 of 
the Phillips Scale was used as a measure of pre- 
morbid personality, following the usage of the Duke 
group (Rodnick & Garmezy, 1957). Several areas 
of premorbid adjustment were scored on separate 
7-point scales: recent social adjustment, social aspects 
of sexual adjustment during adolescence and im- 
mediately beyond, sexual adjustment above and 
below age 30, and history of personal relations. 
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Median scores were obtained, and the subject y 
placed in the “poor premorbid” category if E 
median rating fell above the scale midpoint, an 
in the “good premorbid” category if below. à 
Three psychologists and one psychiatrist sent 
as raters. Two psychologists rated subjeces T aad 
process-reactive scale; the other psychologis ae 
the psychiatrist rated them on the Phillips ee 
The raters were all experienced, having 2 M ital 
of 5 years work with patients in mental pee 
settings. Ratings were done independently, s 0 
and after a brief, semi-structured init 
ducted by one of the psychologists and eemi 
by the other raters. Interview ratings were 
preferable to the more common method © 
clinical history data, since the majority © 
folders were incomplete. To avoid the 
of raters basing their judgments merely 
effect” derived from the interviewer $ ly con- 
the questions asked were brief, deliberately . 
fined to the content of the two scales, and ny 0 
to elicit categorical answers (eg. “Have in 
your brothers or sisters ever been a patien 
hospital like this? Do you get into fights 
Recently? How old were you when you 
first sexual experience?”). i e 
Each subject was again seen, usually a ee he 
week following the interview, and adminis ure that 
Shipley Hartford Intelligence Scale to elligence 
groups were adequately equated on the z a 
variable. Soon afterwards, they were tes sisewber® 
double alternation task, described in full c seated 
(Johannsen, 1959). In brief, subjects We" wooden 
at a desk with an unpainted 6X 10 inc® doorbell 
panel placed flat on its surface. Tw poard, 
buttons were mounted on this response er. 
inches to the right and to the left of nena angle 
experimenter sat beside the desk at 4 wire 


to the subject. The response buttons w ted 
in a circuit to two 7.5-watt light bulbs neat 
on a panel placed inconspicuously On pe ignal to 
the experimenter. These lights served tO oie had 
the experimenter which of the tw0 but exper 
been pressed. By focusing on the lights: "ign ° 

the ™ re 


menter was prevented from seeing iving 15 
the subject’s hands and inadvertently K subject 
mature information to him. All of t 


e 
responses were recorded. itive oubli 
The task was the learning of a repeti ni 


alternation pattern (RRLLRRLL). eat. subj i 
trials were given, each of which required 5 putto”; 
to respond by pressing one of the tW 

After each of the subject’s responses 
menter replied “right” or “wrong” in 2 
of voice. Adequate performance OR, 
quires an ability to codify, assimilate oe. A 
information from an external human 5° ck 18 ett 
trial initial series was run without feed e ab be 
to control for response biases and tO A PIL? 
to comprehend instructions. No subjec 
disqualified for confusion. 


SCHIZOPHRENIC DIMENSIONS AND ALTERNATION LEARNING 


RESULTS 
Relationshi 
ationship Among Dimensions 


oe Aaron how reliably raters could 
Bo0d-pvor jects on the process-reactive and 
Yo: raters premorbid personality scales, the 
Betcentare on each form were compared for 
the ps Sh fags On the Phillips Scale 
agreed ig Ogist and psychiatrist raters 
(good ve the dichotomous classification 
threefold i poor) in every instance. A 
reliabilit classification was used to evaluate 
ess, Bei in the process-reactive scale (proc- 
egree of ma and intermediate). Here the 
ratings m agreement between the independent 
analysis a rig sufficient to justify further 
intermediat the data. In data analysis the 
ubjects fall category was eliminated. All 
© either ing into this group were assigned 
lowing th process or reactive categories fol- 
o pte procedure mentioned previously. 
among in uate the degree of relationship 
tables yy e four dimensions studied, 2 X 2 
Possible ie constructed for each of the six 
'S not ‘sar ee of dimensions. Since it 
Sent nor wn whether these dimensions repre- 
ther. ot mally distributed continua (e.g., that 
S a gradation of “processness” from a 
devoig an picture to a reactive picture 
horie , process elements), both the tetra- 
tio i a which presupposes such a distribu- 
Wete co the phi coefficient, which does not, 
ate ee olted in each instance. The results 

Tt jg vtized in Table 2. 
lihis qa, that certain significant rela- 
nd Proce © exist between the acute-chronic 
Rot “ao dimensions, the acute- 
Te Poor good-poor dimensions, and the 
tq Ese relati x process-reactive dimensions. 
ati lonships are affirmed by both cor- 
methods employed. It thus appears 


1 
Classic 


tha t Onal 


es schizophrenic will be classified 
Mago? t ‘i nd as having a good premorbid 
Drei ii a chronic schizophrenic will be 
Morpjg > Process and as having a poor 

l Personality, that good premorbids 
Orbi : classified as reactive and poor 
ene Oid di as process. The paranoid-non- 
telapi the mension appears to be independ- 
'ONshi Other three. The high degree of 

P between the good-poor and 


Usual 
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TABLE 2 


CORRELATIONS BETWEEN Various HYPOTHETICAL 
DIMENSIONS OF SCHIZOPHRENIA 


Comparison Te ¢ max*® 

Acute-chronic versus 18 .09 59 
paranoid-nonparanoid 

Process-reactive versus .06 04 82 
paranoid-nonparanoid 

Good-poor versus AS .09 -96 
paranoid-nonparanoid 

Acute-chronic versus 71* .39* 49 
process-reactive 

Acute-chronic versus aa Rh 57 
good-poor 

.62* 39* 86 


Good-poor versus 
process-reactive 


* p <.01. 
s Values refer to maximum phi possible with the cell frequen- 


cies involved. 


process-reactive dimension can be accepted 
with confidence. However, the small num- 
ber of acute subjects renders any conclusion 
involving them tentative, and must be ac- 
cepted with caution. 

An attempt was made by item analysis to 
determine which items on the process-reactive 
scale served as cues for raters’ judgments. 
Chi squares were computed for each item 
relating them to the final categorization. 
Eight of the 24 items discriminated at better 
than the .05 level of confidence for both 
raters. Two additional items discriminated for 
one rater though not for the other. It would 
appear that the same set of cues was used in 
making the final clinical judgment, and it is 
probable that the scale in the form used in 
this study can be refined and abbreviated.* 


Relationship of Dimensions to Alternation 


Learning 

Prior to analyzing the differential perform- 
ance of the various groups on the double 
alternation task, it was necessary to determine 
to what extent extraneous variables played a 
part in the results obtained. Examination of 
Table 1 indicated that groups were closely 
equated on most of the variables likely to 
affect performance. Where two groups showed 


3 Copies of the abbr 
from the senior author, 


eviated form may be obtained 
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more than marginal differences, e.g., para- 
noids versus nonparanoids in intelligence, 
analysis of variance was used to assess the 
significance of these differences. Only length 
of hospitalization and time of onset showed 
significant differences between diagnostic 
groups. The effect of these variables was then 
evaluated by correlating them with scores ob- 
tained on the double alternation task. No 
correlation achieved significance. Drug ef- 
fects were not studied, but evidence obtained 
earlier (Johannsen, 1961) indicated that 
medication has little effect on this task. No 
examiner differences appeared, and the sub- 
jects’ preliminary response sets were un- 
related to subsequent task performance. We 
may therefore conclude that the effect of these 
variables on any obtained group differences 
is minimal. 

The basic analysis was performed on the 
number of correct responses made during the 
200 trials administered. Four analyses of 
variance were performed comparing acutes 
versus chronics, paranoids versus nonpara- 
noids, process versus reactive and good versus 
poor premorbid personality. Only the F test 
for the paranoid-nonparanoid classification 
was significant (F = 5.18, 1/47 df, p < .05), 
with paranoids performing more adequately 
than nonparanoids. The mean number of 
correct responses was 153.00 for paranoids, 
126.07 for nonparanoids. This result is in 
accord with that obtained by Johannsen 
(1961). In the earlier study, Johannsen noted 
a strong relationship between perseverations 
and number of errors, where perseverations 
were defined as single or RL alternations. 
Here, however, no diagnostic group showed 
a significant preference for these maladaptive 
patterns. 


Discussion 


Our results indicate a high degree of rela- 
tionship among several dimensions: acute- 
chronic, process-reactive, and good-poor pre- 
morbid personality. It is, of course, possible 
that these correlated dimensions have suf- 
ficient residual variance to predict differen- 
tially various performance or historical cri- 
teria. The correlations obtained do not 


account for more than half the variance in 
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any instance. A conservative view could jus- 
tify maintaining these dimensions as distinct 
until further research with a variety of 
criteria shows that they can be subsumed 
under a single heading. 

Nonetheless, it seems that at least ae 
good-poor versus process-reactive relation 
ship is functionally meaningful. A 

The Phillips Scale and the Process-Reactivé 
Scale are highly interrelated, and appear 
measure an underlying dimension, va 
might be labeled “general interpersonal a “ 
quacy.” Because of its higher interrater m 
ability, the Phillips Scale is perhaps a 
preferable research instrument. However, t 
abbreviated Process-Reactive Scale a 
to above may be easier to rate when is 
patient is guarded or when evaluation i 
made from incomplete file data. It is 1 ae 
esting that the two raters used muc dis 
same cues in making a process-reactlve © 


aaa . items whic 
tinction. The failure of some ith one 


the relative homogeneity of 
Moreover, several items appeared me 
for this sample. Not one of our 
showed a “Loss of decency (nudity, 
masturbation, etc.).” 

The high relationship between th 
chronic dimension on the one hand, 4” ef 
process-reactive and good-poor prem ae 
personality on the other, must be an a 
in light of the small number of eer 
tients involved. Moreover, one must qui al Jí 
whether this correlation is meaning prog 
process-reactivity is defined in terms aii i 
nosis, then it is obvious that an on 
who has been hospitalized for sever cove”) 
is less likely to make a satisfactory Top” 
than one who has just entered the 
for the first time. ti the 

Of considerable theoretical interes’ ymer 
fact that the paranoid-nonparano’ 
sion is apparently independent of th 
personal adequacy” dimension taPP ve 
Phillips Scale and the Process-Reac® +. 
Several pieces of evidence attest to sions ó 
clusion: the failure of these dimeer” 
intercorrelate; the significance of 


aningless 


subject 
public 


e acute 
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aay paranoids and nonparanoids on the 
ouble alternation task, coupled with the 
ge performance of groups when clas- 
oe de ways; and the failure of the 
discrimi ie Rig paranoia” item to 
silbconit, e etween process and reactive 
ban we renics. The results of the alternation 
study i confirm the earlier Johannsen (1961) 
might or paranoids versus nonparanoids. One 
i caw speculate that the pertinent variable 
individ attention or vigilance. Paranoid 
ae 53 s are more attentive to the environ- 
od an nonparanoids. Whatever differ- 
they ex between process and reactives, 
ea oe are not related to that par- 
ie al interpersonal function important to 
feedb on alternation learning under social 
ack conditions. 
ion tae chronicity is related to alterna- 
Srn is not established, since the 
abilit „group does not show sufficient vari- 
t ig a in this respect to settle the question. 
leva) re ge however, that the general 
= 135.98 accuracy (mean number correct 
itt the ) is far in excess of that obtained 
Correct _ Johannsen study (mean number 
Patiente ee A which used a group of 
total ho nat differed only in the length of 
spitalization. 
rng vs of interview material rather than 
ack a Ories (a procedure dictated by the 
Sane recorded data) represents a 
OUS resea PH method from the bulk of previ- 
east on = he The high reliabilty obtained (at 
tion for i Phillips Scale) is partial justifica- 
Suming és is approach, However, before as- 
tween i generality of the correlation be- 
Personali e process-reactive and good-poor 
t coi ity dimensions, it will be necessary 
The ane results using both approaches. 
in futur ors expect to examine this problem 
€ research, 
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The oral character type is said to be dependent and suggestible; anals are said 


to be obstinate and resistant. 


Consequently, it was hypothesized that orals 


would condition better than anals in a verbal conditioning experiment. 24 
strong oral or anal undergraduate Ss were selected by the Blacky Test; E did 
not know the character type of any S. A significant increase in the dependent 
variable was seen for the orals, while a pronounced drop was found for the 
anal Ss. The data support the prediction based on Freudian characterological 


theory. 


Through the years the overwhelming 

majority of concepts used to explain human 
chavior and to guide the course of psycho- 
€rapy have originated from the writings 

of Freud. Sears (1944), after reviewing the 
attempts to quantify the tenets and prin- 
ciples of Psychoanalytic theory, concluded 
that experimental psychology had yet to 
Make a major contribution toward stripping 
away the subjectivism of psychoanalytic con- 
cepts. Many consider Sears’ judgment too 
arsh and point to the heuristic value in the 
mectimental analogues of defense mecha- 
1940. as evidence (Miller, 1948; Mowrer, 
1940. Murray & Berkun, 1955; O’Kelley, 
or less complete 


wa > etc.), Further, more 
oe of personality and psychotherapy 
z e been offered by Shoben (1949), Dollard 
iller (1950), Mowrer (1950), and 
(1959), in which they substituted 
toid terms for dynamic terms to offer 
Rac, understanding and to suggest re- 
SVstem, approaches to the psychoanalytic 
in The Method of verbal operant condition- 
? “Owever, has been largely neglected as 
Ria for directly studying the theoretical 
S which are basic to much of the 


ascal 
learning 


aDpli 
Gia Work being done in clinical settings. 
foun advantages of the operant condi. 


Athen” „approach are its empirical, near 
Tetical nature and the marked flexibility 


it allows the experimenter in choosing both 
independent and dependent variables, 

The present study attempted to utilize 
Operant conditioning techniques to test the 
predictive efficiency of two of Freud’s most 
venerable personality concepts—the charac- 
terlogical types of oral and anal. The writings 
of Freud’s followers contain numerous refer- 
ences to the etiology of these character types 
and to the personality and behaviors expected 
of them, For instance, Fenichel (1943) and 
others have suggested that oral character 
types—being dependent—are highly suscepti- 
ble to suggestion from authority figures on 
whose assurance they depend for self-esteem, 
while anal characters tend to maintain their 
self-esteem through obstinacy and resistance 
to authority figures. Consequently, it was 
hypothesized for this experiment that Oral 
subjects would condition markedly better 
than Anal subjects when mild, affirmatory 
words from an authority figure were used as 
reinforcing stimuli. A second hypothesis was 
that more Anals than Orals would be able 
to verbalize the reinforcement contingency 
due to the obsessive compulsive or even 
paranoid method of approaching problems 
ascribed to anal types by Freudian theory. 


METHOD 


The Blacky Test was administered to two under- 
graduate psychology classes (N =90) from which 15 
Oral and 15 Anal subjects were selected for q Taffel- 
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type verbal conditioning experiment. All the proto- 
cols were administered and scored by Ez; Eı con- 
ducted all the conditioning portion of the experi- 
ment. E:, who was the instructor of both classes, 
did not know which subjects were Orals or Anals 
until after the conditioning data were completely 
collected. The scoring was done according to the 
method developed by Blum (1950). In assigning 
subjects to oral or anal experimental groups, it 
was necessary to ignore indicators of other areas 
of conflict (e.g., masturbation guilt, sexual indentifi- 
cation problems, castration anxiety) which are also 
afforded by the Blacky Test. Blum’s method of 
scoring entails weighing (a) the subject’s spontane- 
ous story, (b) answers on a multiple-choice ques- 
tionnaire, and (c) cartoon preference. Only indi- 
cators of orality and anality from these sources 
were used as criterion measures for this study. 

The experimental procedure was standard and 
simple. The subjects were seen individually, with 
Orals and Anals randomly intermixed. Each subject 
was seated in a chair with the experimenter behind 
and slightly to the left to obviate extraneous re- 
inforcing cues by the experimenter. A series of 120 
5X 7 cards were presented to each subject one at a 
time. On each at the top were 2 pronouns—one 
first (I or we), and the other third person (he, she, 
they). At the bottom of each card was a sentence 
fragment which, when preceded by either of the 
pronouns, made a complete sentence. This deck of 
cards has been refined, and each type of pronoun 
is equally represented and randomized as to position 
within each of the 12 blocks of 10 cards. 

The first 30 cards were used to establish an 
operant level for each subject’s choosing the selected 
class of pronoun. (For this experiment, first person 
pronouns were designated as the “correct” response.) 
During the next 60 trials mild, affirmatory utterances 
such as “Um-hmm,” “That’s fine,” “O.K.,” “Good,” 
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for 15 subjects rated as “Anals” 
rated as “Orals” on Blacky Test. 
n four blocks of 30 trials each.) 


tioning situation 
and 15 subjects 
(Data plotted i 
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etc., were administered following correct responses 
on a 75% variable ratio schedule. The schedule 
was employed to help minimize the number of sub- 
jects who might verbalize the reinforcement con- 
tingency. The last 30 cards constituted the extinction 
phrase in which no reinforcements were given for 
either choice. 

After each subject’s responses had been collected 
and recorded, he was asked the following three 
questions in an attempt to assess awareness: a) 
Now, what do you think this experiment was all 
about; what was the general idea? (b) Did you 
notice MY saying or doing anything? (answer) 
Why was I doing that? (c) Did you notice any 
change in YOUR responses from the first of the 
cards to the last? (answer) How did you change? 
(answer) Why did you change? 

Any subject who verbalized 
principle was omitted from the analysis of 
conditioning data. 


the experimental 
‘the 


RESULTS 


Figure 1 shows the conditioning data for 


each experimental group plotted in blocks of 
30 trials. The curve generated by the Oral 
subjects shows the typical verbal condition- 
ing curve: a level of response near chance 
during the operant phase, a continuing 1 
crease in the dependent variable throughout 
the treatment phase, and a drop back to 
near-operant level during extinction. Fried- 
man’s nonparametric analysis of variance or 
correlated data showed the overall effect ° 
treatment to be significant beyond the ~ 
level. Wilcoxon’s matched-pair T tests applied 
to the points along the curve reveal that © 
source of the significance is due to the dit- 
ference between the operant level and the 
second half of treatment level (p < -01)» an 
to the drop from the second half of treatmen 
to the extinction level (p < .05). The oine 
three comparisons failed to reveal significa 
differences. 

A marked contrast is noted when the ad 
formance of the Anal subjects is plotted. Da 
Anal subjects started at a high level (56% 
in choosing the personal pronouns aur 
operant, then dropped nearer chance 63% 
during the first 30 treatment trials, a" 
dropped to a below chance fred en 
(48.6%) during the second 30 geam d 
trials. When the experimenter discon” e- 
reinforcement during extinction, 
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quency of the dependent variable rose to near 
chance again (51.5%). The curve ap- 
Proached, but did not reach, the accepted 
Significance level. 
a rey! comparison between the two groups 
fonded a points along the curves was af- 
T test x the Mann-Whitney nonparametric 
- or independent data. The Oral and 
levels groups’ operant levels, the extinction 
all aR the first half of treatment for each 
free ed to yield significance; but the dif- 
Pee between the Orals’ 63% and the 
te ley % during the second half of the 
isa nt phase was significant beyond the .05 
- All p levels reported are two-tailed. 


Discussion 


or VPothests 1 was supported by the data, 
alike ang oups of students treated exactly 
ros differentiated only by Oral and Anal 
leg the Blacky Test showed strikingly 
conditio performance on a standard verbal 
ning problem. 

bes S Pothesis 2 was not supported. It had 
dina ed that the suspicious, precise 
with th to problems said to be associated 
nalk R Anal character would lead to more 
experim an Orals verbalizing the idea of „the 
the ex ent. Actually, six subjects verbalized 
Anale Perimental concept—three Orals, three 


mane rest hoc re-analysis of the data was 
Predict ‘pe how consistently one could have 
e€ Bla tae Oral or Anal rating on 
alone, 5 y Test from the conditioning data 
razzo ane the method used by Mata- 
ina, al. (1960), each subject was tallied 
hi m X 2 table as having “gone up” from 
during a number of personal pronouns given 
uring eoat to his mean number given 
Own» he treatment phase, or having “gone 
and Sea operant to treatment phase. Oral 
e re al subjects were tallied separately. 
Or ain table showed that of the 15 
“taye Ubjects, 12 showed an increase and 3 
Ana re same; none went down. Of the 15 
treatn jects, 4 increased from operant to 
€nt phases and the remaining 11 
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showed a decrease over the same comparison. 
A chi square comparison of these data yielded 
a p value considerably less than .01, reflecting 
the consistency of the data. 

Since the execution of this experiment, 
closely related findings have been reported 
by Cairns and Lewis (1962). These investi- 
gators used the Edwards Personal Preference 
Schedule to divide subjects into High and 
Low Dependency groups, then placed the en- 
tire group in a verbal conditioning situation 
very similar to ours. The two sets of findings 
were almost identical, for their Low Depend- 
ency subjects behaved very like our Anal 
subjects in showing a sharp decrement in 
emission of the reinforced verbal class; their 
High Dependency subjects showed a condi- 
tioning effect, though not as pronounced as 
our Orals. Further, an extension of the Oral- 
Anal predictor type experiment has just been 
completed by one of our students which repli- 
cated the findings of both this and the Cairns 
and Lewis study. It would appear that verbal 
operant conditioning can be a useful tool in 
investigating clinical concepts. 


REFERENCES 


Brum, G. S. The Blacky pictures: A technique for 
the exploration of personality dynamics (manual). 
New York: The Psychological Corporation, 1950. 

Carns, R. B., & Lewis, M. Dependency and the 
reinforcement value of a verbal stimulus, J. 
consult. Psychol., 1962, 26, 1-7. 

Dotrarp, J, & Miter, N. E. Personality and psy- 
chotherapy. New York: McGraw-Hill, 1950. 

FENICHEL, O. The psychoanalytic theory of neuro- 


sis. New York: Norton, 1943. 
J. D., Sastow, G., & Parens, E. N. 


MATARAZZO, 
Verbal conditioning of two response classes: Some 
methodological consideration. J. abnorm. soc. 


Psychol., 1960, 61, 190-206. ; 
N. E. Theory and experiment relating 


MILLER, r 
psychoanalytic displacement to stimulus-response 
generalization. J. abnorm. soc. Psychol, 1948, 43, 
155-178. 

Mowrer, O. H. An experimental analogue of regres- 
sion with incidental observations on reaction 
formation. J. abnorm. soc. Psychol., 1940, 35, 
56-87. 

Mowrer, O. H. Learning theory and personality 


dynamics. New York: Ronald Press, 1950. 


386 Epwin O. TIMMONS AND CHARLES D. NOBLIN 


Morray, E. J., & Berkus, W. M. Displacement as 
a function of confict. J. abnorm. soc. Psychol., 
1955, 51, 47-56. 

O’Kettey, L. I. An experimental study of regres- 
sion, I. and II. J. comp. Psychol., 1940, 30, 41-53 
and 55-95. 

Pascat, G. R. Behavioral change in the clinic—A 
systematic approach. New York: Grune & Strat- 
ton, 1959. 

Sears, R. R. Experimental analysis of psycho- 


analytic phenomena. In J. McV. Hunt (Ed.). 


Personality and the behavior disorders. New York: 
Ronald Press, 1944. 
Snosen, E. J. Psychotherapy as a problem in learn- 
ing theory. Psychol. Bull., 1949, 46, 366-393. 
ZELLER, A. F. An experimental analogue of regres- 
sion. I. Historical summary. Psychol. Bull., 1950, 
46, 39-51. 


(Received October 25, 1962) 


Journal of Consulti 
k t 
1963, Vol. 27, No. 5, ect 


FREQUENCY HYPOTHESIS AND CONTENT ANALYSIS 
OF PROJECTIVE TECHNIQUES 
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Northwestern University 


An attempt was made to evaluate the frequency hypothesis of projective 
theory by constructing a multitrait-multimethod matrix for 4 traits and 5 
methods. Traits measured were somatic concern, hostility, achievement con- 
cern, and religiosity. Methods were self-description, peer reputation, content 
Rorschach, Incomplete Sentences Blank, and Thematic Apperception Test. 
Additional measures were number of visits to health services, medical symptom 
check list, and scholastic average. Ss were student nurses, The frequency 
hypothesis received support only for specific traits as measured by specific 
methods. Self-description, peer reputation, and ISB yielded the most promising 
convergent and discriminant validities. Some evidence supported the validities 
of ISB and TAT for measurement of religiosity while the Rorschach had 
some convergent validity for measurement of hostility. Self-description and 
peer reputation measures yielded consistently higher correlation than any 


other combination of methods. 


ce (1961) has made explicit a 
ie ee of _ hypotheses which underlie the 
Scie bene techniques. The research 
Ses whi T concerns one of these hypothe- 
aie wwe have termed the “frequency 

ia 5 The frequency hypothesis states 
under 4 nose responses elicited or produced 
tions variety of different stimulus condi- 
Cnt a Particularly likely to mirror impor- 
iggy ss of the subject. [p 145)" Lewy 
Sumptio 1n a discussion of the logic and as- 
dla orate of psychological interpretation has 
ing th ed the frequency hypothesis in learn- 

ory terms: 


The 
assumption wd seem to be some support for this 
“arning an from the vast amount of research in 
a freque cory, whereby it is generally found that 
eT alter ncy of occurrence of a response increases 
Mere, natively, its probability of occurrence 


‘Ses . 2 
lp, 51 ) with increased training and reinforcement 
I: s 


ACh} 

Rt = knowledge of a person’s past history 

to regi ment, the clinician may attempt 

By E future test or extra-test behavior 

of certai S upon frequency of occurrence 
in responses during testing. Frequency 


“This 

va hd USP was completed while the first author 
Stern p, HS predoctoral research fellow at North- 

ayy The rersity. 

in 26 Dedas wish to thank Bernard Schiff, Judy 
the ; evine, and Virginia Smith who assisted 

Sment of responses and analysis of data. 


of response may be construed, then, as 
one potentially useful measure of response 
strength (Auld, 1954). 

In the present research, an attempt was 
made to investigate the usefulness of simple 
frequency counts in conjunction with content 
analysis of projective techniques. Frequency 
counts of specific content have yielded prom- 
ising results in several studies (e.g., Fisher & 
Cleveland, 1958; Harris, 1960). Moreover, 
concentration on content rather than determi- 
nants and the use of simple linear rather 
than configural methods of combining data 
hold considerable promise for high scorer 
agreement. Obviously, appreciable agreement 
among scorers is a necessary minimal require- 
ment for adequate measurement. 

The traits measured were hostility, somatic 
concern, achievement concern, and religious 
concern. The methods of measurement em- 
ployed were the Rorschach, selected Thematic 
Apperception Test cards (TAT), Rotter’s 
Incomplete Sentences Blank (ISB), self-de- 
scription, peer reputations, visits to the 
student health service (SHV), first year grade 
point average (GPA), and a typical medical 
symptom check list (MSC). 

In selecting traits to be measured, a 
deliberate attempt was made to select at least 
one clearly socially acceptable trait as well 
as one socially disapproved trait. It appeared 
reasonable, for this sample of subjects, to 
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consider religious concern as high in social 
acceptability and hostility as low in social 
acceptability. An attempt was made to select 
methods which vary in degree of structure. 
For example, the Rorschach represents the 
most unstructured and indirect of the methods 
of measurement while self-description repre- 
sents the most structured and direct. In order 
to assess the relative validity of the pro- 
jective techniques, self-description was in- 
cluded to provide a base line against which 
to evaluate the more unstructured techniques. 
Obviously, if projective techniques are ulti- 
mately to prove valuable, they should provide 
more accurate information than can be ob- 
tained by the simple and far less costly 
technique of asking the subject to describe 
himself. In the present investigation, peer 
reputations were considered a principal, 
although admittedly imperfect, criterion. 

In order to assess the discriminant validity 
of the methods of measurement, a multitrait- 
multimethod matrix was constructed (Camp- 
bell & Fiske, 1959). In brief, the multitrait- 
multimethod matrix provides the means 
through which the extent of true (trait) 
variance as well as unwanted methods of 
measurement can be assessed. 


METHOD 
Procedure 


The subjects consisted of 72 freshmen nursing 
students enrolled in an introductory psychology 
course at a large midwestern university. All of the 
testing was conducted in groups in two separate 
testing sessions separated by a time interval of 
3 days. The most direct measures, self-description, 
peer reputation, and check list, were the last measures 
gathered. Standard instructions for group adminis- 
tration of the Rorschach, TAT, and ISB were read. 
In the case of the TAT and Rorschach, the stimuli 
were projected on a large white screen and the 
subjects were instructed to write their responses 
in previously prepared booklets. An exposure time 
of 5 minutes was allowed for each TAT card and 
each Rorschach card. All of the Rorschach cards 
were presented. In the case of the TAT, the following 
cards were presented: 1, 3GF, 12, 7GF, 3BM, 8GF. 
The self-description and peer reputation ratings 
consisted of simple rating scales of 28 items, 7 items 
for each trait. Subjects were asked to distribute 
their ratings of themselves and their roommates 
over a 9-point scale. Sample items from the rating 
s follows: Somatic concern: “has vague 

scales were as i 1 health.” 
aches and pains,” “concerned about personal health.” 
Hostility: “gets mad at others,” “criticizes others. 
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Achievement concern: “concerned about grades,” “is 
dissatisfied with a C on exams.” Religious concern: 
“concerned with Christian values,” “interested in 
church matters.” At the time the ratings were 
gathered, the subjects had been living together for 
at least 6 months in small apartments which housed 
a maximum of four persons. In nearly all cases, 
each girl rated her three roommates and was rated by 
her three roommates. Considering the nature of the 
living arrangements, it seems reasonable to conclude 
that the subjects knew one another quite well. 


Scoring Systems 


Rorschach. As mentioned above, no attempt was 
made to elicit the determinants of the responses- 
All Rorschach scoring was accomplished on the 
content of the responses. Only the most obvious 
percepts were scored as positive for the trait under 
investigation. Thus, for Somatic concern all an 
tomical responses were considered relevant (Klopfer, 
Ainsworth, Klopfer, & Holt, 1954, p. 383). Percepts 
such as bones, X rays, viscera, and internal organs, 
as well as diseases such as cancer and tubercular 
lung, were all scored as positive for somatic gnc 
Blood with no mention of violence was also score’ 
for somatic concern. In the case of Hostility, al 
expressions of violence, human or animal, ye 
scored as relevant (Klopfer, 1954, pp. 381, 383). Z5 
addition, all implements of violence were also Score: 
(Holtzman, Thorp, Swartz, & Herron, 1961). 1 


a- 


devouring one another, warriors, 
scored as relevant for hostility. Rorschach i 
concern was scored for percepts such as cross, Jeni 
monks, church, temple, hands in prayer, ete- for 
was not possible to score Rorschach content 

Achievement concern. 


i ch 
Incomplete Sentence Blank. As with ane 
attention was again focused upon the ann ic 


the responses in the scoring of the ISB. For 50 ne’s 
concern, all responses suggesting concern with Thus, 
physical well being were scored as positive. of 
responses which mentioned actual physical inen” ae 
the subject or a person known to the subject reve 
scored. Expressions of fatigue were also conside ig 
indicative of somatic concern, A similar scor 


responses which indicated interpersona: 
aggressiveness were scored. Thus, “I hate mY hate 
instructor” was scored as positive while, cug- 
to study biology” was not. Responses W: ich “ 
gested Achievement concern were as follows: 
greatest fear is flunking out,” “Exams really id 
me,” “I really regret that I didn’t study ene pic? 
In the case of Religious concern, responses" por 
obviously dealt with religion were score®: iy,” 
example, “at bedtime, I read my Bible and Paal 
“I want to serve God with all my heart 2? 
were scored as positive for religious conce™? p she 
Thematic Apperception Test. The scoring rs ach 
TAT differed somewhat from that of the ae the 
and the ISB. No attempt was made to qu 
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ie KODE, thems, or stony eniling Shee TAT 
Siment requently so long and complex that the 
ae of a simple score can prove difficult. 
number spe word counts were conducted. The 
traits wer words thought indicative of the four 
. Televant e recorded. Words which were scored as 
“cancer,” or Somatic concern were as follows: “sick,” 
ug Cn? “fever,” “illness,” “heart attack,” and 
diteast i eee ob aoe l 
ing Were ostility, words such as the follow- 
“fight,” « Scored: murder,” “destroy,” “shoot,” 
concern Gee and “strike.” For Achievement 
Work.» « e following serve as examples: “study,” 
In the anbi oa “learn,” “school,” and “master.” 
the follo ase of Religious concern, words such as 
“Gog n» Wing were scored: “Jesus,” “praying,” “sin,” 


od,” d 
« Oh” “blessing” “Jast rites,” “priest,” and 


Minister.” 
co, 
amounted 8 eement. Since the resulting responses 
the init to an almost overwhelming mass of data, 
Mitial jn sha 
ducteq b indgments of the responses were con- 
Into Weare first author, who sorted all stimuli 
under inve and irrelevant categories for the traits 
Judgments stigation. Prior to the experimenter’s 
Were tyner all of the Rorschach and ISB responses 
White Ped by another person on individual 3 X 5 
Judgment €x cards, which were then shuffled. The 
"periment sedure was truly “blind” in that the 
Sola, ita had no information other than the 
è oe responses which were to be judged. 
; ioo. aetimenter had made his initial judg- 
va readonly selected irrelevant responses 
relevant these Rorschach and ISB responses 
tot irelevant - This new set of responses, relevant 
petther wi hi were presented to a second judge 
hree di ith instruction for making the judgments. 
i the hon Judges were employed, one for each 
Fate Is ithe Projective measures. In the case of 
Th Vant r second judge categorized each response 
ee Us, ¢ ance One of the four traits or as irrelevant. 
ine g os ae een between the experimenter 
t © ac al nd judge was } or 20%. For the ISB, 
e agreement between the experimenter and 
and chach, eee was 90%. In the case of the 
of One itrele gments were made of three traits 
Or 25% vant category with chance agreement 
obtained” An actual agreement value of 96% 
ni fudge server the experimenter and the 
n à spur: er the Rorschach. It should be noted 
vat extremely y high agreement value arising out 
Pe sponse. ly disproportionate number of irrele- 
a ee the me not possible by this method. 
ingi lows, All » Scorer agreement was demonstrated 
Spo idua] shi of the TAT stories were typed on 
ign ets of paper and shuffled. Again, the 


to er ir 
a oth Made his judgments with no informa- 


Yi 


eet Ot 
Den ther e his 

Select enter at individual TAT stories. After the 
of te made his judgments, 85 randomly 


med ston 
to TAT sts were obtained from the total pool 
The? secon; © These 85 protocols were presented 


Sego, relay: WUdee with a iate instructions. 
on trela; ppropriate 

toned ve between the experimenter and the 
8p etn, or the TAT were as follows: somatic 


aa a 
"8nd religi hostility, .92, achievement concern, 
Slous concern, .94. 
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In general, it seems justified to conclude that the 
scoring systems produced considerable agreement. 
Considering the fact that attention was focused 
upon the most obvious attribute of the response, 
the content, the high agreement values which were 
obtained are not surprising. 


Analysis of the Data 


Prior to correlational analysis, the raw scores 
from several of the methods were converted to 
indices. Since neither the TAT nor the Rorschach 
controls the number of responses a subject may 
make to a single stimulus, it was necessary to 
convert each trait score into a ratio score. This 
was accomplished by dividing each trait score by 
the total number of responses. In this manner, a 
possible spurious correlation between trait scores 
and total responses was made considerably less 
likely. For the peer reputational scores, mean peer 
reputational scores were computed and employed in 
the correlational analysis. All of the possible inter- 
correlations among the 22 variables were obtained, 
resulting in a matrix that consisted of 231 Pearson 
product-moment correlation coefficients. 


RESULTS 


Table 1 presents the multitrait-multimethod 
matrix obtained by intercorrelating the data 
obtained for the four traits by the seven 
methods of measurement. Utilizing peer repu- 
tational ratings as a principal criterion, it 
is obvious that the highest validity coefficients 
in the entire matrix involve self-description 
and peer reputation. All of the peer reputa- 
tional self-description diagonal values are 
significant beyond the .01 level of signifi- 
cance. Moreover, the validity diagonals 
are clearly higher than the off diagonal 
heterotrait-heteromethod values as well as the 
values contained in each of the respective 
monomethod-heterotrait groups of correla- 
tions. Thus, both methods, peer reputation 
and self-description, meet the criteria for 
reasonably good convergent and discriminant 
validity. 

It is encouraging to note that both self- 
on and peer reputation yield signifi- 


descripti 1 
cant relationships with behavioral measures. 
For achievement concern, both of the 


measures correlate significantly with scho- 
lastic average at the close of the academic 
year. For somatic concern, peer reputation 
correlates significantly with number of visits 
to the infirmary while self-description does 
not. However, on a second self-description 
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a number of medical symptoms 
Pan a significant relationship with visits 
e infirmary was found. 
ee pe projective techniques, it is 
Cleat t at the ISB yielded the most en- 
Sing validity values. Inspection of the 
vali Peg diagonals reveals five significant 
a ee With the exception of 
—— significant negative correlation 
Rion t SB somatic concern and peer repu- 
validity a achievement, the discriminant 
results A the ISB appears acceptable. The 
somewh S the Rorschach and the TAT are 
viel ced more discouraging. The Rorschach 
the TAT one significant validity value and 
nant aia two. Moreover, the discrimi- 
ment was : y of both methods of measure- 
@ number ound wanting in that both yielded 
ich of heterotrait-heteromethod values 
Validity or higher than the obtained 
Na In addition, the obtained 
} ene hieterotrait correlations for the 
lot in two instances approximately 
_ to the obtained two significant 
tion z {iagonals. In summary, self-descrip- 
®ceptaby Peer reputation appeared to be 
Of the ts € methods of measurement for all 
ificant che measured. The ISB yielded sig- 
Concern elationships for hostility, religious 
discrim; and somatic concern. Moreover, the 
ceptable validity of the ISB appeared 
Rethog €. The TAT, scored by the present 
of a see ay prove of some use as a measure 
Cr, pclally acceptable trait, religious con- 
“a Rorschach, on the other hand, 
he significant validity value for a 
ver, bo UNnacceptable trait, hostility. How- 
; y methods of scoring the Rorschach 
rmg AT leave much to be desired in 
\scriminant validity. 
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the present research does not 
© question of the validity of the 
Wa. que ri and TAT per se. Such a 
the ade : ion is meaningless. An attempt 
She, frequene, evaluate a specific hypothesis, 
ana ic Sco Cy hypothesis, in the context of 
tesis, Whe systems based upon content 

arch q hile it is apparent that the present 
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question of the validity of projective tech- 
niques, it should be equally clear that the 
investigation does not provide a final answer 
as to the usefulness of the frequency hypothe- 
sis. Howeyer, the frequency hypothesis does 
appear tenable when used in conjunction 
with content analysis of projective techniques 
for some traits and some methods. 

Of considerable interest in the present 
investigation is both the convergence as well 
as the lack of convergence between projective 
measures and more public measures of per- 
sonality. With the exception of the Rorschach, 
it is clear that all of the measures converged 
for the socially acceptable trait, religious 
concern. Thus, for a socially acceptable trait, 
frequency appears to be a very useful 
measure of response strength. However, with 
the exception of Rorschach content, measure- 
ment of a socially unacceptable trait, hostility, 
resulted in an appreciable decrease in valid- 
ity. The fact that Rorschach content for 
hostility correlated significantly only with 
self-description is indeed suprising. However, 
it may well be the case that the content of 
the Rorschach response is closely related to 
the other rather accessible verbal responses 
of the subject while the determinant of the 
response is not. Clearly, the present research 
does not provide an answer to this question. 

Since the present research is concerned 
primarily with the degree of convergence 
among methods varying in degree of struc- 
ture, it is important to consider possible 
sources of method variance. Of rather critical 
concern is the amount of “hidden” method 
variance which might exist between methods 
which, at an obvious level, appear to be 
independent. While it might be thought that 
self-description and peer reputation are inde- 
pendent, it seems likely that they share 
two important methods factors. First, self- 
description and peer reputation measures 
usually involve a common descriptive vocabu- 
lary, e.g, when the subjects and their peers 
are asked to make ratings on “hostility.” 
Such commonality of descriptive terminology 
does not exist between such methods as the 
Rorschach and TAT. Second, self-description 
and peer reputation measures are not inde- 
pendent from the standpoint of method by 
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reason of the fact that each measure very 
likely reflects something of the other. They 
very probably develop in a reciprocal way. 
Attitudes toward the self develop, in part, 
out of the reactions of others to oneself. 
And our reputations are probably based in 
part on our descriptions of ourselves to 
others. 

However, it would very probably be er- 
roneous to conclude that peer reputations are 
to be dismissed as “nothing but” one or 
the other of the above notions. Certainly, 
among reasonably sophisticated and intel- 
ligent populations, impressions about others 
are not formed solely on the basis of what 
others say or show directly about themeslves. 
Marlowe (1962) has shown that fraternity 
men who paint an exaggeratedly favorable 
picture of themselves are less well liked than 
those who admit to what would seem to be 
flaws of personality and character. In such 
a case we might conclude that the peer 
reputations are indeed insightful and reflect 
something other than persons’ verbalizations 
about themselves. 

In some respects it may be helpful to 
regard a given behavior from several vantage 
points. First, one may think in a rather 
abstract or theoretical sense of the underlying 
“behavioral disposition” (Campbell, 1963) 
which will at some point become overt and 
thus observable by others. However, in the 
process of being translated into an overt 
response, a behavioral disposition will be 
filtered through the cognitive and other sys- 
tems of the behaver, perhaps being com- 
pletely altered in the process. Thus, for 
example, an underlying disposition of hostility 
toward other persons may, because of a past 
history of negative reinforcement for such 
behavior, be inhibited. When the individual’s 
past history of negative reinforcement for 
hostile behavior is known (Lesser, 1957) 
and when expectancies of punishment for 
aggressive behavior are present (Mussen & 
Naylor, 1954), the relationship between pro- 
jective responses and overt behavior is much 
clearer. = 

In addition to modification or inhibition 
of underlying behavioral dispositions, one 
must consider the intent of the behaver, i.e., 
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what effect the behaver means to produce. 
However, it would be clearly incorrect to 
assume a one to one relationship between 
the intent of the behaver and the impact of 
his behavior upon others. In other pul 
the intent of the behaver may not be reflecte 
accurately in the response which he arouses 
in others. Thus, a response which the behaver 
intends as hostile may in actuality be a 
inept or may strike such an insensitiv 
environment that its intended hostility ' 
missed. 

Obviously, then, a given measure may pe 
into the behavior eventuation process at om 
of three points: the underlying behavior 
disposition, the intent of the behaver, pe, 
the impact of behavior upon the environmen j 
It appears reasonable to assume that mit 
description has a strong component of a 
whereas peer reputations should be M 
nearly a measure of impact. Moreover, m 
theorists and clinicians would be incline 
agree that projective measures should pro ‘ia 
some measurement of the “pure” or on ale 
behavioral disposition underlying ~ the 
process or some measurement © vene 
transforming operations that may inter 
between disposition and intent. 


able trait, one would expect to find cans, 
able congruence among disposition, vestige” 
and impact. Indeed, in the present 19 iel 
tion, measurement of religious concern y eas 
the highest validity coefficients for al chach 
ures with the exception of the Rors jgnifi 
The failure of the Rorschach to yield 5'8 
cant correlations for religious concern 
well be a function of the stimuli. > 
Rorschach cards, the “pull” for T ini 
percepts may be, in actuality, 50° 
as to preclude frequent occurrence 
percepts, even by the religious zealo rove’ 
In the measurement of socially cor find 
traits, one would hardly expect, sition 
appreciable congruence among hong not? 
intent, and impact. It is interesting o esch, 
that the projective measures, with t jeld? 
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impact Undesirable traits. Similarly, response 
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Cant and somatic concern correlate signifi- 
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position of which the actor may be at least 
partially aware but which is either so trans- 
formed in overt response or is so subtle in 
its manifestation that its impact is nil. 
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THE RELATIONSHIP OF EXPECTANCY OF REWARD 
TO ACHIEVEMENT PERFORMANCE ON AN 
ARITHMETIC AND THEMATIC TEST? 


BERNARD I. MURSTEIN 2 


Connecticut College 


Atkinson’s theory of motivation was tested with regard to a thematic test 
scaled for achievement value and a simple arithmetic task. Three randomly 
selected groups, of approximately 20 college men each, vied for a $1.25 cash 
prize, their objective probability of success being .1, 5, and .9. After the 
achievement arousing instructions they received a thematic test followed by 
the arithmetic one. Hypothesis 1 predicted the greater manifestation of n-Ach 
for the .5 group over the others. Hypothesis 2 predicted a similar superiority 
for the arithmetic task. The results largely supported Hypothesis 1, but not 
Hypothesis 2. The stimulus properties were found to greatly influence the re- 
sponse, the medium-achievement structured cards being the most differentiating. 


In recent years Atkinson (1958) has pro- 
posed a theory to account for the expression 
of motivation to achieve under a variety of 
circumstances. The theory states that the re- 
sponse is a function of the product of the 
motives operating on a subject, his ex- 
pectancy of goal attainment through making 
this response (subjective probability of suc- 
cess, P,), and incentive (1-P,). The equation 
representing incentive follows the logic that 
the goal which is the least likely to be at- 
tained possesses the greatest attraction. The 
approach gradient of motivation to achieve 
is represented by 


Motivation to Achieve = M, x P,X I, 
where 


M, = strength of motive to achieve 

P, = subjective probability of success 

I, = the incentive value for achievement 
=1— P, 


An avoidance gradient such as “fear of 
failure,” where measured, would be repre- 
sented by the following equation: 


Motivation to Avoid Failure = M; X P; x I, 


1 The study was carried out through the support 
of a Public Health Service Grant (M-4698) from 
the National Institute of Mental Health. 

2 The author wishes to express his thanks to 
Martyn Spitzer for his aid in carrying out the 


study. 


where 


M; = motive to avoid failure 

P; = probability of failure ; 

I, =the incentive for avoiding failu 
= Py 


re 


PEE sum 
The resultant motivation equals the net 


of adding the two gradients. «active 

P, may be differentiated from the obee 
probability of success P, since a persons task 
mate of his chance of succeeding at ait o 
may depart from the objective probabi W of 
success. If, for example, the top 10 neli 
20 students are to receive a prize fOr their 
performance, some students will perceive pile 
chances as better than one in tw 
others will not. Earlier work (Easter & 
stein, in press) has indicated that for 4 Sgian 
of randomly selected college men, the E of 
value of estimated subjective probabi 0 
success and the objective probabi! 
success are approximately equal. 

If through the process of randomly = 
subjects, motive strength which is co i 
to be a stable personality character” iva” 
Atkinson, is held constant as well as son t 
tion to avoid failure, then Motivat!? p, 
Achieve becomes a function of Po % yes 
This product is at a maximum W", x C 
probability of success is .5, 
— .5) = .25, and is lower for any ° T 
of P,, the distribution taking the 
a parabolic curve, 
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RELATIONSHIP oF REWARD EXPECTANCY TO ACHIEVEMENT PERFORMANCE 


\ os studies by the author and col- 
the the ave Supported the effectiveness of 
lane regarding the prediction of per- 
a eet of children on simple arithmetic 
itr aoe on the projection of achieve- 
1962) wen cards (Murstein & Collier, 
up Goer, college men instructed to make 
fae ot pe Stories,” which were then scored 
resulted (3 only partial support of the theory 
ed (Easter & Murstein, in press). 
"trop ai Studies, however, did not fully con- 
; coed aes which might be expected to 
factors , t = character of the response. These 
Sexeg Ee uded (a) the use of combined 
and (3) Ma masked possible sex differences, 
the boat failure to adequately determine 
the res of the thematic stimulus upon 
®ssential] nse. The earlier experiments were 
t erefore. repeated in the present study, 
in that : ee for these two variables 
he stim college men were employed and 
easured Us pull of the thematic cards was 
Value oH with regard to judged achievement 
Bb or to the experiment. 
'Ypotheses were: 


la 
Men p SOUP of randomly selected college 


Cash fant a .5 expectancy of receiving a 
Cellence Ze of $1.25 contingent upon the ex- 
Nore a their performance will project 
havin n vement imagery than (a) a group 
wu also -l expectancy of success, and (b) 
han arog Ect more achievement imagery 
k 2. The oe having a .9 success expectation. 
Paditions” 8roup will likewise under similar 
an eitha, tempt more arithmetic examples 
®t the .1 or .9 groups. 


ogtitty à PROCEDURE 
on X c 
Setje, aee men from two introductory psy- 
insty ns, ay Were randomly divided into three 
Uetions tom ee received the following 
e examiner: 


fate f 
mit to Sea mateting an experiment in which we 
il hen E Nat you can do on arithmetic prob- 
am, T ten rere go, you start and keep working 
of the S so fet to stop. There are quite a few 
t em, 6 is unlikely that you can finish all 
Brigg Ms o > persons finishing in the top 10% 
here f $1.2 umber correct will receive a cash 
- There are approximately 20 persons 
means the two highest scores will 
Prize. (For the other two groups, 
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the winning percentage was set at 50% and 90%, 
respectively). 


After all questions had been answered, E? continued, 
saying: 


Due to the shortage of time remaining and the 
length of the arithmetic task, we will not give 
out the arithmetic sheets today, but will do so 
at your next meeting in two days. Today, we 
would like to have you participate in a normative 
study of college responses to the TAT. This project 
has been going on since last term and is designed 
to obtain an idea of how college students respond 
to the various TAT cards. 


At this point the usual McClelland instructions 
for writing TAT stories were given, including the 
projection of the cards for 20 seconds and the 
allotment of 4 minutes for the writing of the 
story. Nine cards were used, seven TAT cards, and 
two which were selected from the list provided by 
Atkinson in his book (Atkinson, 1958a, pp. 832-834). 
These cards had been selected from 38 cards scaled 
by the Likert technique for amount of achievement 
depicted, on the basis of differentiating between high 
and low judged achievement scores for all of the 
cards, and represented the entire range of the con- 
tinuum. The cards had been divided into three sub- 
sets of three cards each: High (pioneer chopping 
down tree—Atkinson no. 98—2, 17BM), Medium 
(8GF, 14, boy at desk with open book in front of 
him—Atkinson no 8), and Low (20, 3BM, 13B). 
They were presented in a single order randomly 
arrived at, except that every three cards had to 
contain one card from each of the subsets. The 
order of administration was: card showing pioneer 
chopping down tree, 8GF, 20, 3BM, 14, 2, 17BM, 
boy in checked shirt at a desk with an open book 
in front of him, 13B. 

On the following meeting of the class, the experi- 
menter repeated the instructions, the arithmetic ex- 
amples were given, and a time period of 20 minutes 
allotted for the 600 arithmetic problems. These 
consisted of simple arithmetic or subtraction ex- 
amples of the type 33+71=? and 42—29=? 
Only one of the students finished all of the examples, 
this student doing so just as the time period elapsed. 

The accuracy of the scorer in scoring the stories 
was verified prior to the experiment by comparing 
his scoring of a series of stories listed in Atkinson 
(1958a) with the scores given the stories in the 


3 Because, as Birney (1958) has shown, the 
person giving the test whether student or professor 
may profoundly affect the amount of achievement 
educed, certain precautions were taken. Both the 
author and his assistant, a senior college student, 
essayed a relaxed but not informal role, being neatly 
attired in white shirts, but wearing sweaters. The 
role of the administrators was explained as “par- 
ticipating in the research project.” As far as was 
known, neither of the administrators was known 


to the students, 
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book by experts. The interscorer reliability of .89 
was deemed sufficiently high to assure adequate 
reliability of scoring for the present study. 


RESULTS 


To test the first hypothesis, the means and 
standard deviations of n-Ach were obtained 
for each of the groups and are shown in 
Table 1 along with the means and standard 
deviations of the low, medium and high sub- 
sets. The ¢ values are shown in Table 2 for 
the group comparisons. This table shows that 
the .5 group as predicted manifested signi- 
ficantly more m-Ach than the .1 group. A 
nonsignificant trend is evident when the .5 
group is compared to the .9 group, with the 
.5 group manifesting more n-Ach. The im- 
portance of the stimulus structure is further 
indicated in Table 2 by the fact that the 
medium structured cards are shown to be 
responsible for the superiority of the .5 group. 

The second hypothesis was tested by com- 
puting the mean number of arithmetic ex- 
amples attempted and running ¢ tests be- 
tween the groups as shown in Table 3. No 
differences were found for any of the com- 
parisons, and the hypothesis is thus not 
supported. 

Further analysis was undertaken to see 
whether the number of arithmetic examples, 
correctly solved, differentiated the groups, but 
t tests showed no significant differences. 

An unanticipated event which occurred 
during the course of the experiment provided 
an indirect test of the assumption that 
achievement imagery and number of arith- 
metic examples attempted were valid indices 
of achievement motivation. Nine students of 
the .1 group did not show up for the second 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF n-ACH SCORES 
FOR THE TOTAL, Low, Meprum, AnD Hicu SET 
or THEMATIC CARDS 


Group Total Low Medium High 


M SD M SD M SD M SD 


I =—28 142 -81 .76 —66 .82 .61 1.89 
‘5.18 1.77 —.61 1.04 .27 190 .88 1.90 
3o 14 160 —.77 .74 —67 .53 1.02 221 
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TABLE 2 


t VALUES FOR THE DIFFERENCES IN 
n-ACH BETWEEN THE GROUPS 


Groups Total Low Medium High 
9vs..5 1.67 90  3.42** 2 
5vs..1 2.00 115 3.26** A 
9 vs. .1 98 .28 .07 1.07 
o E e ma ee 
*p <.05. 
**p <.01. 


session in which the arithmetic task was = 
ministered. These persons were subseque? 7 
given the task at the next session, Howe 
and are included in the .1 group ors 
Since not a single one of the subjects of a 
other groups missed the second session, ma 
subjects’ absence is hardly attributable i 
chance, The only apparent reason for t o 
absence seems to be the low probability hy- 
their obtaining the reward. It was d 
pothesized, therefore, that this group “en 
attempt fewer examples and project thet 
achievement imagery than any of the en 
groups. Consequently, further analysis 
undertaken comparing the .9, 
“prompt” .1 members (N = 12) «metic 
“tardy” .1 subjects (NW = 9) for arithn 


a solvet 
examples undertaken and correctly Ks 
and achieve” 


predicted direction. The findings for 4 dicat? 
ment imagery as shown in Table 4 chiev” 
that the late group projected less ae 

ment imagery (M = —.82) than any © up 
other groups, though only the he t 
showed a significant superiority for t scored i 
of 9 cards. Each of the other groups ards: 


aa A ji Ce 
significantly more imagery for "a signif 
y 


examples attempted. As with the ear i g Ha 
by Murstein and Collier, the come rov" 
the .1 (7 = 00) and .5 groups (m z the 
to be nonsignificant, while that $0 
group reached significance at the - 
(r = .48). 
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TABLE 3 


MEAN NUBE 
A NUMBER OF ARITHMETIC EXAMPLES ÅTTEMPTED, 
TANDARD DEVIATIONS AND VALUES FOR 
GROUP COMPARISONS 


Mean 
examples Group 
Group attempted SD comparisons t 
-9 376.4 92.1 9 vs. .5 21 
373.6 104.0 veel, 1.50 
‘ 349.6 117.1 .9 vs. 1 1.32 
DiscussIon 


sme ae of this study support Atkin- 
n-Ach mea with regard to the projection of 
Metic i = not for performance on the arith- 
Work aE M This is contrary to the earlier 
asten Murstein and Collier (1962) and 
Probab Murstein ( in press) where varying 
cant EPRA groups did not show clear signifi- 
ailure PESA in achievement imagery. The 
f nih the arithmetic tasks to differentiate 
exactly ups in the present study also runs 
and Gone to the findings of Murstein 
fence (1962) as well as the earlier work 
reversa] Son (1958b). The explanation of the 
Present in the projection of n-Ach lies in the 
range ara of cards representing the entire 
încludin, achievement expression and thus 
achievers a medium-structured subset of 
Which i tigi cards. It is these cards 
Steen may accounted for the differences 
f cards ; e groups. The haphazard selection 
Pon oma. earlier studies may well have hit 
NAc s either too low structured to elicit 
high : Or too highly structured so that both 
Tespon, low achievement-motivated persons 
fectiven a to the omnipresent cues. The ef- 
ferentia. Of the low-structured cards in dif- 
S, ating all of the groups (“prompt” -1, 
SUggest, from the “late” .1 group further 
“fee “iy that these cards may be especially 
Motiva, when dealing with unusually low- 
= ae Persons who reveal themselves by 
Ach mely low rate of manifestation of 
a edium stimulus-pull cards probably 
Pe largest differences between the 
6 =e because they possess sufficient 
Tigger off an achievement response 
ly achievement-oriented persons, 
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TABLE 4 


I VALUES OF n-ACH BETWEEN “PROMPT” .1 Group 
.5 Group, .9 Group AND “LATE” .10 Group 
FOR TOTAL Set or Carps anD Low, 
MEDIUM AND HIGH SUBSETS 


Group comparisons Total Low Medium High 
“Prompt” .1 vs “Late” .1 97  1.99* 1.55 02 
Svs “Late” .1 2.55% 2.69% 3.95% 59 
9 vs “Late” .1 1.25 2.23** 1.30 81 

*p <.05. 

p <.01 
but not so much that the low-motivated 


person is forced to respond out of respect for 
the reality demands of the picture. The find- 
ing that the medium and low cards gen- 
erally proved more effective than the high- 
structured cards leads to the question of what 
is meant by medium stimulus pull for achieve- 
ment. The view advanced here is that 
medium pull for achievement refers to the 
fact that a card elicits a considerable but 
not preponderant number of themes reflecting 
n-Ach. Such a card is 14 of the TAT which 
is described by Murray (1943) as “The 
silhouette of a man (or woman) against a 
bright window. The rest of the picture is 
totally black [p. 20].” This picture proved 
to be the only picture which by itself sig- 
nificantly differentiated the .5 group from the 
others. It is a picture which elicits themes 
of achievement with some frequency, but not 
exclusively so, for themes of escape, depres- 
sion, and suicide are quite frequent. Another 
characteristic of a medium stimulus-pull card 
is that it often clearly depicts who is in the 
picture, but remains ambiguous as to the 
actions of the characters or what is going 
on. Current work is being done on these 
questions and will be reported on in future 
articles. 

When attention is turned to the task of 
explaining the failure of the arithmetic tasks 
to differentiate the groups, the task becomes 
more formidable. Possibly, the groups were 
differentially aroused by the varying proba- 
bilities of winning a reward and reflected this 
differential arousal state in their manifesta- 
tion of n-Ach. When, however, they convened 
2 days later, their motivation may have 
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elapsed as a function of time, or, they may 
have become bored with the simple nature of 
the problems and their great number (600). 
If the theory is affected by this time delay 
and/or the fact that a slight difference in 
complexity existed between the simple arith- 
metic problems presented by Atkinson and 
the even simpler ones which were utilized in 
the present study, then the theory is in need 
of greater elaboration before it can be more 
usefully employed in the study of motivation. 
Yet another possibility is that a multi- 
incentive condition as the present one, arous- 
ing both achievement motivation and the de- 
sire to acquire a small monetary prize, may 
have served to depress the achievement incen- 
tive for persons high in the achievement 
motive, a possibility supported by Atkinson 
and Reitman (1958). These authors noted 
a decline for high »-Ach men in attempted 
arithmetic problems from an achievement- 
orientation condition to a multi-incentive one 
in which an additional cash prize was offered. 
They also, however, showed an increase in 
arithmetic performance for low n-Ach men 
under these conditions. Since low -Ach men 
are held to be particularly sensitive to ex- 
trinsic rewards such as money (McClelland, 
1961), it would then follow that the .9 group 
should have manifested the best arithmetic 
performance, but this was not the case. 
Rather, no significant differences were found 
between any of the groups on arithmetic 
performance. 

Next, we are confronted with the problem 
of accounting for the positive correlation for 
the .90 group between achievement imagery 
and arithmetic performance (r= .48), and 
the absence of a significant correlation for the 
other groups. In an earlier study (Murstein & 
Collier, 1962) we obtained a similar positive 
correlation, but begged off offering an expla- 
nation since the result was so unexpected. 
The repetition of this earlier finding no longer 
permits this luxury. , = 

It should be noted that this finding is con- 
trary to what would be expected according 
to Atkinson’s theory. The theory suggests 
that performance and fantasy should be 
positively correlated at the point of highest 

achievement motivation, that is, .5. McClel- 
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land confirms this view (1961), showing his 
highest positive correlations between #-Ach 
and arithmetic performance for a .5 group of 
Japanese and Brazilian subjects, whereas the 
.75 group produced all negative correlations, 
though these were low. R 

A possible explanation is that a situation 
where the probability of success is very great 
affects achievement imagery and arithmetic 
performance in the following way. First it 
imposes a “set” in manner of approach to 
both tasks which slightly reduces variability. 
This “set” is not necessarily one to strive 
for achievement, since the mean score of the 
.9 group is not high on either task relative 
to the other groups. Rather, the prospect of 
success strikes the group in different ways, 
some individuals presumably being highly 
motivated for both tasks, whereas others are 
not greatly motivated in their approach to 
both tasks. The important fact is that the 
high probability of success does have co? 
siderable impact with regard to both tasks. 

Next, we may ask whether the results were 
influenced by the length of our test. Reitman 
and Atkinson (1958) and the present author 
(1963) have indicated that the use a 
large numbers of cards may serve toO mas 
significant findings that are present when 
only the first few cards are consideret. 
Reitman and Atkinson, for example, fou” 
that m-Ach scores to the first four of & Se 7 
of eight cards predicted differences in peg 
formance on arithmetic tasks more succe 
fully than when all eight cards were coa 
sidered. Whether this effect occurs fro™ E 
drainage of n-Ach on the first four cards E 
whether, in the desire not to be rep sagt 
persons expressing achievement themes yore 
early cards avoid doing so on the later ("+ 
is a moot question. In any event, iS” se 
possible that the lack of correlation be mic 
the achievement imagery and arit 
scores for the .5 group was affected aoe 
fact that we used nine cards instead °t js 
This possiblity seems unlikely whe? Pe 
recalled that the .9 group showed 4 ong 
cant correlation despite the use of the 
series. 

We may further ask whether the SUP 
ity of the medium set accrued 


ries 


erior” 
the 
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ELATIONSHIP OF REWARD EXPECTANCY TO ACHIEVEMENT PERFORMANCE 


pire circumstance that the medium set 
agit earlier on the series than the other 
aa A will be recalled, however, that the 
Sy cards arrived at stipulated that each 
Cites ae be represented only once in any 
inne three cards. The order of cards 
ET ae to stimulus value was in fact H, M, 
an A , H, H, M, L. The first four cards 
ae! Fi neti by only one medium card 
within i one was not the most effective 
Cadet T medium subset. It may be con- 
stüdy erefore, that the findings of this 
= were not seriously affected by the 
a of the test. 

Peia a dhes the fact that we have found 
taking ie rmation of the theory of risk 
rmation avior, but also partial lack of con- 
al revien indicate that the theory is in need 
justifab] n? Perhaps so, but before one may 
Pear ne 7 reach this conclusion, it would ap- 
certain sasany: to more rigorously control 

conditions which this experiment 


clear] * i 
ates Y indicates may affect the results. These 


te stimulus properties of the cards 
ferent ri assessed, for it appears that dif- 
Brees of ne of structure have varying de- 

W on ility to differentiate those high and 

n-Ach, 

iea S, number and intensity of motives 
exą Hie. ould be controlled. This study, for 
and atte that achievement imagery 
cach oth metic performance are related to 
ity er most strongly when the probabil- 
Whereas -onetary gain is quite high, (.9), 
ha Se st i moderate possibility of gain (.5) 
Performance effect on imagery, but not on 
a the performance task needs to be 

ently assessed in terms of its arousal 


indepe 
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of the achievement motive for a particular 
sample. McClelland’s work (1961) illustrates 
that arithmetic examples having high arousal 
properties for one national group may possess 
little incentive for other nationalities. So too, 
in the’ present experiment, it is tenable to 
believe that the results obtained, which differ 
from those of Atkinson (1958), are due to a 
difference in the difficulties of the problems 
which, accordingly, affected their motivation 


educing properties. 
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EXTRAEXPERIMENTAL EFFECTS IN VERBAL 
CONDITIONING * 


WILLIAM R. WIMSATT 


Minneapolis Rehabilitation Center 


AND NORRIS D. VESTRE 


University of Minnesota 


74 male psychiatric patients, selected according to criteria thought to be 
related to conditionability, participated in experimental sessions in which they 
received the verbal reinforcer “good.” 1 group was reinforced for responding 
to MMPI Si items in the scored direction, 1 group was reinforced for respond- 
ing in the nonscored direction, and a 3rd group received no reinforcement. A 
significant conditioning effect was found but no extra-experimental or general- 
ization effects were demonstrated either in respect to a correlated self-admin- 
istered temperament scale or in respect to withdrawal symptoms as rated on 
a correlated behavior scale. The practicality of employing verbal reinforcement 
as a means to effect changes in basic personality variables is not supported, nor 
is its interpretation as an “automatic strengthening” phenomenon sustained. 


Several investigators in the area of verbal 
conditioning have been intrigued by the pos- 
sibility that this phenomenon, properly under- 
stood, might come to serve at least in part as 
an explanatory principle in accounting for 
benefits said to result from psychotherapy. 
Verbal conditioning studies to date, how- 
ever, as reviewed most lately by Salzinger 
(1959), have focused primarily on the prob- 
lem of demonstrating the existence of the 
phenomenon itself and establishing the con- 
ditions under which it will appear; such 
studies have thus, for the most part, con- 
fined themselves to considerations of direct 
effects observable within the conditioning 
trials. 

It is clear, however, that the proper evalua- 
tion of any procedure which is to be called 
therapeutic must ultimately involve an ac- 
count of the patient’s activity outside of the 
treatment sessions. Likewise, if verbal con- 
ditioning techniques are to be shown anal- 
ogous to, or argued to be in principle the 
same as psychotherapeutic techniques, it is 
essential that behavioral changes occurring 
outside the conditioning sessions be demon- 
strated. 

While the relatively few studies which have 
been conducted recently in an effort to clarify 
the area of extra-experimental effects in verbal 
conditioning have been perplexing in a num- 


1Data for this study were collected while both 
authors were at the Veterans Administration Hos- 


pital, St. Cloud, Minnesota. 


ber of ways, there seems to be fair agreement 
at least in regard to the conditionability © 
some response categories based on personality 
dimensions. Oakes and Droge (1960) have 
reported a successful attempt to condition 
responses to Social Introversion (S$) scale 
items on the Minnesota Multiphasic Peson- 
ality Inventory (MMPI) using the verba 
reinforcer “good,” while Nuthmann (1957) 
was able to influence responses to a self-ac- 
ceptance inventory and Hildum and promi 
(1956) were able to manipulate respons” 
to an attitude questionnaire with the ne 
reinforcer. Thus it seems well established rin 
at least some conceptual categories, 


defined ! 
° n . añ 
terms of certain personality variables, © 


be identified as response classes when ap- 


propriately reinforced. ae 
The question is still very much at E 
however, as to whether personality varia ai 
themselves, as manifested in other ee 
and as measured by other devices than t 
involved in the conditioning routine, C4” 
altered by verbal reinforcement. T 
say, does being able to influence tk 
recorded on some self-report scale by pat 
of verbal conditioning techniques imply 
the trait upon which the subject is assu 
to be reporting is itself influenced by 
affirmative answer of sorts is PrOViC™.. ion 
Singer (1961), who was able 
prodemocratic responses to the = 
F scale items and was then able to sho E 


z ifornið 
scores later obtained on the Califo 
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agi Aieten. This generalization effect 
that gro rved, however, only in the case of 
the i mate E mether of which completed 
in the a ile the experimenter remained 
ae conclusion, on the other hand, 
formed ag Fs follow from an experiment per- 
the pk ogers (1960). While successful in 
in what rock of self-reference statements 
Roges í e calls a “quasi-therapy” setting, 
adju pale the scores on two postsession 
Scale an F scales (Taylor Manifest Anxiety 
Þarently Q Sort Adjustment Test) to be ap- 
E ea sn likewise, frequency of 
perimental ce statements outside of the ex- 
increased, sessions was not significantly 
ba ee with Singer (1961) that the ef- 
Confined e reinforcing verbal stimuli are not 
mann fi the conditioning sessions are Ull- 
teinfotced: “eon and Collins (1961), who 
Perimental emotional words” in four ex- 
rating A sessions and used a behavior 
cale ark the Palo Alto Group Therapy 
eir ‘Cond ependent measure. Unfortunately, 
Statistical UsIONS: are based on an incomplete 
along ee A reanalysis of their data 
Present e generally accepted lines by the 
differen authors failed to reveal significant 
the er w the dependent variable, though 
direction d differences were in the “right” 
paS 
questa? (1961) raises another issue when he 
tioning the adequacy of operant condi- 
taine Enei to describe the data ob- 
wBests io conditioning studies, and 
*ctually b a subjects who condition may 
Sets fou e operating under self-instructional 
the mee on their hypotheses about what 
that et to be doing. The possibility 
Saki al conditioning is best conceived 
formation of problem solving or concept 
While i phenomenon is interesting, but 
a atten ety concerns himself mostly with 
e Pts to demonstrate awareness, OT 


Droy, ack of it, on the part of subjects who 
ie ee in various studies, it 
A ge here is better evidence upon which 
a the issue. The confirmation or dis- 


mat; 

$ a eos 

“tra R of the possibility of obtaining 
rimental effects is important in this 
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regard, it would seem, since confirmation 
would support an “automatic strengthening” 
interpretation of the phenomenon while non- 
confirmation would suggest an “hypothesis 
formation” interpretation. 

The experiment here described was de- 
signed to investigate possible extra-experi- 
mental effects of verbal conditioning tech- 
niques applied to hospitalized psychiatric 
patients. In particular it was intended to test 
whether the selective reinforcement of re- 
sponses to items of the Si scale of the MMPI 
can be shown to affect scores on a correlated 
self-report inventory or to affect withdrawal 
behavior as reported on a ward rating ques- 
tionnaire also known to be correlated with Si. 


METHOD 


Subjects 


The subjects w 
Cloud, Minnesota, 


ere 74 male patients from the St. 
Veteran’s Administration Hospital 
who were selected to meet the following criteria: 
(a) 55 years of age or less; (b) known to be 
literate; (c) not suffering any known impairment of 
the central nervous system; (d) at least of average 
intelligence by the most recent available test results; 
and (e) not receiving unusually large drug dosages. 
“Unusually large” was defined as a daily dose greater 
than 600 milligrams of chlorpromazine (Thorazine) 
or the equivalent in other drugs. The foregoing 
a were applied in order to obtain a group 
ould probably be conditionable. 
ears and the 


criteri: 
which it was felt wi 
The mean age of the sample was 36 y 


mean length of current hospitalization was 5.5 
months. With respect to diagnosis, the sample in- 
ders (65%), 4 other 


cluded 48 schizophrenic disor 
(5%), 17 neurotic disorders 


psychotic disorders 
trait or pattern dis- 


(23%), and 5 personality 
turbances (79%). 


Procedure 
One hundred 3 X 5 cards were prepared upon 70 
of which were typed items scored on the Si scale 
of the MMPI. The remaining 30 were filler items 
also taken from the MMPI but not scored on 
any scale; these were included to disguise the 
character of the items as a whole and to attenuate 
the pattern of reinforcement for those experimental 
to have a high initial 


subjects who happened ; S 
operant level for responses reinforced in the group 


to which they were assigned. , 
The patients were randomly assigned to three 


Experimental I, Experimental II, and Con- 
participated in two experimental 
d 2 days apart. In each session a 


items. 
items randoml. 
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items. The cards were presented one at a time by 
the experimenter and appeared in the same order 
for every subject. Each response to an Si item 
was recorded. A typical session lasted about 15 
minutes. 

In the case of those subjects assigned to the 
Experimental I group, each Si item read and fol- 
lowed by a response in the scored direction was 
reinforced by the experimenter’s “good,” uttered 
jn as encouraging a manner as possible. No other 
items, Si or filler, were so reinforced. Experimental 
IL was treated just as Experimental I with the ex- 
ception that only those responses which were 
opposite to the scored direction with respect to 
the MMPI Si scale were reinforced. No responses 
were reinforced for members of the control group. 
Thus, Experimental I subjects were reinforced for 
“introverted” responses, Experimental II for “ex- 
troverted” responses, and control subjects received 
no reinforcement. All subjects were seen by the 
same experimenter. 

Immediately following the second experimental 
session each subject was required to complete a self- 
administering survey form which contained all the 
items scored on the scales G, E, S, and A of the 
Guilford-Zimmerman Temperament Survey (GZTS) 
(Guilford & Zimmerman, 1949). These scales have 
been shown to be significantly correlated with Si, 
at least in a normal male population (Miller & 
Cottle, 1955). Subjects completed this form seated 
at a table in a room separate from the experimenter’s 
office, where the experimental sessions took place, 
and in the presence only of other patients similarly 
occupied. 

Preceding the first experimental session and fol- 
lowing the second, each subject was rated on the 
Withdrawal scale of the Psychotic Reaction Pro- 
file (PRP) (Lorr, 1961) by a ward attendant. In 
addition, a follow-up rating 1 week after the second 
session was obtained in order to provide for the 
possibility that some passage of time might be 
required in order to permit the occurrence of in- 
stances wherein raters could observe changes of 
behavior. 


RESULTS 


Table 1 shows the mean Si scores obtained 
during the experimental sessions, the mean 
total scores on the four GZTS scales, and 
also the mean score on the S scale of the 


TABLE 1 


Group MEANS FOR EXPERIMENTALLY OBTAINED Si 
AND FOR POSTEXPERIMENTAL GZTS Scores 


Experi- 
Scales Experimental I Control mental II 
3 26.08 
PI Si (N =74) 29.83 28.50 l 
eas S (N =65) 17.77 17.83 17.38 
GZTS total (N =65) 69.67 65.67 68.76 
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TABLE 2 


ANALYSIS OF COVARIANCE FOR Si SCORES ADJUSTED 
ON PRE-EXPERIMENTAL Si SCORES 


Adjusted 
Source df MS F 
Between 2 224.33 3.63* 
Within 58 61.88 
*p <.05. 


GZTS for the two experimental groups and 
for the control group. The reduced number 
of cases associated with the GZTS scales 15 
due to the exclusion of nine cases whose scores 
were invalidated by excessive responding an 
the “can’t say” category. 
An analysis of variance performed on all 
74 cases for differences between groups W! 
respect to Si failed to yield a significant 
ratio. For 62 members of the original sample, 
however, pre-experimental Si scores were of 
tained from hospital records for use M 
covariance analysis. The mean duration be- 
tween these pre-experimental and the re 
perimental obtained Si scores was 3.8 mont Si 
Table 2 shows the results of an analysis ° 
this latter data by the covariance men 
with adjustment made for pre-experime? 
Si differences. da 
While neither the Control group-EXPC' 
mental I or the Control group-Experime? 
II differences were significant, a ¢ of ihe 
(Lindquist, 1953) was obtained !0F o 
difference between the adjusted means ig 
Experimental group I and Experi’ n 
group II, indicating the presence of the 
ditioning effect. 


P p were 

As is seen, while the three groups cores: 
ordered as expected with respect to 9? agni- 
and a significant F was obtained, the was 


tude of the differences between gtoUP “pi 
not very large. However, the expert avin 
procedures were such that for subjects ses t° 
a low operant level initially for tesPOP* ere 
be reinforced in the group to which toe ight 
assigned, many items in the first sesso gforce 
be read and responded to before the Te" ping 
ment could begin to be applied. ed in 
thus, a further analysis was pa alf of 
volving only the last 15 items (Jas 
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zA : second treatment session). For these 
level) bs significant F was obtained (.01 
Sec aE differences between groups, and a 
the some of the Experimental I group with 
Lee el group yield a agpificant £ (O5 
lne S hile the Experimental II group made 
in this i responses than the control group 
ater as in the preceding analysis 
ota all Si items, the difference was still 
P a significant. 
tien oe analysis for differences be- 
levalas ee to the GZTS data 
total Ca, nonsignificant results both for the 
Scales ad (F = .80) for the four GZTS 
= 57) igi and for the S scale (F 
analyze a en separately. The S scale was 
for ‘this separately because it was found that 
T aE the correlation of S with 
Was the imental Si was higher (—.56) than 
the Pi ine of pre-experimental Si with 
tered cee for the four scales adminis- 
Scale sco; 49). The correlation between S 
experiment, and Si scores recorded during the 
idence A sessions was — .83, the .99 con- 
~912 ae about which coefficient being 
cients Pig — 684. These correlation coeffi- 
‘ eae different from zero beyond the 
differen idence level, and in addition, the 
Si ce between the two correlations of S 
fone a CR of 2.83, p < .0l. 
PRp wit analyses were performed on the 
Preeapers aval scale scores adjusted on the 
Spect o al withdrawal scale scores. In 
chance dif this behavior rating scale, no non- 
st poste ferences are exhibited either on the 
F ratio experimental rating or on the second. 
b tained = 99 and .36, respectively, were 
Y one es the two ratings. An earlier study 
Patient of us (NDV) on 78 schizophrenic 
Si a} > Yielded a correlation of .40 between 


and 
the PRP Withdrawal scale. 


DISCUSSION 


Sin 
i monstra extraexperimental effects were 
5 tament ed, it would appear that no tem- 
'MfAuenceg O7 Personality factor was actually 
“dure €d by the verbal conditioning pro- 
cores | The elevation or depression of Si 
it o E this conditioning technique is then 
Suci e psychological consequence than 
€vation or depression were produced 
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by a mischievous test scorer, and it seems 
in effect as if the thermometer has been 
tampered with while the furnace remains 
untouched. One interpretation is merely that 
the generalization gradient is very steep; but 
perhaps more intuitively appealing is the 
interpretation that, regardless of the instruc- 
tions originally given the subject, the verbal 
conditioning sessions are actually to be con- 
sidered a type of problem solving task in 
which the subject learned how to make the 
experimenter say “good.” The failure of such 
problem solving behavior to persist in other 
situations is, thus, less surprising. 

Since the correlation between GZTS S scale 
scores and Si scores recorded during the ex- 
periment was so strong, we were surprised at 
first to have obtained such a small F for 
the S scale scores while a significant F was 
obtained for differences in Si. The explanation 
for this apparent paradox is the very steep 
regression line for S on Si. 

The significant difference between 7’s for 
S scale—Si (pre-experimental ) and S scale— 
Si (obtained during experiment) seems 
similar to differences Singer (1961) found 
between pre- and during-experiment correla- 
tions and which he suggested might be ex- 
plained by supposing his subjects to have 
learned during the experiment the concept 
theoretically connecting the E and F scales. 
However, the significant difference between 
ps here obtained might be more simply ex- 
plained by the fact that, while the second 
correlation of S and Si was computed on 
measures taken within 3 days of one another, 
a lapse of about 4 months, for the typical 
subject, occurred between the registering of 
the pre-experimental Si scores and the admin- 
istration of the GZTS scales. Thus, any 
changes in respect to these variables occurring 
within the intervening months would tend 
to magnify the difference between the cor- 
relations. 

Finally, there remains, of course, the pos- 
sibility that the conditioning effect achieved 
was of such low strength that real extra- 
experimental concomitants went undetected 
by reason of their being of such slight magni- 
tude. Perhaps more conditioning trials could 
have increased the effect to an extent that 
a generalization effect would have appeared. 
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COUNSELOR ATTITUDES AND CLIENT BEHAVIOR‘ 


IRENE E. WASKOW 
Psychopharmacology Service Center, National Institute of Mental Health 


The hypothesis of this study was: Degree of counselor acceptance, interest, 
nonjudgmentalness, and expressiveness is directly related to the degree to 
which the client, fairly early in counseling, both discusses and expresses his 
feelings. Ratings were made on the basis of interview material. Counselor 
attitudes were rated in 3 different conditions: reading transcripts, listening 
to tape recordings, and listening to filtered tapes (with intelligibility elimi- 
nated). Counselor attitudes that were reliably rated were related to the client 
measures by means of partial correlations. Significant (p < .05) relationships 
were found between counselor judgmentalness, in all rating conditions, and 
client discussion of feelings. These relationships are in a direction opposite 
to that predicted, and attempts are made to clarify this finding. 


cami z 
ne. E i : + 
8, fairly early in a series of counseling 


ika the counselor accepts him as a 
nd to feel t he is “safe” to say, to think, 
Sible jn he ings that have not been permis- 
i eo More specifically, the study 
ies Ao: relationship of certain counsclor 
ms reflect aspects of client behavior that 
"uation į t such learning that the counseling 
Selor atita safe.” Ratings of both the coun- 
on oe udes and client behavior are based 
Sions? ¢ recordings of actual counseling ses- 
the area n the basis of theoretical writings in 
of Several of counseling and psychotherapy and 
Were d i pilot studies, four counselor attitudes 
and as a as both relevant to this problem 
hie ce E rateable : the counselor’s ac- 
Si j ‘tetas client, his interest in the client, 
Ree j ae talness, and his expressiveness. 

€med that vocal dimensions might 


Cuses 
îttituq 


might 


Loy 


ot llsihyas 
dug holog Bation was conducted at the Laboratory 
8 th on National Institute of Mental Health, 
om t os of a Post-doctoral Research Fellow- 
ing âtion fo ational Institute of Mental Health. 
8 the Gave their constant help and encouragement 
> Allen ce this project is expressed to David 
iS also ittmann, and Paul Bergman. The 
of te” of Tite ee to Thomas Ewing of the Uni- 
verte Counseling Service and to John Butler 
a ee Chicago Counseling Center and 
the M ccoberation in this study. 
Sesion erms “client,” “counselor,” and 
the ns” are used throughout this report, 
Watigng “ature both of the clients’ problems 
«Se in trent de. of the counseling methods used 
hery, \teractig mple (to be discussed later) renders 
Þistg» = ns similar to those of “patients” and 
Psychotherapy.” 


be used by the client as cues to the counselor’s 
attitudes, these attitudes were rated on the 
basis of filtered tape recordings (discussed 
below), as well as unfiltered tapes and tran- 
scripts. The two dimensions of client behavior 
selected as possible signs of the client’s achiev- 
ing some feeling of “safety” in the counseling 
setting were the extent to which the client 
deals with (i.e., discusses) his feelings, and the 
amount of feeling he actually expresses. 

Thus, the specific hypotheses of this study 
are as follows: the degree of counselor accept- 
ance, interest, nonjudgmentalness, and expres- 
siveness will be directly related to the degree 
to which the client, fairly early in the counsel- 
ing sessions, both discusses his own feelings 
and actually expresses his feelings. Predictions 
for the first three counselor attitudes can be 
easily made on the basis of a broad range of 
theoretical writings on counseling and psycho- 
therapy (e.g, Fenichel, 1941; Menninger, 
1958; Rogers, 1951, 1957). Although authors 
of different orientations may use different 
terminology—e-g-, Fenichel’s “tolerance” and 
Rogers’ ‘unconditional positive regard’ — 
they generally agree on the importance of 
these attitudes. The prediction in regard to 
the fourth counselor characteristic, expres- 
siveness, is more equivocal. The use of this 
variable is based partly on the recent work of 
Butler’s group’, on the expressiveness of the 
therapist’s speech. Since the present study is 
Alice K. Wagstaff, and J. M. Butler, 


Classification System for 
Chicago Counseling 


3 Laura N. Rice, ; 
unpublished manuscript, C 
Therapist Responses, Univ. 
Center, January, 1960. 
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of an exploratory nature, any direction of 
relationship between the counselor attitudes 
and client behavior variables will, of course, 
be of interest. 


METHOD 


Materials Used for Ratings 


The basic material utilized for testing the hypotheses 
of this study consisted of tape recordings of 28 counsel- 
ing interviews, two for each of 14 client-counselor pairs. 
For each pair, the first interview was included, as well 
as one interview between the twentieth and thirtieth. 
Also available for most cases was the second interview 
and an additional interview between the twentieth and 


thirtieth. This interview material was obtained from ~ 


two university counseling centers. The counselors in- 
cluded nine males and five females. There were no 
requirements in the request for tapes regarding amount 
of experience or other counselor qualifications. The aim 
was to obtain representative counseling interactions. 
The counselors were regular employees of these centers 
and the counselor sample covered a wide range of 
experience. The orientations of the counselors were 
varied, including three or four who might be considered 
“client-centered,’”’ but more who were generally eclectic 
in their procedures. Most of the clients were under- 
graduate students. They presented a variety of prob- 
lems, largely neurotic in nature. All of them dealt in 
their interviews with general problems in adjustment, 
rather than merely focusing on educational or voca- 
tional difficulties. The fact that a minimum of 20 inter- 
views was required by the original specifications in the 
request for tapes to be used in the study also set a mini- 
mal level in terms of the depth or intensity of the 
counseling experience. There are undoubtedly other 
effects of this requirement, including a restriction of the 
client sample to a group of moderately successful cases, 
since many “failure” cases ordinarily drop out before 
this point. 


Rating Counselor Altitudes 


Excerpts of counselor speech were selected from either 
the first or second interview of each of the counseling 
cases. The first interview (or second, in the five cases 
for which there was poor audibility or too few usable 
segments in the first interview) was used in order to 
eliminate, as far as possible, the effect of the client on 
the counselor’s attitudes. The assumption was made, 
however, that these attitudes and characteristics would 
be fairly constant throughout the early interviews. 
Four 3-response units were selected for each counselor, 
and these were transferred to new tapes with all client 
sounds omitted. The four 3-response units selected for 
each counselor were the first three Tesponses after 5 
minutes, 15 minutes, 25 minutes, and 35 minutes that 

met or could be cut to meet the following requirements: 
(a) no response shorter than five words or longer than 45 
seconds, (b) no 3-response unit longer than 1.5 minutes, 
(c) no 3-response unit shorter than 15 seconds. 

These requirements assured that the rating task 
would not be too long (1.5 to 2 hours) and that there was 
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enough material (as indicated in pilot work) on yess 
to base each rating. The 56 segments (4 for each of 1 

counselors) were arranged in a random order with the 
restrictions that there be no two consecutive responses 
of the same counselor and that each counselor be repre- 


sented once in each 14 consecutive segments. f 
Ratings were made in three different experi 
conditions: (a) listening to tape recordings © 


mental 
f the 


bat re- 
counselor responses, (b) listening to filtered peck 
cordings, and (c) reading transcripts. Before sta 


erence, 


to rate, and in order to establish a frame of referent 


the raters heard (or read) the first 14 segments, 
were then reheard (or reread) as the rating ta 
For the filtered condition, Soskin’s technique 
& Kaufiman, 1960) was utilized and frequenci 
450 cps were either eliminated or greatly attenu 


whic 

sk began. 
(Soskin 
ies above 
ated. 


" ps as hile mos 
The speech was thus rendered unintelligible, w nt uae 
voice quality characteristics were retained. This 


evera 


Š 2 in s 
nique had already proved its usefulness 1n 563 
studies of content-free speech (Starkweather, 9508 


1956b). In the present study the ratings ba! 
tapes could be taken as measuring only that W. 
conveyed by the vocal characteristics of t 
speech, To assure the use of verbal as well a: i 
in the unfiltered tape condition, raters had trans 
before them as they listened to the tape. 

Before beginning the study, 
definitions of the characteristics to be rat 
description of the rating system to be used. 
tions, the same for raters in all three con 
reproduced below. The rating system had b 
in pilot studies to result in fairly normal dist 


sed on filtere. 

hich p 
he counselor $ 
s vocal cues 
ripts 


jed the 

aters studie! 
the rai ed and # 
The defint 
ditions, af 
een foun! 
ributions 


of ratings. 
DES 
DEFINITIONS OF COUNSELOR ATTITU 
Accepting Rejecting rent 8 
7 jient | 
Accepts the client asa per- Is critical of ny bo as in- 
son; has respect for and a person; treats ea ded 
values him; recognizes his ferior, to be d; is co de- 
worth as an individual, re- or manipulated; 
gardless of limitations. scending. 
jve 
Expressive Nonexpress™ jife 
“Blah”; seems dull, 


(As popularly defined) has 

life and color; is variable; 

does not seem dull. 
Judgmental 


Seems evaluative; sets self 


less, little variability: 


; I 
Nonjudgmeh!” ag. 


on) 
Seems neutral, P? cts 


5 ives" age 
up as the judge rather than ing, nonevalua ral udg 
client as judge; seems to if client is t 
push own views as correct 
ones; acts as if he knows 
the answers. 

; ut 
Interested pa, no E 
: red; i 
(As popularly defined) con- Apathetic, rene 3 a of 
cerned; alert, attending ticularly COpat’s goin 
to and interested in the volved in W 


client’s communications. 
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SAMPLE 
MPLE RATING SCALE AND INSTRUCTIONS 
Rejecting 


Accepting 
1 2 3 
V 
Nonexpressive Expressive 
1 2 3 
Nonjudgmental Judgmental 
1 2 3 
Interested Indifferent 
k i 2 3 
maki, . 
Correspon ioc rating, please circle the number that 
- Counselor se © your judgment, e.g., if you feel the 
or seems accepting, circle , rejecting O, 


t e ae 
Or the Position between accepting and rejecting 
catenin do not clearly belong in one of 
d Y, eg » an arrow may be drawn to another 
ee ; acceptin, response that is somewhat, but not 
f Aa Pile UD you ng may be rated 2—(8). Please try not 
° spread the T ratings at any one point, but rather try 
O a through the range of 7 possible ratings: 

mr 2s 
Beton ¢, O O 0H +O: O 
tj ers qraracteristics were rated in different ran- 
aska Dt of 3 re ifferent 3-response units. After each 
ed to make Sponses was read or heard, the rater was 
attin om! rating for the 3 responses on each 
| E found “imensions. He was given as much 
You Mest that th Necessary, The instructions included 
pyre the clie € rater “try to make the ratings as if 
Brady, Taters nee to whom the counselor is speaking.” 
| ieee es (research ee students and recent college 
ational Į assistants and normal control sub- 
Ophist: -| nStitutes of Health projects), without 


fae 
bmilan hoped Te about counseling or psychotherapy. 


ai 
x 8toup, 


Analyses 
(ty ach” Can rating: based on the averages of the six 
wet 190 Mselor ga lie, mean of 4 separate ratings) 
igi for.) Were eliabilities of these average ratings 
ten a numha a level acceptable for exploratory 
ing dts jlness r E of the characteristics: Filtered: 
4 entaln -76, expressiveness, r = .64; Tran- 
tigget .66- te r = .81, acceptance, r = .70, 
te? = o2 Unfiltered: judgmentalness, r = .64, 
s ratio essiveness, r = .61. Only these 
ngs were then related to the meas- 
avior, 
thy fou, intercon igs of the different characteristics 
jytte tad p- cOrrelated at can 
a tati between roe Strong relationships (r>.60) 
‘een NS Con iti interest and expressiveness in all 
tanga in Ta between expressiveness and 
© anq int € filtered condition, and between 
€rest and between acceptance and 


Rta C ayy, e 
Mere also stage 
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judgmentalness (the last, a negative relationship) in the 
transcript condition. Although these correlations prob- 
ably reflect actual relationships among the four char- 
acteristics rated, they undoubtedly are spuriously high 
because all four characteristics were rated at the same 
time. Intercorrelations among ratings in the three 
conditions for each of the characteristics revealed only 
three strong relationships: for judgmentalness, between 
filtered and unfiltered ratings, and between unfiltered 
and transcript (there was also some relationship— 
r = .49—between unfiltered and transcript ratings for 
judgmentalness) and for expressiveness, between un- 
filtered and filtered ratings. 


Rating Client Behavior 


Segments of client behavior were selected from both 
the first interview and some interview between the 
twentieth and thirtieth for each client. Segments of the 
first session were rated in order to get a base level on 
both client variables, before one could expect any 
changes to occur in response to the counselors’ attitudes. 
It was felt that such changes might be discernible at 
some point between the twentieth and thirtieth inter- 
views; for this reason the tapes obtained from counsel- 
ing centers had included the first session and one be- 
tween the twentieth and thirtieth. If two interviews 
during this later period were available, the one closer 
to the twenty-fifth was used. 

From each tape used, three 5-minute segments of 
client speech were extracted, one each from the first 15 
minutes, second 15 minutes, and third 15 minutes of the 
tape (always starting as close to the beginning of the 
15-minute period as possible). All counselor responses 
(except an occasional “Um-hmm” or one word or over- 
lap of two or three words that’ were almost impossible 
to omit) were removed, as was the first sentence of any 
client communication that was simply an answer to a 
direct counselor question. All pauses between client 
and counselor communications were omitted, as were 
all silences, after the first 5 seconds, within the client 
speech. For both rating methods, a rating was made 
after each 15 seconds of client speech, thus totalling 
60 ratings for each 15-minute segment. A separate tape 
was made for each group of three 5-minute segments of 
client speech and these were rated in a random order. 

For ratings of the extent to which the client is “deal- 
ing with his own feelings,” a tape of three S-minute 
segments of a client’s speech was heard continuously, 
and a score was made after each 15-second segment 
signifying whether the client was dealing mainly with 
his own feelings (V), not dealing mainly with his own 
feelings (—), oF equally dealing and not dealing with 
his own feelings (?). After the entire 15-minute tape 
was heard, it was replayed, and ratings were changed if 
necessary; this second group of ratings comprised the 
actual data used. Since one-half credit was given for the 
(?) category, the total scores could range from 1to 60, 
with half-step intervals. After some preliminary practice 
with the scale, a detailed list of instructions had been 

ed, including, €.g-: 
devo with feelings” means any type of dealing 
with them; that is, they may be merely discussed rather 
than expressed in tone of voice, etc.; this still counts as 
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dealing with feelings; what is important is that the 
client’s own feclings are the subject matter under 
consideration. As can be seen, the scores on this rating 
system are essentially a measure of the amount of time 
during which the client primarily discusses his own 
feelings. f n 
Again, in making ratings of the “amount of feeling 
expressed,” 15 minutes of client speech was heard 
continuously and ratings were made each 15 seconds. 
Here, ratings were based on a 3-category system: N 
for no feeling expressed, M for a moderate amount of 
feeling expressed, and I for an intense amount of feeling. 
Arrows could also be drawn from one category to the 
next, in a system similar to that used for rating coun- 
selor attitudes. Here, though, the ratings were converted 
to a 5-point scale (which, in pilot work, seemed the 
most reliable method), with =1; @—M and 
N—@ = 2; @ = 3; @—I and M-© = 4; and 
= 5. The 60 ratings for each client were then 
averaged. A short list of instructions was developed 
for use with this rating system, with major emphasis 
on ignoring content and concentrating on the amount 
of feeling actually being expressed. 

All 28 tapes (two for each of 14 clients) were rated 
by the author, according to the systems just outlined, 
and these ratings comprised the basic data on client 
behavior. To assess interrater reliability, a second rater 
rated 14 tapes (which included one of each client and 
seven early and seven late sessions, but which were 
otherwise randomly selected) on the first client variable, 
discussion of feelings (r = .93) and the remaining 14 
tapes on the second client variable, expression of feel- 
ings (r = .67). It should be mentioned that the .67 
reliability figure, although not as high as could be de- 
sired, is actually higher than one might expect from 
the narrow range of scores obtained by both raters. 

Correlations among the ratings of client material 
revealed a high relationship between early (first inter- 
view) and late (twentieth-thirtieth interview) ratings 
for each variable (r = .74 between early and late ratings 
of discussion of feelings and r = .68 between early and 
late ratings of expression of feelings) and negligible 
relationships between the ratings of the two variables 
(r = .26 for the early interview and r = .06 for the 
later interview). 

The procedure followed thus provided us with eight 
sets of reliable ratings of counselor characteristics in 
the first or second interview which could be related 
to ratings of two aspects of client behavior in the first 
interview and in an interview between the twentieth 
and thirtieth. 


RESULTS 


Sixteen partial correlations were computed, 
relating each of the eight counselor variables 
that had been reliably rated with each of the 
two client variables. The use of partial correla- 
tions allowed an analysis of the relationship 
between counselor attitude ratings and client 
behavior in the later interview, while control- 
ling for the effect of the initial level of client 
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TABLE 1 


PARTIAL CORRELATIONS BETWEEN COUNSELOR 
ATTITUDE AND CLIENT BEHAVIOR 
MEASURES 


Client behavior 


Expression 


Counselor Discussion m 
attitudes of feelings of feclings 
Filtered: e 
Judgmentalness .57* a l 
Expressiveness AS z i 
Unfiltered: m l 
Judgmentalness 74 Be 
Expressiveness —.03 10 
Interest —20 . 
Transcript: P 
Judgmentalness .59* i 
Interest .02 tA 
Acceptance —.58* = 
© p:-<..05.. 
*p <.01, 


jn 
: pe a ented 
behavior. The final results are prese 


Table 1. i. t result 
Since the only consistently significan rA 
were found for the judgmentalness 7 F 
the means and ranges of these rape is 
presented in Table 2. The mean ratings or the | 
variable are approximately the same difiet 
two groups of counselors (representing ribt 
ent counseling centers), although the d 0 
tions are somewhat different, wit? mnseloe 
treme ratings given to the Illinois co l 
especially in the unfiltered condition- 1 facto” 
In interpreting the results, es relatio” 
must be kept in mind: A number © je mi pt 
ships are being investigated and < peat 
expect at least one relationship tO a sel0 
the significance range by chance; tue ay 
attitude dimensions are not comple 
pendent and two related attitu 
have a similar relationship to 4 thi „Jor # 
the reliabilities of each of the CO eas 
tude measures and of the two clien? 
set upper limits on the degree © th 
ship to each other. : sign! 
Although four relationships 2T° "coun 08” 


ur 
one must note that only one of Jo! gto 
te ly 
la io? p 


| 
we! 
des may ple 


characteristics is consistently Te, 5 
pect of client behavior. There a0 gin, 
need for replication of this fin this, gg 
studies. One is struck, bowery a 
sistency of findings in regard to € 
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TABLE 2 
MEANS aw 
i oe RANGES oF JUDGMENTALNESS RATINGS 

N THE THREE RATING CONDITIONS 


Rating 
condition M Range 
Filtered 3.9 5 7 
i 3.0-5.6 
Unfiltered 4.0 2 9-5 3 
Transcript 3.8 1.8-5.3 


Mentalness y 
Pies ‘ated in all three conditions, this 
Client d e icantly related to the amount of 
ificant ie of feelings. The only other sig- 
Veen acce i is the negative relationship be- 
condition a as rated in the transcript 
Tatings of ae c ene discussion of feelings. Since 
relation hee have a high negative cor- 
transcript ¢ à ratings of judgmentalness in the 
ance to dis ondition, the relationship of accept- 
With as a cussion of feelings will not be dealt 
bete Separate finding of this study. 
Seconq Pe no significant findings for the 
Only ok lent variable, expression of feelings. 
Variable P on the correlations involving this 
the ran: (relating it to counselor interest in 
Cance, Te i ript condition) approaches signifi- 
si ificg R difficult to interpret the absence of 
*eliabilitne findings for this variable since its 
Neve Phe C = .67), in conjunction with the 
charg a high reliabilities of the counselor 
ihe tirn sets a fairly low upper limit to 
Ded a lons that could be obtained. It is 
o om future research a more reliable 
Maen client expression of feelings 
* ae thus more definitively re- 
en it elationship or lack of relationship 
The abs and counselor attitude variables. 
e Pepe of significant relationships be 
client ju, TPE counselor attitude measures 
b è piesa of feelings will similarly 
Ke with here. The fact that judg- 
mago Ost rdiet in all three rating conditions, 
Wien’ rec ly rated characteristic again 
Other possible its significant relationship 
dy, A order Re less 
be ber A rule out the possibility that the 
doy Ponsible f spoken by the counselor might 
ee, le Darti i for the major results obtained, 
hep ing t a" correlations were computed for 
taln S € relationship between mean judg- 
ratings for each of the 14 counselors 
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(as rated in each of the three conditions) and 
client (late interview) discussion of feelings, 
while controlling both for the initial level of 
client behavior and for the total number of 
words spoken by the counselor. The resulting 
correlations differ very little from the original 
ones; they are: filtered, 7 = .55; unfiltered, 
r = .76; transcript, 7 = .57. Thus, number of 
words is not a factor that need be considered 
in interpreting the findings of this study. 


DIscussIoN 


The direction of the relationship of counselor 
judgmentalness to the amount of client dis- 
cussion of feelings is, in all cases, opposite to 
the direction predicted, i.e., the amount of 
client discussion of feelings is greater when the 
counselor is rated as being more judgmental. 
This result is somewhat puzzling: regardless of 
orientation, many theorists would predict that 
a judgmental counselor would inhibit, rather 
than facilitate, the client’s discussion of feelings. 

One question that arises is whether some 
other counselor characteristic may be highly 
related both to ratings of judgmentalness and 
to the client’s increasing discussion of his 
feelings, and whether such a variable may be 
indirectly responsible for the results found. 
Two specific possibilities are: (a) that the more 
judgmental counselor is directly reinforcing 
the client’s talking about his feelings and (b) 
that the more judgmental counselor is also 
seen as being more “forceful,” and this factor 
may be responsible for the client’s increased 
talking about his feelings. Both of these pos- 
sibilities were briefly investigated, in an effort 
to get a better understanding of the present 
findings and in order to develop hypotheses 
for further research. 

A system was devised for rating the degree 
to which the counselor focuses on the client’s 
feelings, with ratings based on the transcript 
material. Each response in a 3-response coun- 
selor unit was categorized as to whether the 
counselor was focusing at all on the client’s 
feelings (F), not focusing on feelings (N), or 
whether this could not be determined (?). 
The three separate ratings for each unit were 
then systematically combined into one rating 
on a 7-point scale (e.g., one N and two ? = 3, 
three F = 7). The 56 counselor units were 
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rated in the same random order as had been 
previously used and the four ratings thus ob- 
tained for each counselor were averaged. The 
estimate of interrater reliability for these 14 
mean ratings was r = .91. The relationships of 
these ratings to transcript ratings of counselor 
judgmentalness (r = .02) and to the degree to 
which the client discusses his feelings (partial 
r = —.22) were, however, not significant. The 
degree to which the counselor focused on the 
client’s feelings seemed in no way to influence 
the relationship between the ratings of coun- 
selor judgmentalness and the client’s discussion 
of feelings (a double-partial correlation be- 
tween counselor judgmentalness and client 
discussion of feelings in the late interview, 
controlling for amount of counselor focusing 
on feelings as well as amount of client early 
discussion of feelings, yielded an r of .61). 
Thus, the present results cannot be interpreted 
as due to a direct reinforcement of the client’s 
discussion of feelings by the more judgmental 
counselors. 

The other question considered was whether 

a more forceful counselor might somehow 
facilitate a client’s discussion of feelings and 
whether such forcefulness might be signifi- 
cantly related to judgmentalness as rated in 
this study. In a recent study of the factorial 
structure of therapist responses (Howe & Pope, 
1961), one of the factors that emerged was 
labeled “‘precision-potency.” The five seman- 
tic-differential scales with highest loadings on 
this factor were defined at their end points by 
the adjectives colorless-colorful, still-vibrant, 
vague-precise (positive loadings) and energetic- 
inert and strong-weak (negative loadings). 
Since it seemed that these scales (with per- 
haps the exception of vague-precise) might be 
useful in trying to tap the general “forceful- 
ness” of a counselor’s responses, they were 
applied to the tape-recorded (unfiltered) 
material. 

Four raters made a global rating for each 
of the 56 3-response units on each of the five 
semantic differential scales. The general in- 
structions for use of the semantic differential 
scales were similar to those used in the Howe 
and Pope study. The reliabilities of the average 
ratings of the four raters for the 14 counselors 
were: strong-weak, r = .67; colorful-colorless, 
y = .70; energetic-inert, r = .88; vibrant-still, 
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r = .68; precise-vague, r = .84; and for a 
summed combination of the first four of these 
scales, r = .81. The interrelationships among 
these four scales were very high, ranging from 
r’s of .83 to .97, while the relationships of ea : 
of the four with precise-vague ranged from 7$ 
of .46 to .57. athe 

Although each of the potency factors ant ; 
combined scale significantly gitteret a 
among the 14 counselors (F ratios for a ae 
probabilities <.01), none of them we k 
nificantly related to counselor judgment ue 
(unfiltered tape condition) or to client ba 
sion of feelings. In addition, none of them a 
a significant effect on the relationship betw a 
counselor judgmentalness and client dios 
of feelings (as estimated by double-pa aa 
correlations, as described in the section 
counselor focusing on feelings). Thus ee bs 
tionship found between counselor judgm x 
ness and client discussion of feelings ares e- 
attributed to the judged forcefulness A ihe 
fined by these “potency” variables) ° 
counselor. ro 

There was no support for either of arf 
posed explanations of the judgmen f tiga 
results in these two supplementary I2 don 
tions. We have thus far, however, looke 
at the possibility that the judgm o 
results might have been spurious!y Prab es 
by the presence of other relevant wie ha 
One must also consider the possibility ant 
judgmentalness, as defined, was the ea was 
variable, but that the initial nye ental 
incorrect. It is conceivable that the J“ 5 sing 
counselor may, by the very act Of g this 
judgments, arouse feelings in the clien a 
is the case, however, it is surprising rae i 
was not even a tendency toward 4 E clic? 
relationship between judgmentalness argi? 
expression of feeling. Another oe that ™ 
against this interpretation is the fac iss 8 
of the feelings that the clients dis" gu 
directly related to the counselor oF t 
ing situation. 

In an attempt to find 
interpretation of the results, 1 i 
illuminating to look again at t finiti 
variable itself, at the specific j 
judgmentalness that was used. ; ecm Evjen! 
counselor is defined as one who ie f 
tive; sets self up as the judge rat 
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ae d Ea to push own views as correct 
Hes a S mu he knows the answers.” Many 
therapy = ae area of counseling and psycho- 
attitude 8 hold that an evaluative, judging 
pist cose p part of the counselor or thera- 
atmosphere hinder the establishing of a safe 
cuss his ae which the client can freely dis- 
judgmentaln ings. The present definition of 
is dimensi ess, however, includes not only 
he part of of evaluation and judgment on 
of assurance E counselor but also an element 
about, thet 5 hat he knows what he is talking 
tion. An d a Is the authority in this situa- 
Present te this latter component of the 
ortance; it ition that may be of central im- 
acts in this may be that the counselor who 
Security fae hig fashion provides some 
e client t n client—that it is easier for 
Who acts PR iscuss his feelings with someone 
~ as if he will be able to deal with them 
the jug Y, indeed, offer some solutions. Thus 
tea counselor, as seen in this light, 
BNR. cena who, by his own display of assur- 
feelings Te eS the client to discuss his 
‘search, t would seem important, in future 
n Comi 0 attempt to separate out the differ- 
IS use honis of judgmentalness, as the term 
essun ciè and to see if it is indeed this 
telateq t € component which is significantly 
his T a client’s increased discussion of 
Whiasok 
Tha ‘ie the assurance of the counselor 
Ynot oo the client to discuss his feelings, 
BLY toes — him with the freedom ac- 
Wwa O8 sessi lence these feelings in the coun- 
a Ons. One might note here that there 


e Bae Client’s expression of feelings. 

“on o ‘onship of judgmentalness to dis- 
tq Slt i oe may be seen as important 
Stan, ing t 908 assumes that for meaningful 
Dont talkin Out feelings to occur, one has to 
tha, Ant, h & about these feelings. It is im- 


tang ieee to consider the possibility 
Tgp US exp, on of feelings without simul- 
ty h Erk eriencing of them may not be of 

q peutic value. Of relevance here is a 


Cati: ; 
jute tmadlin, Jenney, & Shlien, 1960) indi- 
Teas lent of the client-centered counselor’s 
teq Success in therapy is significantly 
is rating of the degree to which 
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the client does “express his feelings,” rather 
than “talk about them.” 

The major purpose of the present research 
was to relate measures of counselor and client 
in-therapy. behavior, without relating them to 
outcome measures. Since global success ratings 
(made by the counselors) were available, how- 
ever, these were examined in regard to their 
relationship to the variables of counselor 
judgmentalness and client discussion of feel- 
ings. The trend was in the direction of a nega- 
tive relationship of outcome with discussion of 
feelings and (for the Illinois group) with coun- 
selor judgmentalness. At any rate, there was 
clearly no support for a positive relationship 
between discussion of feelings and counselor’s 
judgment of degree of therapeutic success. 
These results suggest, then, that although a 
judgmental counselor can elicit talk about 
feelings in early counseling sessions, he may 
not facilitate the patient’s dealing with these 
feelings in a meaningful and therapeutic way. 
Thus, in terms of the initial formulation of 
this study, the client may learn to talk about 
his feelings with a judgmental counselor, 
though he may not learn from him that the 
counseling situation is safe for the further 
experiencing and expressing of feelings that 
are necessary for therapeutic gain to occur. 

Two other aspects of the present results 
deserve brief mention. The significant rela- 
tionship with discussion of feelings was found 
for all three methods of rating judgmentalness. 
The fact that the highest relationship was 
found for the unfiltered tape condition (r=.74) 
and somewhat lower and approximately equal 
relationships for the other two conditions (r’s 
of .57 and .59) is especially interesting in light 
of the high correlations between ratings of 
n the filtered and unfiltered 
and between ratings in 
ered conditions (r=.74), 
with a somewhat lower correlation between 
filtered and transcript ratings (r = .49). This 
combination of findings leads to the specula- 
tion that each of the dimensions, i.e., verbal 
(transcript) and vocal (filtered tape), con- 
tributes to the perception of judgmentalness, 
and each makes a contribution which is im- 
portant in the relationship of this variable to 


ent discussion of feelings. 


judgmentalness i 
conditions (r = -73) 
the transcript and unfilt 


cli 
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Also of note is the finding that ratings based 
on contentless voice quality can relate sig- 
nificantly to other variables in the counseling 
situation. Previous research (Starkweather, 
1956a, 1956b) has utilized this filtering tech- 
nique in quite different situations. 
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THE CONSTRUCT VALIDILY OF THE PROGRESSIVE 
MATRICES AS A MEASURE OF SUPEREGO 
STRENGTH IN JUVENILE DELINQUENTS 


WILFRED H. HIGASHIMACHI? 
Utah State Department of Health, Division of Mental Health 


Raven’s Progressive Matrices (1938 & 1947) scales were used to determine 
whether they were sensitive in reflecting differences in superego strength and 
functioning. It was postulated that a high score on the Matrices is associated 
with a measure of strong superego on the Picture Arrangement Test (PAT), 
and a low score on the Matrices with a measure of weak superego on the 
PAT when general ability is held constant. 12 delinquent boys composed 
the strong superego group, and 12 composed the weak superego group. The 
combined Matrices scores revealed that the strong superego group had a 
significantly higher score beyond the .01 level of confidence than the weak 
Superego group. The results give support to the fact that the Matrices are 
sensitive in reflecting differences in superego strength and functioning in a 


population of delinquent boys. 


Booed Togressive Matrices “was designed to 
Ment io Whole range of intellectual develop- 
) ae infancy to maturity” (Raven, 
Binet’ ae England, it ranks second only to 
1948 T. a test of general intelligence (Slater, 
to be a aven S Progressive Matrices purports 
and is hime ig measure of education, 
Nonperfor eee as an untimed, nonverbal, 
i =e test of intellectual function- 
ligence H on Spearman’s g factor of intel- 
n must DEMS Raven (1954) states that 
ces as a not mistake the Progressive Matri- 
should ¢ test of “general intelligence” but 
observatio eT it more as a useful test of 
ince ia or clear thinking.” 

trices to introduction of the Progressive 
been ii to the United States, attempts have 
Various ade by others to correlate it with 
88 the Sro-known intelligence tests, such 
1955. Sore: Binet (Stacey & Carleton, 
Bellevy tacey & Gill, 1955), Wechsler- 
a54), (Desai, 1955; Levine & Iscoe, 
cal, igs Adult Intelligence Scale 
& ildren ee Wechsler Intelligence Scale for 
ip Carleton artin & Wiechers, 1954; Stacey 
a 5 ee 1955), ACE and Otis (Bolin, 
Nge results having a generally wide 
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Bromley (1953) found that the Progres- 
sive Matrices failed adequately to account 
for errors made in disordered thinking of 
psychiatric cases when the test is regarded 
as a test of “abstract” intelligence. Watson 
(1955) noted personality problems affecting 
performance on the Matrices, and felt that 
boys with personality problems would per- 
form better if the test were administered indi- 
vidually with adequate encouragement rather 
than in a group testing situation. Halstead 
(1943) observed that neurotic subjects who 
scored a high grade on the Matrices test 
tended to be less disturbed by their neuroses. 

Johnson (1953a, 1953b, 1953c) believed 
that an objective Raven score may provide 
some impression of superego capacity, and 
attempted to investigate the sensitivity of 
the Progressive Matrices as a clinical pre- 
dictor of play therapy progress. Johnson 
(1953a) considers that: 
ed by this test appears to be the 


the capacity tapp! p to 
capacity to apprehend the governing principle, to 
apply it to the untried situation, and to concur in 
the validity of the solution thus educed. Readiness 
to assume such behavior seems commonly related to 
“character,” as well as to level of intellectual 
potential. 

She feels that if the subject is rarely aware 
of his mistakes, the only pressure upon him 
is his willingness to become absorbed in 


solving the problems according to his recogni- 
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tion of the principles of progression. How- 
ever, Johnson’s investigations have not clearly 
demonstrated that the test performance of her 
subjects was equated to superego capacity. 

There is still much doubt concerning the 
range of applicability of Raven’s Progressive 
Matrices test. Research concurs that the 
Matrices cover a wide range of intellectual 
activities which cannot be comfortably ac- 
commodated by Spearman’s theory of “gen- 
eral intelligence.” A major question is: “What 
functions of an individual’s performance does 
this test measure?” If personality factors play 
an important role in a subject’s test per- 
formance, “Can the Progressive Matrices re- 
flect major differences in personality factors 
of individuals, such as those of juvenile delin- 
quents, consistently and reliably?” Disorders 
of superego development have been strongly 
emphasized in psychoanalytic interpretations 
of juvenile delinquent behavior (Aichorn, 
1951; Eissler, 1949; Forsche, 1956; Fried- 
lander, 1947; Hora, 1952; Redl & Wineman, 
1957). 

Therefore, this study will attempt to clarify 
Johnson’s belief that the Progressive Matrices 
test is sensitive as an instrument in providing 
some impression of superego capacity. It is 
felt that a person who is able to defer im- 
mediate pleasure and become absorbed with 
a difficult or unpleasant task may possess a 
strong superego; while a person who is guided 

by the pleasure principle (in the Freudian 
sense) and who regards all tasks as unpleas- 
ant in which there is no immediate reward 
may possess a weak superego (Oberndorf, 
1951). The purpose of this study was to 
investigate one particular aspect of the juve- 
nile delinquent’s personality, i.e., superego 
strength, in relation to his performance on 
the Progressive Matrices. 

The selection of subjects and interpretation 
of performance on the Progressive Matrices 
in this study were guided by the conception 
of a strong, restrictive, back-to-work type of 
superego as measured by the Tomkins-Horn 
Picture Arrangement Test (PAT). Such a 
conception of a strong superego was con- 
trasted with a weak, permissive superego 

which allows liberal, antisocial impulse ex- 


pression within a work situation. 
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The PAT (Tomkins & Miner, 1957) con- 
sists of 25 plates (each containing line draw- 
ings of the “hero” depicted in three different 
but related situations), with 14 of the plates 
primarily concerned with the work situation. 
There are 655 scoring keys, and the logic of 
the test interpretation rests upon the plate 
arrangements that are rare or improbable 
(occurring at the 5% level of probability oF 
less) in any education, intelligence, or age 
subgroup of the normal sample. 

On the basis that the PAT can differentiate 
juvenile delinquents with strong and he 
superego strengths, the major hypothesis © 
this study may be stated as follows: The 
Progressive Matrices test is sensitive 10 re- 
flecting differences in superego strength an 
functioning; a high score on the Matrices 
is associated with a measure of strong ee 
ego on the PAT, and a low score on = 
Matrices is associated with a measure A 
weak superego on the PAT when genera 
ability is held constant. 

The specific hypotheses to be teste 
as follows: 


d are 


superego 


è i more 
1. Delinquents who have ssion 


strength and control over impulse exPre we 
will obtain higher scores on the Progress le 
Matrices (Colored 1947 scale and 1938 pe: 
separately and combined), as compare 
delinquents who have less superego strene 
and control over impulse expression. 

2. Delinquents who have more sion 
strength and control over impulse sie a 
will be more ego involved with their Der rice? 
ance on the combined Progressive Ma e in 
scales, and thus will take a longer ge: 
their performance than delinquents W 7 yer 
less superego strength and contro 
impulse expression. pitty ° 

3. There will be a greater variabi trices 
scores on the combined Progressive 3 per 
scales among delinquents who have less * Sres- 
ego strength and control over impulse ore 
sion, than among delinquents who hav’ | tse 
superego strength and control ove a 
expression. 


superes? 


METHOD al ected 


” ze als rent 
The 24 subjects for this study We a diffe 
from a delinquent population of boys 
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the la backgrounds who had come in contact with 
Toko ma were committed by the courts to the 
ile ani prigi ior. Baye ae Golden, 
oF 12 0. These subjects were within the age range 
sore o 16 years possessing an average 
Menta Ability. on the Henmon-Nelson Tests of 
ea ion to the above procedures, the following 
subjects: (a) used in the initial selection of all 
the abilit a) At least a sixth grade education, (b) 
dence wee read and write, (c) a minimum resi- 
Physical o Tee months at the institution, (d) no 
T obs T organic handicaps, (e) no psychosis, and 
a ss any psychotherapy. 
of the Saving these procedures with the assistance 
were initiati Counselor at the institution, 48 boys 
in a prem Y secured. The PAT was administered 
guard, thos Setting as a screening test. As a safe- 
Scoring at be Subjects who were one response from 
“Strong S the 5% level of probability on the 
assivity So ego—Back to Work After Injury, 
Sonality t ‘atigue, Sociophilia, and/or Anger” per- 
ego” Gro rait were omitted from the “Weak Super- 
from the W This resulted in the loss of 14 subjects 
Subjecte „eak Superego Group. Furthermore, eight 
the pa’ Were lost through the validity scales of 
final samp and two were lost through illness. The 
in e Ple consisted of two groups with 12 subjects 
iy group: 
Yi 
scoring si I (Strong Superego): Those subjects 
Or less agnificantly at the 5% level of probability 
After ae the “Strong Superego—Back to Work 
Anger» OUY, Passivity, Fatigue, Sociophilia, and/or 
iDeregg sooty trait on the PAT. (This Strong 
Personality trait is defined quantitatively 
T as being at the very extreme end of a 


(Sp ae 
ean age 0). The Weak Superego Group had a 
gona wien 14.83 years (SD = 1.07), mean educa- 
383 SDS (SD =1.24), and mean IQ of 
he aa Ss 
i Oups ois who were finally selected for the two 
the ea all seen individually by the examiner 
eey Ltions itution. Each subject was given the 
the anual pee to Raven’s Progressive Matri- 
in Examiner or individual test administration, vin 
i . . you ig me “You can have as muč 
4 i 
on, Set ney subject had understood the first pattern 
ew the Colored Children’s Matrices (1947) 
th of tie z instructed to carry on through the 
th. answer 1 patterns at his own pace, calling out 
th. a ier the examiner, who scored them on 
dy, length orm. Each subject was then timed for 
ate o of his test performance without his being 
it. The test performance time began when 
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the subject started with the second pattern in Set A, 
working through Sets A, Ab, B of the Colored 
Matrices and continuing through Sets C, D, E of 
the Matrices (1938) scale until he finished the last 
pattern in Set E. The subjects were encouraged 
to give a response to all the 72 matrix patterns. 

The Progressive Matrices answer forms were all 
manually scored by the examiner. As a cross valida- 
tion study, social histories and adjustment records 
were evaluated in hopes of finding any pertinent 
trend which may further differentiate the Strong 
Superego and Weak Superego groups. 


RESULTS 


In analyzing the results of this study, 
major consideration was placed on comparing 
the performance on the combined Progressive 
Matrices (1938 and 1947) scales of delin- 
quent boys possessing a Strong Superego and 
those possessing a Weak Superego as meas- 
ured by the PAT. 

Statistical analysis of the combined Pro- 
gressive Matrices (1938 and 1947) scores 
suggested the use of the Mann-Whitney non- 
parametric test, since the two groups of scores 
were obtained from a very select population. 
The combined Matrices scores revealed that 
the Strong Superego Group had a significantly 
higher score beyond the .01 level of confidence 
than the Weak Superego Group (Group I 
mean score = 53.92, range = 64-46, SD= 
5.68; Group II mean score = 38.75, range 
= 55-16, SD = 12.09). The Strong Superego 
Group also obtained significantly higher 
scores than the Weak Superego Group on the 
separate Matrices (1938) scale beyond the 
(01 level of confidence (Group I mean score 
= 42.67, range = 52-36, SD = 5.44; Group 
II mean score = 29.92, range = 44-11, SD = 
10.49), as well as the separate Colored 
Matrices (1947) scale beyond the .02 level 
of confidence (Group I mean score = S175; 
range = 35-27, SD = 2.45; Group II mean 
score = 25.92, range = 34-14, SD = 5.94). 
A comparison of the test performance times 
of the two groups by the Mann-Whitney 
Test revealed that the Strong Superego Group 
took a considerably longer time, and appar- 
ently seemed to be more ego involved with 
their test performance beyond the .001 level 
of confidence than the Weak Superego Group 
(Group I mean time = 26.10 minutes, range 
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= 38 minutes 14 seconds to 19 minutes 54 
seconds, SD = 5.78; Group II mean time = 
15.46 minutes, range = 21 minutes 40 sec- 
onds to 9 minutes 38 seconds, SD = 3.80). 
On the suggestion of McNemar? (1957), 
an F test was used to estimate the variance 
of the two groups. It was found that the 
Weak Superego Group had a greater vari- 
ability of scores on the combined Progressive 
Matrices, Sets A, Ab, B, C, D, E, as com- 
pared to the Strong Superego Group near 
the .05 level of confidence using a conserva- 
tive estimate. Although the separate Matrices 
(1938) scale, Sets A, B, C, D, E, revealed no 
significant differences (using a conservative 
estimate) between the two groups in varia- 
bility of scores, the separate Colored Matrices 
(1947) scale, Sets A, Ab, B, did reveal a 
greater variability of scores among the Weak 
Superego Group beyond the .05 level of con- 
fidence (using a conservative estimate) as 
compared to the Strong Superego Group. 
The F test was also used to analyze the 
equality of variance of the total errors of 
the two groups on the different subtests of 
the Progressive Matrices (1938 and 1947) 
scales. The results revealed that the Weak 
Superego Group had a greater variability of 
scores than the Strong Superego Group as 
a result of having more errors on Set A 
(near the .01 level of confidence using a 
conservative estimate) and Set Ab (beyond 
the .05 level of confidence using a conserva- 
tive estimate) of the Colored Matrices (1947) 
scale, and on Set D of the Matrices (1938) 
scale (beyond the .01 level of confidence 
using the conservative estimate). 


DISCUSSION 


Significant differences between the Strong 
Superego and Weak Superego groups of delin- 
quents were obtained, thus supporting the 
major hypothesis of this study, i.e., the Pro- 
gressive Matrices test is sensitive in reflect- 


2“For data not satisfying the assumption of the 
F test, it would be far better to proceed with the F 
test and require that an obtained F reach the 01 
level in order to be sure of significance at better 
than the .05 level or that an obtained F reach 
the .005 level for significance at near the .01 or .02 


level [p. 3621.” 


WILFRED H. HicAsHIMACHI 


ing differences in superego strength and 
functioning; a high score on the Matrices 
is associated with a measure of strong supet- 
ego on the PAT, and a low score on the 
Matrices is associated with a measure of 
weak superego on the PAT when general 
ability is held constant. i 

There were consistent and reliable differ- 
ences in performance between the two groups 
of delinquents on the Progressive Matrices 
scales supporting Hypothesis 1, ie, delin- 
quents who have more superego strength an 
control over impulse expression will obtain 
higher scores on the Progressive Matrices 
(Colored 1947 scale and 1938 scale, sepa- 
rately and combined), as compared to delin- 
quents who have less superego strength an 
control over impulse expression. 

In view of the results, it may be hypothe- 
sized that the “Strong Superego—Back 1 
Work After Injury, Passivity, Fatigue, Socio 
philia, and/or Anger” personality trait had & 
considerable influence on the test perform- 
ance of one of the delinquent groups- Sig 
nificant differences between the test scores A 
the Strong Superego and Weak sipa 
groups tend to support Johnson’s (19 a 
1953b, 1953c) belief that an objective pe 
score may provide some impression of moe 
ego capacity. The results of this study ae 
support the possible use of the Progress 
Matrices as a convenient screening test ing 
assessing superego strength and functo ce 
among delinquent boys of average intellige 
who are committed to industrial schools. 
institutionalized delinquents can be super” 
gated on the basis of weak and strong S"? gd 
egos, perhaps different rehabilitation 4 in 
education programs could be incorp O" pution 
helping them to adjust within the inst! 
as well as in society (Aichorn, 1951). ereg? 

It was observed that the Weak ou less 
Group of delinquents tended to be jaa 
haphazard, and impulsive in their Pine 
ance on the Matrices as compare’ g pe 
Strong Superego Group who seeme anc 
more meticulous in their pentoni 
Porteus (1945) noticed that some, 
delinquents were careful and exact mo$i 
performance on the Mazes, althoug owt? 
delinquents had decided tendencies 
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ee and impulsiveness of behavior 

m nullified their planning actions. 
nienn results of this study do show sig- 
De differences between the two groups 
e to length of time involved in test 
i.e. ddin thus supporting Hypothesis 2, 
reien S who have more superego 
will De and control over impulse expression 
thee oe eee ego involved with their perform- 
(1938 a 4 e combined Progressive Matrices 
onger = 1947) scales, and thus will take a 
Ghent: ee in their performance than delin- 
control A have less superego strength and 
Performan- impulse expression. The average 
n w ce time for the Strong Superego 
the Weak a little over 26 minutes, and for 
minutes. Yee Group a little over 15 
changes, would be interesting to note what 
especial] would occur in the Matrices’ scores, 
ii boit y those of the Weak Superego Group, 
time in ene were asked to take a longer 
Work. eir performance and to recheck their 
tee Superego Group's tendency to 
ability to actions seemed to handicap their 
ing princi any te and apprehend the govern- 
appeared |, e in the Matrices’ patterns, They 
and tend ry be less ego involved with the test 
the Stron, to be more prone to guessing than 
ormance = Superego Group. The poor per- 
Sticed y of the Weak Superego Group was 
of one. early in the first two Sets A and 
Scale, Alth olored Children’s Matrices (1947) 
Signs of ough they seemed to show some 
Of their SEn involvement during the middle 
nto thei est performance, they soon lapsed 
Came sin old habits when the patterns be- 
Oticeg ne singly harder. This was especially 
‘ et D of the Matrices (1938) scale. 
Bypot test results also tend to corroborate 
Varia laS 3, ie., there will be a greater 
sive q, 7, 9f Scores on the combined Progres- 
hay, i scales among delinquents who 
impulse Superego strength and control over 
Who ha expression than among delinquents 
ae fom, More superego strength and control 
Score, Pulse expression. Analysis of the test 
Vatianan the two groups revealed a greater 
Subjec ity resulting in more errors among the 
Mon ce the Weak Superego Group than 
e subjects in the Strong Superego 
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Group. Another possible explanation for the 
greater number of errors and guessing by the 
Weak Superego Group may be attributed to 
their chronically poor social adjustment. This 
may have handicapped their early schooling, 
and thus never allowed them the opportunity 
to set up standards of mature, independent 
self-accomplishment when faced with new 
learning situations. 

The Henmon-Nelson Tests of Mental 
Ability have a reported validity coefficient 
around the .80s with other well-known intel- 
ligence tests (Buros, 1945). On the basis that 
the two groups of delinquents possess an 
average level of intelligence as measured by 
the Henmon-Nelson Tests of Mental Ability, 
a comparison of intellectual levels was made 
with the scores of the two groups using 
Raven’s Progressive Matrices (1938) norms. 
According to Raven (1951), the Colored 
Children’s Matrices (1947) scale (by omit- 
ting Set Ab) can be combined with the 
Matrices (1938) scale, Sets C, D, E, in order 
to use the norms for Progressive Matrices 
(1938). 

The Matrices scores of the Strong Superego 
Group would classify two of the subjects 
as “definitely above average in intellectual ca- 
pacity,” six as “intellectually average,” and 
four as “intellectually below average in intel- 
lectual capacity.” In the Weak Superego 
Group, three of the subjects would be clas- 
sified as “intellectually average,” six as “defi- 
nitely below average in intellectual capacity,” 
and three as “intellectually defective.” 

The results of this study seem to support 
the findings of others that the Progressive 
Matrices test measures something other than 
general intelligence. Factor analytic studies 
of the Matrices, as well as batteries of reason- 
ing tests, have found numerous factors be- 
sides “general reasoning” playing an impor- 
tant part in the actual solution of the test 
items. It seems to be the general consensus 
of many investigators that personality factors 
and emotional attitudes affect “clear think- 
ing” in test performance. The test-retest reli- 
ability of the Progressive Matrices may not 
only be influenced by the nature of the test 
items themselves, but also by the emotional 


state of the subject taking the test at dif- 
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ferent intervals. It has been demonstrated 
that personality factors, such as superego 
strength, affect the Progressive Matrices as a 
valid measure of general intelligence and clear 
thinking. 

Juvenile delinquents, as demonstrated in 
this study, seem to have more than one type 
of behavior and personality configuration. 
The two groups of delinquents in this study 
were selected to reveal different cleavages in 
ego and superego alignment described in the 
literature. Consistent and reliable differences 
of the two groups were also revealed in their 
behavior and adjustment records at the 
institution. 

A comparison between the Strong Superego 
and Weak Superego groups of delinquents 
was made on the number of demerits they 
received during their first three months resi- 
dence at the institution. An analysis of the 
total demerits received by the two groups 
(based on the average rating of the demerit 
system at the institution) by the Mann- 
Whitney Test revealed significant differences 
between the two groups beyond the .02 level 
of confidence. It was noted that during the 
first three months’ residence at the institu- 
tion, the Strong Superego Group had earned 
a mean of 4.83 demerits (SD = 5.26), while 
the Weak Superego Group had earned a mean 
of 16.08 demerits (SD = 11.31). 

A further comparison of the two groups 
was made on the number of runaways from 
the institution during their first 3 months 
residence. Using a chi square test of “good- 
ness of fit,” it was noted that the Weak 
Superego was more likely to run away 
(beyond the .05 level of confidence) from 
the institution than the Strong Superego 
Group during its first 3 months period of 
residence. It is interesting to note that after 
the testing was completed, three of the boys 
in the Weak Superego Group did run away 
from the institution and were later appre- 
hended. This may further relate to the Pro- 
gressive Matrices and PAT as meaningful 
predictive tests of important behavior and 
adjustment patterns in the institution. 

Tt was not within the scope of this study 
to cover the major sociocultural and environ- 
mental factors which have been reported to 
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contribute to delinquent behavior, because of 
the very small sampling of the delinquent 
population. However, regardless of socio- 
cultural and environmental factors, the Pro- 
gressive Matrices scales are sensitive in re- 
flecting consistent and reliable differences 1m 
superego strength and functioning in a delin- 
quent population of boys when general ability 
is held constant. 

Although there is a growing awareness of 
the need to establish diagnostic facilities and 
predictive tools for the youthful offender, 
generalizations from the results of this study 
should be limited only to similar groups 
of institutionalized delinquents with caution. 
Further studies are in order with respect tO 
the use of the Progressive Matrices as a pre- 
dictive tool in observing and classifying 
delinquent behavior, as well as behavior 0n 
a large scale. 
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FACTORS OF PSYCHOPATHOLOGY IN THE WARD 
BEHAVIOR OF ACUTE SCHIZOPHRENICS’ 


ALLEN RASKIN AND 


Georgetown University 


DEAN J. CLYDE 


George Washington University 


2 factor analyses were performed on nurses’ ratings of the ward behavior 
of 417 acute schizophrenic patients. The 10 psychotic syndrome scores which 
Lorr, McNair, Klett, and Lasky (1962) derived from behavior observable in 
an interview were included in one of the analyses. 11 factors labeled Self-Care, 
Social Participation, Extraversion, Irritability, Anxiety, Guilt Feelings, Depres- 
sion, Feelings of Unreality, Slowed Speech and Movements, Paranoid Pro- 
jections, and Excitement were extracted from the ward behavior ratings. 
There was good correspondence between the factors of psychopathology 
extracted from patient ward behavior and the Lorr et al. psychotic syndromes. 


Lorr, McNair, Klett, and Lasky (1962) 
present evidence for 10 postulated psychiatric 
symptom syndromes based on behavior ob- 
servable in an interview. As noted by these 
authors, quantified indices of symptom syn- 
dromes have practical value as criteria of 
change in studies on effects of various treat- 
ment procedures and may also provide a use- 
ful approach to the hoary problem of patient 
classification. However, the symptom syn- 
dromes identified by Lorr et al. have certain 
limitations both as measures of improvement 
and as a schema for classifying psychiatric 
patients. First, many of the psychiatric rating 
scale items, which comprise the raw data for 
the Lorr et al. study, describe behaviors in- 
dicative of severe psychopathology; mild and 
moderate aspects of psychopathology are less 
well represented in these scales. Secondly, the 
behaviors sampled are limited to those that 
can be observed in an interview of approxi- 
mately an hour’s duration. 

In the present study an effort was made to 
identify factors of psychopathology in the 
ward behavior of acute schizophrenics. Be- 
haviors characterizing mild, moderate, and 
severe psychopathology are equally repre- 
sented in the ward behaviors sampled. The 
10 psychotic syndrome scores, derived from 
Form Number 3 of the Inpatient Multidimen- 
sional Psychiatric Scale (IMPS), were in- 


1 This research was supported by funds made 
available by Research Grant MYP4663 of the Psy- 
chopharmacology Service Center, National Institutes 
of Health, Public Health Service, United States 
Department of Health, Education and Welfare. 


cluded as marker variables in one of tig 
factor analyses of patient ward pena 
The syndrome scores were included to clarin 
the meaning of emerging factors of p 
chopathology in patient ward behavior an 
extend the meaning of the Lorr et ot 
symptom syndromes by identifying their ©° 
relates in patient ward behavior. 


METHOD 
Subjects 


r collabora- 


The present study is part of a large treatment 


tive study of the effects of phenothiazine 
on acute schizophrenic psychoses. The stu 
consisted of 185 male and 232 female Pi", 


drawn from the psychiatric populations p state 
large, metropolitan receiving hospitals,” 1° Newly 
hospitals, and three private institutions: ith 2° 
admitted patients, 16 to 45 years of age hi pas 
previous psychiatric hospitalization within chit” 
6 months and no evidence of childhood Sprain 


phrenia, mental deficiency, chronic or acl! proni 
syndrome, epilepsy, drug addiction, O" satisfy 
alcoholism were included in the study. +° nosis 
the diagnosis of acute schizophrenic pet two 
patients had to manifest symptoms in at Teolog" 
of the following categories of psychop? tatonic 
(a) thinking and speech disturbance, (b) - pe- 
motor behaviors, (c) paranoid ideation ° 


2D. C. General Hospital, Washington, D. C 
Malcolm Bliss Mental Health Center, 
Missouri. 

3 Boston State Hospital, Boston, en 
Kentucky State Hospital, Danville, york, * 
Rochester State Hospital, Rochester, New Jand: t 
Springfield State Hospital, Sykesville, Mary nection” 

‘Institute of Living, Hartford, °° yan 
Mercy-Douglass Hospital, Philadelphia, ah 
and Payne-Whitney Clinic, New York © 


ust 
Massacht ky: 
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havi PEN 
y a hallucinations, (e) delusional thinking, 
urbances of affect and emotion, and (g) 


disturban i z 
relations. of social behavior and interpersonal 


Procedure 


Be š 
Fe dg kazy second week in the hospital and 
treatment oa assignment to one of the four drug 
two Psychiatrh pase patients were interviewed by 
immediately i s or a psychiatrist and a psychologist. 
viewer rated lowing this interview, each inter- 
Ho: iise nated cat A the IMPS. In addition, 
avior Ratin ated each patient on the Ward Be- 
akerem, & 7, te (WBRS) (Burdock, Hardesty, 
irst Week of gea 1960) at the end of the patient’s 
signment to ospitalization and prior to his as- 
om N one of the drug treatment groups. 
9-point ae 3 of the IMPS consists of 46 
Addition, 1 13 S-point unipolar rating scales. In 
absent, Each checklist items are rated present or 
Mm the ratin of the 10 syndrome scores is represented 
pae schedule by 5 to 11 scale variables. 
Pr rome scores are labeled Excitement, 
ojections, Motor Disturbances, Hostile 
i on Depression Agitation, Perceptual Dis- 
athy and Retardation, Grandiose Ex- 
= ion Thinking Disorganization, and Dis- 
he ¥ 
lems ERS contained the original 150 check list 
enal dime OE al al. (1960) plus three ad- 
Prince a puted by the staff of the Psy- 
y Mental ET Service Center, National Institute 
tesen; ealth. Each check list item is rated 
ioni absent. 
rto See electronic computer programs for a 
10 Psible to were limited to 100 variables, it was 
: ine the 153 WBRS items and the 
on pOf BRS i scores in a single analysis. Selec- 
th asks items for inclusion in the analysis 
the is peg rationally derived syndromes ap- 
Hop tor Üa the best approach for identifying 
eve tors of psychopathology in the WBRS. 


Wi r, s 
He be ped it is anticipated WBRS factor scores 
merit l oth as criterion measures in studies of 
item in uin treatment effects, there was also 

g item reliability as a criterion for 


Were prlection, 
Pri “formed 


Wer Y to 
"nd tation first analysis, the 153 WBRS items 
inica eS of grouped by the authors and 14 
assign, Psych psychopathology identified. Five 
ned e Ologists, working independently, then 
of the WBRS items to one of the 
derived syndromes. The 90 WBRS 
in the first factor analysis met the 


Consequently two factor analyses 


e 
Second factor analysis was based on 
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100 WBRS items meeting the criterion of an 
intraclass reliability coefficient of 35 or higher 
between the paired nurses ratings. Twenty-seven 
items failed to meet the criteria for inclusion in 
either analysis. 

The dichotomous and continuous variables in the 
first analysis were intercorrelated by the Pearson 
product-moment, point biserial, and phi coefficient 
methods.5 The correlation matrix for the dichoto- 
mous variables in the second analysis consisted 
of phi coefficients. Both correlation matrices were 
factor analyzed using Hotelling’s principal com- 
ponents method with unity in the diagonals. A 
normal varimax rotation, which provides an or- 
thogonal solution, was then performed on all factors 
with eigenvalues of one or greater. 


RESULTS 


Eleven factors containing at least three sig- 
nificant WBRS items were extracted from 
both factor analyses. Item significance was 
arbitrarily defined as a loading of 40 or 
higher on one factor and no loading of .40 
or higher on any other factor. As the IMPS 
syndrome scores are based on a weighted 
combination of at least five individual scales, 
were derived from an oblique rotation and 
provide useful frames of reference for the 
WBRS factors, they are included in a WBRS 
factor if they load .27 or higher on that 
factor. With minor exceptions the same 
WBRS factors emerged from both factor 
analyses despite the previously described dif- 


5 Although the point biserial and phi coefficient 
are systematically lower than the respective Pearson 
r equivalents, the biserial correlation coefficient and 
the tetrachoric correlation coefficient, they do not 
make the assumptions that the dichotomous vari- 
able(s) is actually continuous and its underlying 
distribution is normal bivariate. As many of the 
WBRS items fail to meet these assumptions, the 
point biserial and phi coefficient methods were 


chosen. 
6 Unity was 
reasons. First, 


chosen as the diagonal entry for two 
in an empirical study, Tyler and 


Michael (1958) compared the factor structure of 
anthropometric data when unitics and communality 
estimates were used. The same common factors were 
jdentified for both types of diagonal entry with 
the degree of similarity of loadings on the same 
factors exceedingly high as shown by two different 
indices of similiarity. Secondly, use of unity as the 

vation of exact factor 


diagonal entry permits deri c 
scores for individual patients whereas with com- 


munality estimates approximate factor scores are 


obtained. 
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TABLE 1 


Factor A: SELF-CARE 


Factor loading 


Number Item content Is Ir 
9 Needs help in dressing a05 —72 
10 Follows instructions 60 56 
28 Repeats words and phrases -H 
34 Answers questions about himself 41 
36 Takes care of personal appearance 76 78 
42 Is able to make own bed 69 68 
46 Knows new and old personnel 40 
53 Does chores about ward 42 42 
54 Makes a nuisance of himself —42 —40 
60 Gets along with other patients 52 49 
65 Has to be helped in the bathroom —61 -59 
72 Speech is sensible and connected 58 50 
93 Refuses things expected of him —68 —68 
95 Looks sloppy —66 — 69 
98 Has to be helped to stick to things —42 <4 
100 Keeps self clean 77 79 
102 Washes or showers himself 78 80 
109 Cooperates with hospital personnel 61 64 
120 Behaves acceptably off ward 59 
IMPS Disorientation —62 
IMPS Motor Disturbances —68 
IMPS Thinking Disorganization —42 
a Percentage of variance, 10.5; first factor analysis. 
b Percentage of variance, 9.5; second factor analysis. 
ave 
ference in criteria for item inclusion.” Con- typed speech, incoherent answers) h 


sequently, combined results for both analyses 
are reported. 

Factor A (see Table 1) contains the 
largest number of significant items and also 
accounts for the greatest percentage of total 
item variance. This factor, labeled Self-Care, 
reflects the patient’s ability to care for his 
hygienic needs, personal appearance, and 
daily ward chores. Conversely, this factor is 
indicative of the extent of nursing care and 
attention required by the patient. Three 
IMPS syndromes, Disorientation (e.g., dis- 
oriented as to hospital, age, year), Motor 
Disturbances (e.g., talks to self, slovenly ap- 
pearance, repetitive movements), and Think- 
ing Disorganization (e.g., neologisms, stereo- 


7 Identification of the same common WBRS fac- 
tors in both analyses and the high degree of 
similiarity of loadings on the same factors for both 
analyses indicate little resultant distortion of the 
first factor analysis as a consequence of analyzing 
nt biserial correlations and phi coefficients in 


pon product-moment correlations. 


the presence of 


significant loadings on Factor A. For Se 
part, these IMPS syndromes have in Gn ac 
behaviors characterizing severe loss of C9 
with reality. Hence, as one might exp 
patient’s inability to care for bis P a 
needs and the extent of his demands BE E 
time and services of nursing pason with 
inversely related to degree of contac thet 
reality. From a nursing standp in tility 
aspects of psychopathology such as re de 
paranoid ideation, and depression ffc? 
closely associated with patient 5 
activities. .. . measul 
Factor B, Social Participation, iS & | nef 
of the patient’s interest in being W soci?! 


ect, 
erso” 


` +s ation i a 
patients and extent of participation. í p 
and recreational activities. In term’ | oci 5 
chopathology, it is the converse calè 


isolation and withdrawal. As the mE ja ao 
do not sample patient behaviors 9 ` ple p 
there is no IMPS syndrome comP z 

this factor. 
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TABLE 2 


Factor B: Soctar PARTICIPATION 
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TABLE 4 
Factor D: IRRITABILITY 


Factor 
Number loading pedir 
Item content eo IP Number Item content T Ib 
25 a 
26 Te ae hae someone on ward —50 —47 7 Gets angry when kidded 72 
39 Makes ES SNe RS HERS -73 -74 14 Has temper tantrums 44 56 
BS oime inci talk —62 16 Is irritable and grouchy 70 70 
57 Shows ra cial games —71 —74 23 Is impatient 66 
SS Likes ates in going to O.T. -61 27 Gets angry when questioned 70 70 
6 Shows ph al about his interests —62 49 Shows frequent changes of mood 42 49 
69 Sfiaiis a asure tn recreation —70 —80 74 Is easily upset 63 
117 Takes interest in outside events —56 —S7 119 Accepts hospital routine —S4 
121 Tatks Part in ward activities -76 136 Makes sarcastic remarks 63 62 
124 Likes oe happenings on ward —63 146 Has an angry expression 70 70 
= occupied —69 IMPS Hostile Belligerence 41 
b pecentage of vari r 
Sa ‘Vallance, 9p Bratfactor analys a Percentage of variance, 5.0; first factor analysis. 
4; second factor analysis. 


ntage jari 
ge of variance, 8.1; second factor analysis. 


Al 
al sie Factor C, Extraversion, is ration- 
Scribes a ed to Social Participation, it de- 
With the More active and assertive social role 
Volunteer ont initiating conversation and 
. The IMS information about himself. 
'S repres S Hostile Belligerence syndrome 
Irrita teen in the WBRS by Factor D, 
tituqe D This factor characterizes an at- 
p Ptessig discontent and sullenness. Motor 
teaks th of hostility such as “Tears or 
loag on m and “Hits others” do not 
Syndr a factor. The Hostile Belligerence 
Drise ee the IMPS is similarly com- 
Andy items reflecting hostility in attitude 
Ae on expression. 
Weiety to the IMPS, a separate 
of R items, { Factor E, emerges from the 
sp tiety nt —— to observable signs 
of eves istress this factor also de- 
a eight in behavioral accompaniments 
ened anxiety state such as hesi- 


b Percentage of variance, 


tancy in making up one’s mind and an ap- 
pearance of helplessness and perplexity. The 
IMPS has three anxiety items. Two relate 
to verbal reports of anxiety in anticipation 
of future problems and in anticipation of 
efinable future misfortunes. In the 


vague ind 
study by Lorr et al. ( 1962) these items had 
significant loadings on the IMPS Agitated 


syndrome. The third IMPS 
anxiety item, “Manifests overt signs of ten- 
sion,” did not have a significant loading 
on this syndrome. Perhaps the patient who 
is both anxious and depressed verbalizes 
feelings of anxiety to the interviewer whereas 
many anxious patients lacking depressive 
features do not. 

The IMPS Depressive Agitatio 
had high loadings on two factors, Guilt Feel- 
ings (Factor F) and Depression (Factor G). 
Many of the behaviors in the IMPS Depres- 
sive Agitation syndrome are represented in 


Depression 


n syndrome 


Rtg, ta, 

ent Be of N 

tage of Variance, 1.8; first factor analys 
ariance, 2.3; second factor analysis 


TABLE 3 TABLE 5 
F, D: 
ACTOR C: EXTRAVERSION Factor E: ANXIETY 
u Factor Eagior 
Fa ondi 
Mber loading Ees 
Mii 
8 Ttem content Ia re Number Item content Is IIb 
So Vor 
Unteers jı A Is tense and anxious —61 
6 i s inf -42 18 s ten: 
s OTTOA sige E teal 4 18 “Appears helpless or perplexed —40 —63 
73 i ten relatives visit 59 S 
Un Se conversation = 48 Is fidgety and nervous —70 
e a j i ki i = 
Pe S to be with the opposite sex —57 —59 114 Js hesitant in making up mind 71 
Ce; = 
of variance, 2.4; first factor analysis. 


a Percentage 


b Percentage 2.6; second factor analysis. 


of variance, 
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TABLE 6 TABLE 8 
Factor F: GUILT FEELINGS Factor H: FEELINGS OF UNREALITY 
na a 
F: Factor 
Tea joading 
loading — 
+ p Il 
Number Item content p 1I» Number Item content a ee 
i 2 o7 
35 Loses things —45 126 Claims he is being controlled — 15 a 
77 Talks about his unworthiness -77 —49 135 Says things have special meanings 
78 Mixes up words -45 142 Says that all or part of himself is 
140 Says he is no good =e outside his body, or that someone gi 4 
A itati is in his body j 
IMPS Depressive Agitation -41 
IMPS Grandiose Expansiveness 


ii i factor analysis. 
a Percentage of variance, 1.9; first . 
ennn of variance, 1.6; second factor analysis. 


Factors F and G of the WBRS, i.e., depressed 
mood, guilt and remorse, ideas of sinfulness, 
and recurring thoughts. 

The Grandiose Expansiveness syndrome 
had a significant loading on WBRS Factor H, 
Feelings of Unreality. Both factors de- 
scribe delusions of unusual power or an out- 
side force controlling or directing one’s 
behavior. 

The IMPS Apathy and Retardation syn- 
drome is represented on the WBRS by Factor 
I, Slowed Speech and Movements. These 
factors have in common retardation in motor 
behavior and in speech. 

IMPS Paranoid Projections had its highest 
loading (.27) on WBRS Paranoid Projections 
(Factor J). However, in contrast to the 
WBRS items, the IMPS scales tend to de- 
scribe a higher level of psychopathology, e.g., 
ideas of reference, ideas of persecution, and 
ideas of conspiracy. This difference in level 
of psychopathology may account for the 
relatively low loading of IMPS Paranoid Pro- 
jections on the comparable WBRS factor. 


TABLE 7 


Factor G: DEPRESSION 


Factor 
loading 
Number Item content Is IIb 
6 Looks sad 76 77 
138 Speaks in a sad voice 72 80 
147 Says he feels blue 67 
IMPS Depressive Agitation 40 


ariance, 2.7; first factor analysis. _ 
b Teenage of variance; 2.8; second factor analysis. 
e 


first factor analysis. | 
second factor analysts: 


a Percentage of variance, 2. 
b Percentage of variance, 2 


: c 
The final WBRS factor, Excitement (Fa 


: , pxcite- 
tor K), is rationally related to IMPS De 
ment. Both contain items sampling the 


talkativeness and braggartism. Howèven 9 
three WBRS items in the Excitement J sis 
were not included in the first factor an 
containing the IMPS syndrome score® en 
sequently, the loading of IMPS Excite 
on the comparable WBRS factor '$ 
known. ‘ o ap’ 
The only IMPS syndrome having ae js 
parent counterpart in the WBRS fac con” 
Perceptual Distortion. This syndrome 15 
prised of scales which describe ee 
auditory hallucinations. Comparable ite re & 
included in the WBRS but most WO ing 
cluded from the factor analysis ¢°? 
the psychotic syndrome scores because 
occurred with less than 10% frequenc havior 
In sum, despite differences 3" jan el’ 
sampled (The IMPS samples patien pase? 
view behavior whereas the WBRS ia ; 
on patient ward behavior) and deve " ati? 
chiatric sophistication of individua 


TABLE 9 


Factor I: SLOWED SPEECH 
AND MOVEMENTS 


= rac 
Ending 
= 
Number Item content a m 
59 
67 Is slow in his movements | w 
112 Voice is flat and monotonous yer 
ks i 7, n out man 33 
116 Speaks in slow, drawn # 
IMPS Apathy and Retardation ee 


ssis- 
analys’ 
a Percentage of variance, 2.7; first factor 
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TABLE 10 
Factor J: PARANOID PROJECTIONS 
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TABLE 11 
Factor K: EXCITEMENT 


Factor 
x loading 
Numbe; 
mber Item content r Ib 
85 
<3 Says people hate him 70 
Be Says others want to hurt him 66 
Says people dislike him 83 76 
IMPS Paranoid Projections 27 
` Percent: 


age of variance, 2.5; first factor analysis. 


bp 
ercentage i i ; 
ge of variance, 1.4; second factor analysis. 


th 
li and WBRS, there is good cor- 
Bila between the IMPS psychotic 
ing dupses and the WBRS factors. In vary- 
ave ei es, 8 of the 10 psychotic syndromes 
identific nterparts in the WBRS factors. The 
Syndrom on of correlates of the psychotic 
S ine es in patient ward behavior clarifies 
nds To of these syndromes and also 
Classify; Bpert to their use as a schema for 
Practical g psychiatric patients. Further, the 
ange value of psychotic syndromes as 
ining i easures may be enhanced by com- 
compar, bie psychotic syndrome scores with 
Finall able factor scores from the WBRS. 
there Y, the study has demonstrated that 
olo. are meaningful factors of psychopa- 
findine in patient ward behavior. The latter 
Wrpe Suggests that current use of the 
an ee obtain a total morbidity score is 
Conomical use of this instrument. 


Factor 

loading 
Number Item content Ts 
82 Brags or boasts 46 
108 Talks and talks 62 
111 Is constantly moving about 54 


a Percentage of variance, 2.4; second factor analysis. 


Utilization of separate WBRS factors as 
change measures may provide more effective 
means of discriminating the effects of various 
treatment procedures. The separate WBRS 
factor scores may also be useful for describing 
or categorizing nuances of patient ward be- 
havior not possible with a single morbidity 


measure. 
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INTERPERSONAL ORIENTATION IN RELATION 
TO HYPNOTIC SUSCEPTIBILITY’ 


P. M. BENTLER 
Stanford University 


2 samples of women (N=37, N=47) described themselves on the Titer 
personal Check List (ICL) and were hypnotized according to the — 
scales, Forms A and C. One sample of 43 men described themselves on = 
ICL; these men were hypnotized according to procedures of Form A. T a 
Docile-Dependent, Cooperative-Overconventional, and Responsible-Hypernorma 
octants showed the highest correlations with hypnotizability. ICL scores 
based on factor analyses showed that a “positive” and affiliative interpersonal 
orientation correlated positively with hypnotizability, while the “negative 
orientation factor and the dominance-submission factor were unrelated to 
hypnotizability. A comparison of correlations of hypnotic measures with ICL 
scores indicated that the correlations which approached or reached significance 
for the combined female samples were almost uniformally smaller for Form C 


than for Form A. 


Most researchers in the field of hypnosis 
feel that the personalities of highly sus- 
ceptible hypnotic subjects differ from the 
personalities of refractory subjects, yet the 
nature of this difference eludes precise 
quantitative measurement. Hilgard and Lauer 
(1962), for example, found none of the 18 
California Psychological Inventory (CPI) 
(Gough, 1957) scales to yield correlations 
with hypnotizability which replicated signifi- 
cantly across sexes. In samples with more 
than 100 subjects per sex, only three out of 
36 correlations were higher than .20, and the 
highest was .26. An item analysis was not 
much more successful. Only 33 items for 
males and 36 items for females out of the 
480 items of the CPI discriminated the upper 
half from the lower half of hypnotic subjects. 
Scales constructed of these items correlated in 
the .60s on the sample from which the data 
came, but on cross validation these correla- 
tions fell to less than .10. Weitzenhoffer 


1From the Laboratory of Human Development. 
This investigation was supported in part by a Public 
Health Service fellowship number MPM-1s, 840 
from the National Institute of Mental Health, 
United States Public Health Service. The hypnotic 
scores were obtained within the investigations 
supported by a grant from the National Institute of 
Mental Health (Grant M-3859) to Ernest R. Hil- 
gard. I am indebted to Nancy Wiggins, Kenneth 
M. Colby, Ernest R. Hilgard, Perry London, and 
André M. Weitzenhoffer for their constructive criti- 


cisms of the manuscript. 


ed the 
Survey 
jonnaite 
00 sub- 
correla” 


sults are 


and Weitzenhoffer (1958) administer 
Guilford-Zimmerman Temperament 
(1949) and the 16 P.F. ga 
(Cattell, Saunders, & Stice, 1953) to 

jects. The highest of 24 obtained 
tions was .18. When significant Te t mio 
reported, the prognosis is generally n results 
more favorable. Replicating significant 5 
is quite difficult. Furneaux and Gibson a 
and Lang and Lazovik (1962) have, egati? 
ample, reported data indicating 4 sonality 
correlation between the Maudsley 59) 
Inventory (Bendig, 1959; Eysen¢ ibility 
measure of neuroticism and suscep on thé 
hypnosis. Hilgard and Bentler (16h with 
other hand, found a correlation Ot + 


143 subjects. As itself 

The study reported here abe) «pas 
with the personality characteristics (1950); 
trust,” as described by Erikson ones? 
Trust is essentially an attitude towar P se 
and the world, and it is basic 1" ed not Z 
that it appears early in life and ically F 
conscious. This variable is theore nosis an 
lated to hypnotizability since hyP erent 


volves relinquishing control to ang etd ve 
In his developmental-interactive : that id 
hypnosis, Hilgard (1962) points pes: Oe 
love relationship between paren "i dev op 
is an antecedent which may jlity m 
mentally related to hypnotizabl oi f 

basic trust. Furthermore, this tY! 
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has implications for the transference aspects 
of hypnosis, 
Basic trust, of course, cannot be measured 
easily. Only superficial manifestations of this 
trait could be assessed. Barber (1960) at- 
mpted to construct a questionnaire measure 
z this trait from the item pool of the 
uilford-Zimmerman Temperament Survey 
(1949) and the Webster-Sanford-Freedman 
ASS) F Scale. This scale differentiated the 
A Tefractory subject out of 20 hypnotizable 
a Ra but apparently it has not held up 
-V; i 
1962), ross-validation (Barber & Glass, 
oe trust may be related to hypnosis by 
= ie that trust established early in life 
inter Sidues which are reflected in a person’s 
Re Sa S orientation. Recent factor analy- 
ee ormed at Stanford (Wiggins, 1961) 
Leary ee instrument indicates that the 
czek i terpersonal Check List (LaForge & 
appear, 955) contains a “Love” factor which 
frien ath to reflect affiliation, approval seeking, 
consider a> and sociableness which may be 
tion é eda Positive” interpersonal orienta- 
eerie with basic trust.’ Since the 
Ypnoti we is theoretically linked to 
chosen Zability, the existing instrument was 
the e a first approximation in evaluating 

mosis. Tee of this type of orientation for 
Next Step f the results appear promising a 
Measure Would be to construct an instrument 
E this factor more adequately. 


nte METHOD 
bersona? Check List 


e 
ating Interpersonal Check List (ICL) is a self- 
Prge an Jective-type check list developed by La- 
Suczek for inclusion in the Leary (1957) 
but Measure Diagnosis of Personality. The ICL 
tig Leary, More than one “level” of personality, 
ai Ben Level II, the level of conscious descrip- 
Bg, “rally used in conjunction with the ICL. 
tep. fants, o, CONCerned with that level. There are 
On "ented 7 Variables, in the ICL, each of which is 
tati; the basi Y 16 items. The items were selected 
Rene 8S ang S of their rational content, intensity 
Stall i endorsement frequency. The octants are 
entified by two alphabetical letters and 

Wiee: 

of Bei 
the a Nancy, & Bentler, P, M. The structure 
> 1962 Personal Check List. Unpublished manu- 


427 


are descriptively summarized as follows: AP = 
managerial-autocratic; BC = competitive-narcissistic; 
DE = aggressive-sadistic; FG = rebellious-distrustful ; 
B= self-effacing-masochistic; JK = docile-depend- 
ent; LM = cooperative-overconventional; NO = re- 
sponsible-hypernormal. A person checking some of 
the items in each octant is considered normal, 
while a person checking most or all of the items 
in an octant has unhealthy amounts of the trait, 
For example, being managerial in orientation is fine, 
but having too much of this trait shows undesirable 
tendencies to being autocratic. 

Although the ICL is usually described in terms 
of two bipolar dimensions named Love-Hate and 
Dominance-Submission, recent factor analyses by 
Wiggins (1961) indicate that the raw scores are 
better described by three factors. Wiggins (1961) 
performed a Thurstone centroid method of factor 
extraction and rotated to visual criteria of simple 
structure. Although the factor structure differed 
slightly for the sexes, three orthogonal factors 
emerged: Love, Hate, and a bipolar Dominance- 
Submission factor. A principal axes method with 
Varimax rotation was used by Wiggins and Bentler 
on the same data, and three orthogonal factors were 
found again.? These three factors will be described 
briefly since they are used in this study, Factor I 
is Leary’s dominance-submission bipolar factor. 
Using a loading of .32 to define a significant factor 
loading, octants AP, BC, and DE loaded positively 
and HI loaded negatively on this factor for both 
sexes. The highest positive loading was on the 
Managerial-Autocratic octant for both sexes. Factor 
II represents a positive interpersonal orientation and 
has been called Love. It loads significantly on octants 
HI, JK, LM, and NO for both sexes, The highest 
loading is on the Cooperative-Overconventional 
octant for both sexes. Factor III represents a 
negative interpersonal orientation for both sexes; it 
loads highest on the “hate” side of the Leary circle, 
though the significant loadings are not identical for 
the sexes. For females, Factor III loads on DE, 
FG, HI, and JK and for males on BC, DE, FG, and 
HI. For both sexes, however, the highest loading 
is on the Rebellious-Distrustful octant. 


Factor Measurement and Hypotheses 


As was pointed out in the introduction, a Positive 
interpersonal orientation is hypothesized to be re- 
lated to hypnotic susceptibility. The ICL octants 
comprising the trusting, cooperative orientation 
should correlate highest with hypnotizability (HI, 
JK, LM, NO) and Factor II scores should correlate 
positively with hypnotizability. Factor scores were 
arrived at for each person by summing scores on 
octants on which the loadings for a factor were 
over .32. Octants were given equal positive or nega- 
tive weight, depending on the sign of the loading, 
but irrespective of the size of the loading as long 
as it was over .32. Thus, subjects’ scores on Factor IT 
were computed by summing their item endorsements 
of octants HI, JK, LM, and NO. 
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TABLE 1 


CORRELATIONS BETWEEN HYPNOTIZABILITY AND ICL SELF AND Factor SCORES 


Form A Form C 
= F II (F) 
Sample (i I(E) IKF) IHI) IOD 1E) mr) IH 
ICL Octant . nis 
AP Managerial-Autocratic —.04 23 Ww —.04 —.07 5 ‘as 
BC Competitive-Narcissistic —.24 Pu | —.03 A2 —.20 dà =i 
DE Aggressive-Sadistic 12 —.14 =13 —.02 13 se ge 
FG Rebellious-Distrustful —01 =.19 - —.08 14 Eo ia 
HI Self-effacing-Masochistic -08 13 10 OF = i E 
JK Docile-Dependent AS 25 22* 23 25 3 ‘gge 
LM Cooperative-Overconventional 40 30* 340 24 390+% ue Bt 
NO Responsible-Hypernormal 26 328 24* 20 22 22 
ICL Factors —.01 
I. AP+BC+DE-HI —.19 .03 —.06 =.10 —.20 10 per 
I. HI+JK+LM+NO 31 -32* FS baaa 24 a5" 25 ‘o7 
III. Women: DE+FG+HI+JK 05 08 .06 16 OL 
III. Men: BC+DE+FG+HI 03 
*p <.05. 
ep <.02. 
sp <01. 
Factor III, which represents a negative inter- Subjects and Procedure? 
personal orientation, might be negatively related to Š described 
hypnotizability. The method of computing factor For purposes of deriving the factor ICL 
scores, however, precludes this being a reasonable above, Wiggins (1961) administered ctory PSY" 
hypothesis. An inspection of the factor loadings to 235 men and 154 women in introdu rate 


reveals that for women Factor III has loadings on 
the submission and dependent quadrants HI and 
JK; since these scores enter as equally into the 
factor score as aggressive or rebellious type items, 
the negative correlation which could otherwise be 
expected would be minimized. This overlap in fact 
produces a good-sized correlation between Factor IT 
and III scores in this sample. In two samples of 
women to be described this correlation was .51 and 
-58, respectively. For a sample of men, the Factor 
II and III scores correlated .22. This correlation is 
lower since only one octant is common to both 
factors. Since both a positive (II) and a negative 
(ITI) correlation with a criterion is not to be ex- 
pected between variables which are themselves posi- 
tively correlated, the net expected effect would be to 
attenuate the Factor III correlation with hypnotiza- 
bility. 

Factor I, the dominance-submission factor, could 
be linked to hypnotizability by considering suscepti- 
ble subjects as tending to “submit” to the hypnotist 
while unsusceptible subjects tend to do battle with 
the authority represented by the hypnotist. White 
(1937) has advanced the hypothesis that active 
hypnotic subjects are judged higher in the trait of 
deference, and he has presented evidence to sub- 
stantiate this claim. These are the type of subjects 
n at universities (Lang & Lazovik, 1962). 


see A 
generally Factor I should correlate negatively 


From this view, F 
with hypnotizability. 


chology classes under standard instructions sample? 
themselves on the 128 items. Three separate ductor 
of subjects were drawn from the same 19 ; e pro 
classes and were hypnotized according to coptibilitY 
cedures of the Stanford Hypnotic re G 
Scale, Form A (Weitzenhoffer & Hilgard 
Two of the samples consisted of mO iia 
N=47), and one of men (N=43)- i Form 
were hypnotized a second time according & Hig? 
of the Stanford Scale (Weitzenhoffer items SU 
1962). This scale contains more cognitive } ination 
as a dream, an age reggression, and i oluntee™ 
These subjects were all “coerced wai the? 
(Boucher & Hilgard, 1962), and, ane rel 
hypnosis scores were in accord with d itze! 
norms for the Stanford Scales i d 
hoffer, Landes, & Moore, 1961; Wei 
Hilgard, 1962). 


oP 
us CO 
3 The author is grateful for the genet? 


tion of various persons. While the entire 
assistants of the Laboratory of Human part % ould 
carried out the hypnotic sessions aS kion Daon 
regular research program, specific E ae 5 
be made of Nancy Wiggins and alee cial val 
who made the ICL scores available. p e stat! 
go to Forrest Young for helping wit 

analyses. 
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RESULTS 


The correlations between hypnotic sus- 
Ceptibility and the various ICL measures 
ate reported in Table 1. As can be seen the 
octants reflecting affiliation, approval seek- 
ing, and a generally conventionally accepting 
attitude correlate highest with hypnosis. For 
the two samples the Cooperative-Overconven- 
tional octant LM exhibits the highest 
Correlations with hypnotizability in these 
‘amples. These correlations are the only ones 
that replicate significantly for two samples 
e parately (r = .40; r= .30). None of the 
aie ations in the male sample is statisti- 
ideri significant, though their pattern is 
tion ical to the two female samples’ correla- 
the « The octant correlations are about 
fee for Form A as for Form C in the 
which Samples. However, the correlations 

> approach or reach significance for 
i bined samples are almost uniformly 

aller for Form C than Form A. 

Hs the three factors, only Factor II correla- 
cant g PProach significance. It is not signifi- 
Foirm Fle (r = .24), is significant for 
and is in the first female sample (r= 85), 
emal Significant for Form A in the second 
sample Sample (z = .32). For the two female 
Signin Combined, the correlation is highly 
> icant for both Forms A and C. 


S 


A DISCUSSION 


lange Ouh the obtained correlations are not 
The Ai ey do replicate over several samples. 
tie of the correlation between 
Orie: tati, ility and a positive interpersonal 
meth; On (Factor IT) suggests that here is 
dicts, "8 Upon which to build. No contra- 
literatu results have yet appeared in the 
th ok, €. More importantly, the results make 
entalis = Sense in light of the develop- 
Hilga, nteractive propositions outlined by 

td (1962) 4 

4 

i in press, 
Peso ne s Caer sented to pots the 
nication They were unsuccesful (personal 
There were a variety of differences 
© replication and the procedures of the 
Y- The hypnotic scales, the ICL format, 
instructions and prehypnotic experi- 
© experimenters were some of these. 
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In a recent study (Lang & Lazovik, 1962) 
the Edwards Personal Preference Schedule 
(EPPS) (1953) was administered to 32 sub- 
jects. The scale correlating most highly 
with hypnotizability was the Affiliation Scale 
(r = .36). This is clearly in line with the 
present results. The only other EPPS scale 
showing a positive (nonsignificant) correla- 
tion with hypnotizability was the Deference 
Scale (r= .27). This result, plus the fact 
that the Deference Scale and the ICL 
Docile-Dependent octant correlate positively 
(Cairns & Lewis, 1962), would indicate that 
ICL dominance should correlate negatively 
with hypnotizability. As seen in Table 1, the 
correlations obtained in this study were not 
significant. That there is no consistent rela- 
tion between dominance and hypnotizability 
is further supported by Hilgard and Lauer’s 
(1962) CPI data. These authors reported a 
correlation of —.11 in a sample of 110 males 
and +.23 in a sample of 106 females. 

In a study with 87 female subjects, Schul- 
man and London (1963) found that more 
susceptible subjects scored significantly lower 
on the Pd scale of the MMPI, indicating that 
these subjects may be less aggressive, more 
compliant, and less negative about relations 
with authority figures than are others. This 
interpretation of the Pd scale results would 
fit in well with the findings of this study, 
since the Docile-Dependent, Cooperative- 
Overconventional, and Responsible-Hyper- 
normal octants of the ICL yielded the highest 
correlations with hypnosis. 

An important next step in further investi- 
gating the relationship of interpersonal trust 
to hypnosis would involve the construction of 
an item pool which emphasizes the ICL 
octant items which correlate most highly 
with hypnosis—JK, LM, and NO. Items such 
as “appreciative,” “cooperative,” “consider- 
ate,” “accepts advice readily,” “sociable and 


It is difficult to specify which of these differences 
could account for the differing results; that method 
variance has as great effects as it does indicates that 
the “true” correlations between ICL and hypnotic 
responsiveness are probably quite small. However, 
since the date of Lang and Lazovik and Schulman 
and London seem consistent with the present results, 
the construction of an item pool as described would 
still seem to be in order. 
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neighborly,” “likes to be taken care of,” 
“agrees with everyone,’ and “loves every- 
one” appear in these octants. These items 
enter into Factor II. It should be pointed out 
that although items from the Self-effacing— 
Masochistic quadrant HI enter into Factor II, 
HI correlations with hypnosis are quite low. 
Thus, items of the type, “apologetic,” “can 
be obedient,” “shy,” “meek,” or “spineless” 
should not enter into such a rationally formed 
item pool. It may be pointed out here, in 
fact, that the correlations of the sum of the 
octants JK, LM, and NO with hypnotiza- 
bility are uniformly higher than the correla- 
tions of Factor II which includes the octant 
HI. For Form A, for example, the correlations 
in the three samples are .36, .33, and .27 as 
compared to .31, .32, and .24. The item pool 
should also contain items from the EPPS 
Affiliation Scale. It should be pointed out that 
items of the two octants correlating most 
highly with hypnosis scores are, on the aver- 
age, more socially desirable than items of the 
other octants (Bentler, in press). This char- 
acteristic of the items should be taken into 
account in the construction of an item pool. 
Although Form C has been As’ (1962) 
preferred scale for correlating hypnotizability 
and subjective experiences presumably related 
to hypnosis, the results of this study indi- 
cate that Form A may yield as high or 
higher correlations with personality measures 
as Form C. The choice of scale for this pur- 
pose thus appears to be somewhat arbitrary, 
though it may well be that finer discrimina- 
tions may be possible in the future with 
Form C since it taps a greater variety of 
hypnotic experiences. It is quite possible, 
for example, that a person may be able to 
experience some cognitive alterations without 
experiencing others; the extent to which these 
discriminations are important to make would 
determine the choice of scale. 
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RESPONSE SET PARALYSIS: 
IMPLICATIONS FOR MEASUREMENT AND CONTROL? 


JEROME SILLER asp ABRAM CHIPMAN 


Engineering Research Division, New York University 


Data on the relationships between 


7 response set measures and 42 personality 


and other variables are reported. 284 college students were administered a 574- 
item battery, including MMPI-type and Adjective Check List scales, measures 
of acquiescence, social desirability, and deviance sets, and miscellaneous vari- 
ables. Analyses of intercorrelations reveal response sets are mostly idiosyn- 
cratic to the measure used. These scales, furthermore, account for a small 
portion of the variance in the personality instruments. An orthogonal factor 
analysis of the total battery reveals only acquiescence as an independent 
response set factor. Questions are raised about the utility of burdening test 
batteries with a mass of response controls. 


A series of response set measures was inter- 
posed within a large self-report test battery 
as part of a study of attitudes toward the 
physically handicapped (Siller, 1962). It is 
believed that the resulting data have a bear- 
ing on two major issues of technical concern 
in the use of self-report personality question- 
naires: (a) What is the relationship between 
the set to check “true” to items (ie. 
acquiescence) and the set to check the socially 
desirable alternative? (b) In view of the re- 
peated findings of substantial correlations be- 
tween response sets and personality scales, 
what is the justification for continued content 
interpretations of such inventories? 

Factor analyses of the MMPI, on which 
most work has been done, generally find two 
basic factors to which similar labels are 
attached (Kassebaum, Couch, & Slater, 1959; 
Eichman, 1961; Finney, 1961; Welsh, 1956). 
One is defined in terms of general psy- 
chopathology versus ego strength. The sec- 
ond appears to be akin to introversion- 
extroversion. Edwards interprets the first fac- 
tor as one of social desirability and the sec- 
ond as acquiescence (Edwards, Diers, & 


1 This investigation was supported, in part, by a 
research grant from the Vocational Rehabilitation 
Administration, Department of Health, Education, 
and Welfare, Washington, D. C. We wish to 
acknowledge our indebtedness to Jacob Cohen for 
is contribution to the statistical analysis. A brief 
rele of this paper was presented at the 1962 
es Be of the American Psychological Association. 


Walker, 1962; Edwards & Heathers, we 
Messick and Jackson (1961) believe sai 
“acquiescence, as moderated by item “ity 
ability, plays a dominant role in mange 
inventories like the MMPI.” However; | able 
are those who argue that socially eco 
responding may relate to the actual @ n 
of psychopathology and anxiety (Coni a 
Marlowe, 1960), and that acquiescence iiye 
be part of a broader syndrome not ath 
introversion-extroversion (Couch & Ken 
1960). 

Specific purposes of this study a 
cuss the relationship between seve? 
set measures on a correlational and 
level, and to determine their correlatio: 
various personality scales. A final pu" sa 
to determine the factorial conseque" itn 
merging adjective check list scales 
MMPI-type scales. 


re to dis 
response 
factoria 
ns wil! 
ose } 
P ae 


METHOD , stored 
ister 
A 574-item questionnaire was group ye unity 
to 284 students from municipal and sexes jak 
colleges in the New York City area. The ether. T 
fairly evenly divided and are treated 1a most! 2 
population was almost entirely white an este 
middle socioeconomic status. 
during a regular session of their cl F 
informed of the nature of the study, AA the bie 
the way a person feels about himself no 
he feels about the physically disabled, 
defensive test atmosphere was promote?- 
responded anonymously. on 
The selection of a test battery was based per 
considerations: the inclusion of a variety 


tw 
op" 
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ality and stylistic measures, and available classroom 
time. In selecting response set scales, the criteria of 
frequency of use and brevity of administration were 
followed. Thus, the Edwards Social Desirability 
Scale and an overall agreement score for measuring 
acquiescence are the ones most frequently referred 
to and are included. The choice of other response- 
Set measures was importantly determined by the 
number of new items that would be required in our 
Saat Inasmuch as the primary focus of the study 
of ee Personality factors within the normal range 
ši unctioning, the major content instrument em- 

zel was the Gough Adjective Check List (ACL). 
nclusion of the MMPI was deemed inappropriate 
ie ed of the absence of marked clinical pathology 
ce © population tested. As markers for MMPI- 
Faek variables, however, several scales were retained, 
Bi as Welsh’s A and R which have been found 
Maree heavily on the two principal factors of the 
Pate (Welsh, 1956). Demographic variables and 

eral measures of attitudes to the disabled brought 


th 
© total number of variables to 49, enumerated 
as follows: 


D pale aT : 

“mographic and Disability Variables 

1. Age, 

2. Sex, 
lst op Sial Distance Scale (Siller, 1962). This is a 
ot six possible relationships varying in intimacy. 


e subject must check those to which he would 
admit disabled persons, 


cheap ling Check List (Siller, 1962). The subject 
S those of seven listed reactions to the dis- 


abl i 
Fa Which apply to his feelings, eg., repelled, 
g athetic, indifferent, 
(Ap ititudes toward Disabled Persons Scale 
) (Yuker et al., 1960). 


Scale ee Familiarity (Siller, 1962). A 3-point 
ändicappeg of the subject's contact with the 


Re 
‘Ponse Set M easures 


1, 
(Cront@#owe-Crowne Social Desirability (MC-SD) 
© & Marlowe, 1960). 


los» wards Social Desirability (E-SD) (Edwards, 
). 


3, 
ARG) *Uh-Keniston Agreement Response Set (CK- 
4, Ry ouch & Keniston, 1960). 
3 Gren Acquiescence (Fulkerson, 1958). 
D. The tall Agreement Score for Adjectives (OAS 
the aq, Umber of “true” responses out of 300 in 
SUbjec Jective Check List. In our instructions, the 
*DDlieg re to check “true” those adjectives which 
NOt chars, Aim, and check “false” those that were 
7 aracteristic, 
RON Thea Agreement Score for Statements (OAS 
MPI number of “true” responses out of 197 
?. Gae statements. 
(Grigg ha & Thorpe Adjective Check List Deviance 
Orpe, 1960). 


MMPI-type Scales 


1. Welsh Anxiety (A) (Welsh, 1956). 
. Welsh Repression (R) (Welsh, 1956). 
3. Barron Ego Strength (Barron, 1953). 


ey 


6. Couch-Keniston Low F (Couch and Keniston, 
1960). 


Adjective Check List Scales 


1. Zuckerman Affect Adjective 
(AACL) (Zuckerman, 1960). 
2. Berdie Femininity (Berdie, 1959). 
Scales 3 through 16 are from Gough, 1955.2 
. Rigidity. 
. Positive Character Integration. 
. Self-Insight. 
. Self-Acceptance. 
. Flexibility. 
. Role Playing. 
- Social Poise and Presence. 
10. Potential Success. 
11. Originality. 
12. Self-Criticism. 
13. Good Judgment. 
14. Personal Adjustment. 
15. Likeability. 
16. Responsibility. 
Scales 17 through 30 are from Heilbrun, 1959, 
17. Aggression. 
18. Heterosexuality. 
19. Dominance. 
20. Exhibition. 
21. Achievement. 
22. Deference. 
23. Nurturance. 
24. Change. 
25. Endurance. 
26. Abasement. 
27. Intraception. 
28. Succorance. 
29. Autonomy. 
30. Affiliation. 


The instructions here employed for the ACL dif- 
fered from those usually given. Subjects were 
instructed not merely to check only those adjectives 
which applied to them but to check every item as 
being true or false of them. The present method 
permits both a direct acquiescence index, based on 
the number of true’s on the ACL, and, in our 
opinion, a more effective control of acquiescence 
within each scale than the standard score conversion 
for acquiescence recommended by Heilbrun (1959). 
Our major criticism of Heilbrun’s standard con- 


Check List 


auone w 


o 


2A revised version of the ACL with several dif- 
ferent scales and changed scoring is soon to appear, 
The scales employed in this study consist of those 
mentioned in Gough (1955) and some sent by him 
to us since then. 
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TABLE 1 
INTERCORRELATIONS OF RESPONSE SET WITH PERSONALITY AND OTHER VARIABLES 
(N = 284) 
Edw.- MC- Fulk. CK- OAS OAS ACL 
sD sD Acq. ARS I II Dev. 
Edwards SD 
Marlowe-Crowne SD 38 
Fulkerson Acquiescence —43 —31 
Couch-Keniston ARS —39 —14 46 
Overall Agreement Score I —25 —08 39 43 
Overall Agreement Score IT -75 —27 67 57 42 
Grigg-Thorpe ACL Deviance —54 —32 27 32 30 44 
Ae 16 03 —06 —20 —03 —19 —08 
ee —06 03 4 -12  -0o7 -08 —0l =03 
Social Distance Scale 06 00 —04 11 04 —02 03 
Feeling Check List 03 17 —01 —05 04 —03 00 
ATDP 15 09-07 -16 -10 -13 -12 
Disability Familiarity 00 —03 08 06 10 07 —02 
Barron Ego Strength 67 15 —35 —26 —20 =57 —33 
Welsh Anxiety —88 =45 50 43 22 79 51 
Welsh Repression 14 05 —48 —35 —40 53 —09 
Zaks-Walters Hostility ` 63 ~29 55 53 30 73 4l 
Maslow Security-Insecutiry 70 37 —23 —22 —12 —49 —48 
Zuckerman AACL -67 -37 27 23 26 48 60 
Couch-Keniston Low F —22 —14 12 13 15 19 18 
Berdie Femininity ~18 —08 08 13 29 20 11 
ACL Rigidity 00 08 —12 —20 —12 —09 —10 
ACL Positive Character 
Integration 4 32  =3 -42 -47 -44 96 
ACL Self Insight 43 25 =23 -22 —19 —38 =59 
ACL Self Acceptance 42 31 —06 —06 22 —20 oh 
ACL Flexibility 09 03 02 06 13 00 —09 
ACL Role Playing 25 2 =i = i -31 
ACL Social Poise and Presence 50 34 —26 —26 —31 —40 —55 
ACL Potential Success 42 2%  =35 —-44 —s4 -47 —44 
ACL Originality 08 04 21 36 4 —29 -07 
ACL Self Criticism = =F 49 B 60 56 15 
ACL Good Judgment 38 5 -27 -30 g g 33 
ACL Personal Adjustment 51 37 —25 —28 = —30 =e 
ACL Likeability 49 40 E E =15 —36 = 68 
ACL Responsibility 36 29 30 38 27 a 
ACL Aggression —26 —33 26 19 37 25 51 
ACL Heterosexuality 21 10 07 10 22 —04 80 
ACL Dominance 39 15 00 —05 11 —25 =37 
ACL Exhibition 10 04 15 15 29 04 05 
ACL Achievement 45 27 —05 —18 16 —30 = 
ACL Deference —02 14 23 12 35 10 2i 
ACL Nurturance 44 39 —29 =25 —38 —35 =i 
ACL Change —08 —10 21 14 14 16 ag 
ACL Endurance 50 37 —26 +33 —10 —43 "95 
ACE Abusement —40 —13 01 10 —12 23 A 
ACL Intraception 50 32 23 27 12 38 E 
ACL Succorance -58  —31 25 26 30 5 29 
—06 —16 24 13 44 18 a 


ACL Autonomy 
ACL Affiliation J 
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TABLE 2 
QUARTIMAX-ROTATED CENTROID Factor Loaprncs 
(N = 284) 
I Ib II IV h? 
Adjective Check List: Likability 86 —09 13 10 78 
Grigg & Thorpe: ACL Deviance —84 04 —02 —03 70 
ACL: Personal Adjustment 83 —10 10 12 73 
ACL: Self Criticism -81 15 32 =13 79 
ACL: Nurturance 80 —27 01 39 87 
ACL: Intraception 79 —07 15 —04 65 
ACL: Social Poise and Presence 77 —08 —12 03 61 
ACL: Endurance 76 —02 07 —40 74 
Zuckerman: Affect Adjective Check 
List —75 -17 19 —05 63 
ACL: Self Acceptance 73 17 #2 —07 75 
ACL: Succorance —71 —28 18 01 ol 
ACL: Self Insight 70 —08 02 —18 53 
Edwards: Social Desirability 70 32 —39 —18 77 
ACL: Affiliation g 70 —07 21 45 74 
Maslow: Security-Insecurity 
Inventory g 39 —17 (o) 64 
Welsh: Anxiety —38 44 19 83 
ACL: Responsibility -42 00 —27 67 
ACL: Achievement 40 23 —26 66 
ACL: Potential Success —23 —40 =14 51 
Zaks-Walters: Hostility —03 43 12 47 
Marlowe-Crowne: Social Desirability 01 —12 04 24 
3arron: Ego Strength 24 —37 —13 43 
ACL: Role Playing —24 —34 23 36 
ouch-Keniston : Low F —01 11 08 06 
CL: Abasement —80 05 04 73 
ACL: Exhibition 77 08 26 67 
ACL: Autonomy 75 11 —11 69 
ACL: Aggression 62 06 -13 76 
ACL: Dominance 56 10 -1 52 
ACL: Change 50 15 40 44 
ACL Heterosexuality 40 28 39 49 
asteement Score: ACL-type items 33 67 —14 63 
percement Score: MMPI-type items —09 65 19 76 
Wikerson: Acquiescence 13 56 05 46 
elsh: Repression —30 —52 —01 36 
DCL: Rigidity —49 -04  —51 52 
pa te: Femininity (ACL) —01 25 39 24 
‘ttitudes Toward Disabled Persons 11 —12 30 14 
cling Check List 19 —03 25 1 
cial Distance Sea 06 —01 21 
Sa i 4.64 3.18 2.01 22.34 
œ~ centage of Variability 20.76 14.23 9.01 100.00 
Vers; 
te ath technique is the fact that only four categories centroid method, using the „highest coefficient per 
s Shest o * of words checked are provided, the column as the communality estimate, without 
are K Which covers the extremely heterogeneous iteration. The factors were rotated by the quartimax 
i tough Se of 122-300. Raw scores were used analytic rotational procedure (Harmon, 1960). All 
nee dire Ut our study, and our data are therefore computations were performed on the IBM 650 
v Heip comparable to what would be given electronic computer. The capacity of the computing 
of iata method. program available for centroid extraction is 40 vari- 
Correlati? Pearson product-moment coefficients ables. Nine of the 49 variables judged least im- 


n was computed and factored by the 


portant were omitted from the factor analysis, The 
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Couch-Keniston ARS was one of those dropped be- 
cause a similar, larger scale (OAS II) was in the 
matrix and because the pattern of correlations for 
ARS was virtually identical to that of the Fulkerson 
scale. Although separate analyses were not run for 
each sex, the original matrix included sex as a vari- 
able. The few sex differences found in the inter- 
correlations were similar to those found for the 
Berdie Femininity Scale which, as a marker vari- 
able, is retained in the 40 X 40 factor analysis. 

Materials on disability are included in the tables 
but not discussed here as they form a unique 
factor extraneous to this study. 


RESULTS AND DISCUSSION 


We are concerned in this report only with 
the zero-order correlations and the first three 
rotated factors. These factors account for 31, 
12, and 8% of the total variance, and 56, 21, 
and 14% of the common variance, respec- 
tively. Table 1 contains the zero-order cor- 
relations between the response set and other 
measures. Table 2 presents the results of the 
factor analysis. 

In support of similar findings by McGee 
(1962) and Foster (1961), only modest cor- 
relations between acquiescence measures were 
found. The highest correlations were between 
the Fulkerson Scale and the OAS TI (+ .67) and 
between the CK-ARS and the OAS II CEST] 
All other correlations were between .39 and 
:46. The two measures of SD show a low but 
significant relationship (r= + .38). The 
most appreciable correlation is between the 
Edwards SD scale and the OAS II (= 15): 
The last correlation is understandable when 
one notes that 30 of the 39 items on the 
Edwards SD scale are keyed false and that 
the OAS II is heavily loaded on pathology 
acceptance. By consistently denying path- 
ology, one would receive a high Edwards score 
and a low OAS II score. To a considerable 
extent, then, response sets are mostly idiosyn- 
cratic to the measure used. 

While the expected negative relationships 
between acquiescence and social desirability 
occur in our data, it is essential to temper 
our interpretation of this with considerations 
of the relative worth of the different indices 
of each response set. For the the delineation 

of SD as a source of error, Marlowe and 
Crowne felt their own SD scale to improve 
upon major failings of the Edwards. Their 
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argument is that Edwards’ scale requires the 
subject to deny psychopathology at the same 
time he is affirming socially approved be- 
haviors. It is most parsimonious for non- 
psychiatric populations to assume that actual 
absence of pathology may be the causal basis 
for SD scores, even though SD ratings show 
high correlations. Marlowe and Crowne, 1? 
their scale, avoid this confounding effect by 
requiring the subject to base his avowals of 
SD on behaviors which are not merely “nor- 
mal” or “healthy” but up to a standard of 
perfection which is generally improbable 
a mere mortal. The rejoiner of Edwards, 
Diers, and Walker (1962) that the MC-SD 
likely is a lie scale and therefore not com- 
parable with the Edwards as an SD measure 
does not alter the situation, since if the sub- 
ject is not lying, his giving of the SD 1e- 
sponse does not constitute a source of COP- 
tamination of the content scale. Hence, W° 
regard the M-C as the preferable SD 1 
dicator because of this freedom from the 
confounding effect of psychopathology 2” 
because of its additional merit of evenly 
balancing items keyed true (18) and items 
keyed false (15) whereas Edwards’ scale m 
a predominance of keyed false items. AS $ 
the relationships with acquiescence measures, 
the average correlation of the M-C is 7: 
as opposed to a value of — .46 for 
Edwards. 

A similar distinction can be made beter 
the different measures of acquiescence. Ag 
ment scores based on MMPI-type items = e 
subject to the criticism that answering 108" 
consistently implies acceptance of patho nt. 
ical and/or socially undesirable item cor pT: 
Hence it is not surprising that an M alates 
type agreement score (OAS II) a tl 
—.75 with the Edwards scale and — 21 ee- 
the Marlowe-Crowne. The ACL-derived A in 
ment score (OAS I) is less contaminate 
this manner inasmuch as the ACL jon ig 
traits within the normal domain of emai? 
and contains as many apparently rae þe- 
traits as positive ones.* The correlation 


the 


r tof 

3 Objection may be made to this in the na of 

Heilbrun’s finding (1959) of a rho correla and 

-98 between frequency of adjective endorser? bru? 
judged social desirability of the adjectives 
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> this scale and SD are —.25 for Edwards 
oie and — .08 for Marlowe-Crowne. The 
shared Pression of these findings is that the 
desirability Tlance of acquiescence and social 
measure a Pie ee zero, when the purest 
> of both is used. 

actorial ® on zero-order correlations to 
acquiescen, ructure, it is found that the 
ird fact ce measures load principally on the 
Correlation, : bit entirely account for it. The 
Scale ke A etween proportions of items in a 
a factor a true and that scale’s loading on 
actor iG. + .90 for Factor III, + .10 for 
Point we bee — 39 for Factor I. At this 
Son (1961), ude, unlike Messick and Jack- 
‘nimportant Le acquiescence is relatively 
in the ne A or Factors I and II. Evidence 
reinforces ‘en utilizing content scales 
™pact on ie position. SD has its highest 
70 and 4. e first factor, with loadings of 
Crowne 47 for the Edwards and Marlowe- 
factors spectively, The loadings on all the 
Variance ccount for only 24% of the total 
SD mea of the Marlowe-Crowne (the better 
Sure), Hence there is a strong implica- 


tio; 
n that 
to Our SD as such is a minor contributor 
battery, 


lation Shi 


‘bon b between Content Scales and Re- 


Se Sets 


he 

Personality wee zero-order correlation with 
Gy, Sure zA measures for each response set 
one o Edwards SD + .39; Marlowe- 
4; Fulk + .24; Couch-Keniston Acq. 
t Ir if Pee Acq. + .23; OAS I +.27; 

lig t has ae Deviation Score + .43. 
Ve tha, cady been indicated why we be- 


t 

Yes i aeir I is not interpretable in 
the“? consia desirability or acquiescence. 
Mus 84 lain a content interpretation, 
Stigt, sal ng of the ACL Deviance Scale 
Ryne uated. Since ample evidence 


to 
q Othesis gmonstrate that the Deviation 
ati olds even for noncontent ma- 
wa aS th, 
ast 


e mE 

tongi > e tive meaning of this correlation 
tHeepetttes thag dent study (Heilbrun, 1962) 
an ed fop gpt an ACL Achievement scale, not 
Eppg ’,*ctually has better predictive value 
is is ehievement scale that is corrected 
ost irely in accord with the simple 
Or so „People act in terms of their values 

cial aPproval. 
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terial, one might interpret Factor I as a 
tendency to deviate. But why do people 
deviate? It is not feasible to deal with other 
studies of the Deviation Hypothesis; how- 
ever, an answer for the specific measure used 
here is possible. The ACL Deviance Scale 
was validated on college students requiring 
mental hygiene help versus those not requir- 
ing such help. Therefore these groups pre- 
sumably had greater and lesser degrees of ego 
strength, adaptability, anxiety, etc. This in- 
deed is what is found on Factor I. It is con- 
cluded then that a nonresponse-set inter- 
pretation of this factor in terms of Ego 
Strength-Ego Weakness is possible. 

Having identified Factor I as Ego Strength- 
Ego Weakness, Factor III as Acquiescence, 
and Factor IV as Acceptance of the Phys- 
ically Disabled, we are left only with the 
interpretation of Factor II. Factor II loads 
most heavily on variables relating to ascend- 
ance, i.e., Dominance, Exhibition, Aggression, 
and negatively on Abasement. However, a 
high positive loading on Autonomy compli- 
cates interpreting the Factor as introversion- 
extroversion. Recently Factor II in self-report 
questionnaire studies has been interpreted 
instead as Acquiescence. The fact that we have 
only a few of the MMPI scales undoubtedly 
has contributed to these discrepant findings. 
However, the Welsh A and R scales, generally 
considered anchor points for MMPI Factors 
I and II, were included. Inclusion of the ACL 
seems to have provided a new perspective on 
these issues. We find that acquiescence splits 
off, probably from Factor II, and becomes a 
separate third factor. 

Our review of previous literature in this 
area had led us to certain conclusions. The 
main ones were that if response sets were 
ignored, as almost all factor analyses of the 
MMPI have done, the principal issues were 
the interpretation of Factor III and the effect 
of oblique rotations. If a study included 
response sets, the impulse was to think only 
in noncontent terms. The present study once 
again indicates the dangers of overgeneraliz- 
ing to the whole realm of personality structure 
from a relatively restricted range of test data; 
i.e., factors that could be so changed by the 
addition of differently styled scales could 
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hardly be representative of basic personality 
dimensions. 

It has not been our intention to deny that 
measures of response sets can account for 
some of the variance of self-report inventories. 
It certainly is apparent that new scales should 
be balanced for true and false in their keying, 
and that social desirability be given particular 
attention. Nevertheless, it is our contention 
that content does remain the major com- 
ponent, and that the first two factors of self- 
report batteries are not basically interpretable 
in terms of social desirability and acquies- 
cence. We are inclined to feel that the over- 
valuation of these response set factors has 
caused undue professional effort to be 
diverted from the real task of developing 
rich but rigorous measures of personality 
functioning. 
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COMPARATIVE ANALYSIS OF PSYCHOTIC DEPRES- 
SIVES WITH MATCHED NORMALS ON SOME 
UNTIMED VERBAL INTELLIGENCE TESTS * 


SAMUEL GRANICK, 
Philadelphia Psychiatric Center 


50 psychotic depressives, matched to a group of 50 normals for age, sex, race, 
aan religion, and nativity, are compared for performance on WAIS 
niormation and Similarities, and on Thorndike-Gallup Vocabulary. Intergroup 
differences are not significant, but the normals tend to score higher on all 
tests, Pattern of performance is similar for both groups. Weighted informa- 
reap are significantly higher than Similarities scores, at the .01 level, 
a oth the normals and depressives. A precipitate drop in quality of perform- 
. on the Similarities is found in relation to increasing age within the 
groups. The age factor is, thus, found to be more potent than depression in 
producing psychometric deficit, particularly in the area of abstraction and 


flexibility of thought processes. 

Avelin 
data e of systematically collected 
choti A e intellectual functioning of psy- 
ie tag is sparse. Summaries of 
ane his area by Hunt and Cofer 
A a i recently by Payne (1961) 
iS tentative results can be regarded only 
> on sty goa inconclusive since they are 
Ra les, Gen les with very small and limited 
th Aaly, however, the impression 
et ga and thinking processes of 
E remarkab] e not deteriorated or regressed 
ted Ren different from normals when 
k tin iia ange is not a factor and 
kanor aims Ta are supportive. The present 
Ms th © investigate this impression in 
= following questions: (a) Do 
item on i. ves perform as well as nor- 
(Ww N aS the Ng verbal intelligence test 
ang S Th in sler Adult Intelligence Scale 
(6) the Thor mation and Similarities Tests, 
How do halk Gallup Vocabulary Test? 
e two groups compare in their 


atter ate 
ae *elationsha of functioning? (c) What are 


Cpr: 
Or ess 


Sth; an Ips among test performances, 
WR education for each group of 
Saso 


ion get a 

of ted p resholg wet Project entitled “Vigilance, 
Y ental rant My ranil and Depression,” sup- 

panke Health, 3674 of The National Institute 


t 
of atio te q 
ho this k ang help to Alfred S. Friedman for co- 
trig Port. ìn the research and preparation 

an el Shapiro served as the psy- 


© tested all the subjects. 


METHOD 


Subjects 


The psychotic depressive subjects were selected 
from a pool of mental hospital patients carefully 
screened and diagnosed by two experienced psy- 
chiatrists, Normals, matched with the patient 
group for sex, age, religion, race, and nativity, were 
chosen from a larger sample which had been 
recruited through Golden Age Clubs, other types 
of adult social organizations, and two employment 
services. Information elicited from these subjects 
indicated that they had no history of mental dis- 
orders or hospitalizations for emotional conditions. 
All subjects, furthermore, were white and judged 
to be of middle class socioeconomic status. The 
other background variables are summarized in 
Table 1. Equivalence of intellectual capacity was 
assumed for the two groups because of the close 
matching achieved, particularly with respect to age 


and education. 


Procedure 

The WAIS Information and Similarities and the 
Thorndike-Gallup Vocabulary Tests were admin- 
istered to the subjects individually by the same 
female examiner and in the same order as part of 
a much larger battery of cognitive, perceptual, 
psychomotor, and affective reaction tests. Each 
patient was examined shortly after admission to the 
hospital, following diagnostic evaluation by the two 
psychiatrists associated with the research project. 
Medication was withheld for at least a 24-hour 
period so as to eliminate or reduce any possible 
effects which drugs might have on test performance. 


The normal subjects came to the hospital research 
building individually, where they were tested under 
conditions which were fairly identical to those set 
up for the patient group. It was explained to them 


that they were participating in a research program. 


439 


440 
TABLE 1 
BACKGROUND VARIABLES 
Psychotic 
depressives Normals 
N (N =50) (N =50) 
Male 21 21 
Female 29 29 
Mean age 57.26 57.36 
SD 7.81 7.15 
Mean education . 
(grade) 8.47 8.8. 
SD 3.54 3.17 
Native born 40 40 
Foreign born 10 10 
Jewish 34 34 
Christian 16 16 


A nominal compensation for their time was offered, 
which most accepted either for their own use or 
for a contribution to a charitable program. The 
patient group understood that the testing for them 
was part of the hospital routine, but generally 
there seemed to be an awareness that research was 
involved. 


RESULTS 


In Table 2, the comparison between the 
two groups on their raw test scores shows the 
normals consistently performing better than 
the psychotic depressives. The differences, 
however, are not striking and in only one 
instance is statistical significance approached. 
(Information is a shade below the .05 level.) 
Noteworthy also is the closeness of the means 
and SDs on Vocabulary. This confirms the 
assumption mentioned above of similarity of 
intelligence between the two groups, based on 


TABLE 2 


COMPARISON OF Raw Test Scores BETWEEN 
PSYCHOTIC DEPRESSIVES, AND NORMAL SUBJECTS 


(N =50) 
Psychotic . 
depressives Normals 
reat M Sb M SD t 
i 14.28 5.0 16.20 5.14 194 
tion 
m 8.64 5.88 1018 5.97 130 
am 10.86 3.87 11.28 403 0.53 


Thorndike-Gallup 
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the matching procedure, since the Thorndike- 
Gallup Vocabulary Test was designed as 4? 
intelligence measure (Miner, 1957). 

A further examination of the data, on the 
other hand, along with the intercorrelations 
in Table 3, impresses one with the relative 
similarity in the pattern of performance ra 
tween the normals and the depressives. ar, 
acteristically, however, the normals 
more consistent and approximate more ie 
the relationships among the tests repo" a 
by Wechsler (1958), and Doppelt “a 
Wallace (1955) for the age range cove a 
by the subjects. The rather substantial ¢ 
relations between education and each 0 h 
three tests are noteworthy. Here, t00; ze 
normals perform more like Wechsler SAIS 
tional standardization sample for the y Kd 
than the depressives, but the latter s! 
the same basic pattern. T: 

The low negative, statistically insignifict 
correlation of age and education 1S the rence 
for both groups. In this, there is a diffe i 
from the 7 of —.29 reported by ip iad 
Morrison (1961) in their analysis of = to 64 
tive white subjects in the age range of 2 mple: 
years from the WAIS standardization Sa ively 
Accounting for it, probably, is the Ca e 
narrower age range of the subjects jt the 
present study, which did not per™ easing 
well-known historical trend toward incr 
educational opportunities to show its¢ hows 

Intragroup analysis of the test eae on 
a striking difference between perom phe 
the Information and Similarities Test. as 
average weighted scores for the 
shown in Table 4, are different to @ A 
nificant degree, Reflected herein wa sim” 
decline in quality of functioning hê 
larities by the population of which prelati" 
samples are representative. The nn jg M 
in Table 3 suggest that the age facto 
significant in this decline. 
show some decline with age äl A 
but it is quite small and not 5 gniomha 
significant for Vocabulary and for 
tion in the case of the normal 875 i 
r of —.30 between Information 38 05 levi 
depressives is significant at Ùe ‘ine W 
But for Similarities, a precipitate for 
age is found in both groups, l 


$$$ _ 
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TABLE 3 


INTERCORRELATIONS AMONG TEST SCORES, AGE, AND EDUCATION 


Depressives Normals 

Informa- Similari- Voca- Educa- Informa- Similari- Voca- Educa- 

tion ties tion tion tion ties tion tion 
A 
Eluca —.30  -46 -22  —16 -19 -33 -02 —.15 
ot 58 4 39 61 63 68 
iain: 61 34 65 64 
milarities ‘56 58 


tered subjects being significant at better 
abo e .01 level and for the normals at 
ut the .03 level. 
Kip te particularly for the normal 
quality toa more detailed analysis of the 
ivideat i the subjects’ responses to the in- 
Shows ik ems of the Similarities test. Table 5 
for 5 e percentages correct on each item 
Wechsler oe in comparison with the 
i Soria of course, many more significant 
e Wechal etween the depressed sample and 
ad Wech a norms than between the normals 
nent, ‘email, group. The remarkable ele- 
Cldence of tar’ is the exceedingly large in- 
of the és ailures on the relatively easy items 
a €st by the subjects of both groups. 
th interest is the fact that successes on 
Tee items for both groups included 
© Of scores of “one,” reflecting rather 
> relatively immature performance. 


The 3 DISCUSSION 
a p s 3 
tression a are consistent with the noted im- 
e esearch T derived from summaries of 
literature, that psychotic de- 


Co, TABLE 4 


ARIS, 
A ON 
Nb Sn ae WEIGHTED SCORES FOR INFORMATION 
ae OF Psycuotic DEPRESSIVES 
D Noratars (N = 50) 


Information Similarities 
AAE 
De M sD M sD t 
Nom SSiveg 
ed io? 257 716 331 365 
Ee = 0.24 260 7.96 3.50 3.62% 
OL. — 


pressives are not significantly impaired in 
cognitive functions of the type that the In- 
formation, Similarities, and Vocabulary Tests 
represent. Agreement is also found with 
Friedman’s (in press) forthcoming report, 
based on research of which the present study 
is a part, containing an extensive demonstra- 
tion of the basic intellectual intactness of 
hospitalized depressed subjects. 

Yet it cannot be ignored that the normals 
scored consistently higher on all three of the 
tests. Friedman also noted such a trend, 
reporting the normals significantly superior 
on 11 of the 82 scores analyzed, and the 
depressives generally tending toward erratic 
performance. Depression, therefore, appears 
to take its toll, but when tested under benign 


TABLE 5 


COMPARISON OF PERCENTAGE OF CORRECT RESPONSES 
on SIMILARITIES TEST 


Wechsler 
sample 

Depressives Normals (N =1500) 

Item % Correct % Correct % Correct 
1 70* 78* 93 
2 88 88 90 
3 62* 62* 90 
4 66* 74** 86 
5 56* 66 73 
6 42* 50* 69 
7 36* 48 56 
8 50 46 55 
9 22* 40 46 
10 22%* 44 38 
11 16 10 21 
12 10** 22 25 
13 10 14 18 


iani tly different from Wechsler data at .01 level. 
Pe Sienifcantly different from Wechsler data at .05 level. 


442 


conditions (as is the case in most clinically 
oriented studies), the patient can perform at 
close to normal levels. The present study, 
furthermore, suggests that as the complexity 
of mental operations increases, the depressed 
subject finds it more difficult to use his 
capacities. Almost identical means and SDs 
were achieved by the two groups on the 
Vocabulary Test, a situation in which only 
recognition is involved in a multiple choice 
of synonyms. But the far more difficult and 
complex operations of recall and abstraction 
are involved respectively on Information and 
Similarities, requiring the exercise of sus- 
tained effort and some flexibility of mental 
functioning. The relatively inferior quality of 
performance on these tests by the depressives 
as compared to the normals leads one to 
anticipate that under stressful circumstances, 
particularly when the facilitating support of 
the examiner is absent, the depressed sub- 
jects would show a more marked and sig- 
nificant intellectual deficit. 

Another feature deserving attention is the 
poor quality of performance on Similarities 
by the subjects in both groups. From the data 
available, the age factor seems to be most 
prominently associated with this phenomenon, 
suggesting a sharp decline of functioning as 
the individual moves from the fifth and sixth 
to the seventh and eighth decades of life. 
The educational factor is not significant in 
the decline, despite its high correlations with 
the test scores, since the relationship be- 
tween age and education is small for the 
samples. Thus, when education is partialed 
out, the r’s reduce only to —.44 and —.31 
for the depressed and normal groups, 
respectively. 

A marked decline with age on Similarities 
was not found in the original standardiza- 
tion group, which included individuals up 
to age 64. But Doppelt and Wallace (1955) 
observed it in their supplementary standard- 
ization group, which consisted of subjects 
from age 60 to over 75. No rationale was 
offered for the decline. They did comment, 
however, that other verbal tests, such as 
Vocabulary, Information, and Comprehen- 

sion, are useful to the elderly in communica- 
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tion, but that Similarities “constitutes 2? 
artificial situation by the standards of every- 
day life.” Motivation might thus be an 1M- 
portant factor. If so, it is doubtful whether it 
is of great significance, since no direct ev 
dence has been presented showing that ir 
aged extend themselves less conscientiously 
on this test than on the others. The v 
erratic quality of performance shown by it 
subjects of the present study, on the We 
hand, particularly in their handling of $0 E 
of the relatively easy items of the test, se 
gests that there is probably a factor a 
deterioration involved. One can only nae 
late, however, at this point, as to whet ; 
this is psychological or organic or perhap 
both. a 

Tapped by Similarities are such mena 
processes as abstraction, reasoning, and Ty 
agination. Flexibility must be exercise“ i 
the subject. These are mental functions W 


y with 
lso a5 
defects: 


Additionally, considertion must : 
the fact that later maturity is a perio 
brain deterioration is highly prevalen pa 
cordingly, one is tempted to hypothest ection 
the results of the study are largely a rel 

of organic deterioration with seconda A 
of psychological integration. Forth 
vestigation of this area would seem tO 
important and fruitful. 


i 
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ERRATUM 


In t 
(1963 
reference 
Chooler’s 

aS true fi 


he article by Vogel and Lauterbach in the Journal of Consulting Psychology 
» 27, 236-242), the authors note that “Schooler, 1961” should be deleted as a 
for the statement on page 236, line 38 and again on page 238, line 76. 
finding that schizophrenics are born in the second half of the birth order 


of or all families, independent of family size, and was not limited to families 


Ve or more sibs, as stated in the article. 


Journal of Consulting Psychology 
1963, Vol. 27, No. 5, 444-447 


NOTES AND COMMENTS 


USE OF THE SHIPLEY-HARTFORD CONCEPTUAL QUOTIENT AS 
A MEASURE OF INTELLECTUAL IMPAIRMENT ' 


PETER M. LEWINSOHN 
Larue D. Carter Memorial Hospital 


A study to evaluate the use of the Shipley-Hartford (SH) Conceptual Quotient 
(CQ) as a measure of intellectual impairment. 4 groups of 30 Ss, matched for 
age and vocabulary, were given the SH. Chronic (C) and acute (A) schizo- 
phrenics (schiz), nonschizophrenic patients, and normal Ss were used. Residual 
abstraction scores (RA) were computed for each S. Results were as follows: 
(a) CQs and RA scores of C and A schiz were lower than those of normal 
Ss at the .01 and .05 level, respectively; (b) A correlation of —.25 significant 
at .01 level, was obtained between RA and a measure of conceptual disorganiza- 
tion for a group of 137 mental patients. In contrast to the CQ, the RA is 


independent of age and vocabulary. 


The use of the Shipley-Hartford (SH) Con- 
ceptual Quotient (CQ) as a measure of intel- 
tual impairment is based upon the assumption 
fo. where there is intellectual impairment, 
vocabulary is less affected than is the capacity 
for abstract thinking and that in such cases 
there will be a discrepancy between vocabulary 
level and ability to handle abstract problems. 
While direct evidence for this assumption has 
never been provided, Shipley (1941) was able to 
demonstrate rather consistent differences in CQ 
between a normal standardization group and vari- 
ous groups of psychiatric patients who were 
presumed to show varying degrees of intellectual 
impairment. In Shipley’s study, psychopathic 
personalities, psychoneurotics, and alcoholics did 
not differ from the standardization group, while 
the lowest CQs were obtained by organics, fol- 
lowed by the functional psychotics. 

The failure of Magaret and Simpson (1948) 
to find a correlation between the CQ and psychi- 
atrists’ ratings of “deterioration” casts some 
doubt on the assumption that the discrepancy 
between vocabulary and abstraction levels is 
proportionate to intellectual impairment. (It 
should be pointed out, however, that the 
psychiatrists’ ratings were done 6 months after 
the test was taken and no evidence for the 


1 This study was partially supported by a grant 
from the National Institute of Mental Health 


029). š 4 
One is impairment and conceptual disorganiza- 


tion are used to denote loss of efficiency without 
implying irreversibility. 


ted.) Per 


mg age 5, y: n’ 
reliability of the ratings was prese 3 assum, 


haps potentially more damaging to thi fim 
tion is the existence of a correlation “feld & 
between age and CQ (Fecher, 1946, Gar tween 
Fey, 1948) and of a correlation of .30 pcb 
vocabulary and CQ (Lewinsohn ith age 
1961).2 Thus the CQ normally declines a ley’ 
and decreasing vocabulary level. Since’ hed £0 
(1941) original groups were not mat ardiz 
age or vocabulary level (the normal “er some 
tion group was young and it is likely pes oldet 
of the organic and psychotic groups ye oe of 
and had a lower vocabulary level than = 0s 
the less disturbed groups), it is at len ween 
sible that some of the differences in in 08° 
Shipley’s groups were due to differences 
and vocabulary level between the group: dis: 
The present study was to determine age 2” 
criminative value of the CQ when these? 
vocabulary were controlled. The null 
tested were: 


1. The CQs of chronic 
phrenics do not differ from those à 
control subjects. ances ne 

2. The discrepancy between a we of w 
the abstraction and vocabulary SU! gan atio 
S-H is unrelated to conceptual diso 


nd acute 
7 of appr? 


y 
METHOD pat”! 

zi” jed 
jects, appro” elect? 


First, four groups of 30 sub. were 


matched for age and vocabulary, 

ichols unp4 

2 P, M. Lewinsohn and R. C. Nicho™ 
results, 1961. 
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NOTES AND COMMENTS 


TABLE 1 


MEAN AGE AND VOCABULARY SCORES FOR THE GROUPS 


Age Vocabulary 
Group N M’ SD M SD 
Acute schizophrenics 
Male 15 34.0 8.9 23.9 6.5 
Female 15 35:7 48 25.1 4.2 
Nonschizophrenic psychiatric 
Male 15 34.0 8.2 26.1 6.5 
Female 15 35.7 5.3 24.3 5 
Chronic schizophrenics 
Male 15 34.1 8.2 24.1 S.5 
Female 15 36.7 4.8 25.0 4.1 
Normal control 
Male 15 33.3 9.2 24.7 5.7 
Female 15 35.1 5.4 25.9 4.6 
Groy 
Phrenic consisted of newly admitted acute schizo- subject’s abstraction score is above or below the 
of dieno O were carefully selected on the basis score which would be predicted for him on the basis 
Ben sis and symptoms. Group 2 was a hetero- of age and vocabulary score. RA scores were also 


eo 
hospita] cag ot Psychiatric patients from the same 
Zophrenia ad received a diagnosis other than 
oms we Patients manifesting schizophrenic 
i ere excluded from this group. Group 3 
chronic schizophrenics from a state 
who had been ill for at least 6 years, 
continuously hospitalized for at least 3 
piao history included evidence of dis- 
thie “ted o Te and bizarre behavior. No patient 
S study G eing brain damaged was included in 
ç Bencoug ve 4 was selected from a large and 
group of students, general medical 
ang Stoups hospital attendants. Subjects in the 
S Were individually paired for sex, age, 


een 


Matehin an cabulary score. The results of the 

hahe E shown in Table 1. 

otpale schiza administered individually to the 

Min Su jects PErenics and in small groups to the 
lon cgp) Sing standard directions for adminis- 

aen, gaPley, 1940), 

co ission age and S-H scores of 180 consecutive 


Splet ° a psychiatric hospital were used to 
An abstracts, ultiple regression equation to predict 
i ton scores from age and vocabulary.* 
supeo ttaction score was then computed 
io poe and subtracted from his actual 
thus Core. This residual abstraction score 
TA represents the amount by which the 
We Th 


emae anti 
for gee Kn wishes to express his gratitude to 
is gp tit help į Frances King, and Amanda Roth 


: i i sat 
F nA data the collecting and statistical analy- 


Oee 
Sted for omissions. 


computed for subjects in the four matched groups. 

Finally, ratings on the Multidimensional Scale for 
Rating Psychiatric Patients (Lorr, 1953) for 137 
of the 180 consecutive admittees were used to calcu- 
late the correlation between RA scores and Lorr’s 
Factor K (conceptual disorganization). These ratings 
were made on the basis of an independently con- 
ducted interview shortly after admission to the 


hospital. i 
RESULTS AND DISCUSSION 


The mean CQ scores of the four groups are 
shown in Table 3. The results were subjected 


TABLE 2 


Means, STANDARD DEVIATIONS, AND CORRELATIONS 

USED IN DERIVATION OF MULTIPLE REGRESSION 
EQUATION FOR PREDICTING ABSTRACTION SCORE 
(X’) FROM AGE AND VOCABULARY; X’=—.28Y 


+ .98Z + 3.82. 
Abstrac- Vocabu- 
tion lary 
score (X) Age (Y) score (Z) 
Abstraction —Al -64 
Age —.09 
21.6 34.0 27.8 
A 10.5 13.0 6.5 
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TABLE 3 


Mean SHIPLEY-HARTFORD CQ anp RA Scores ror THE GROUPS 


Conceputal Residual ab- 
quotient straction score 
Group N M SD M SD 
Acute schizophrenics 30 79.0 14.4 —3.6 7A 
Nonschizophrenic psychiatric 30 81.0 14.7 —2.6 6.6 
Chronic schizophrenics 30 71.3 iL? —6.9 6.9 
Normal control 30 86.4 15.7 0.4 7.8 


to an analysis of variance. Differences between 
groups were significant at the .001 level of 
confidence. Comparing the means of each of the 
psychiatric groups with that of the control 
group by means of Dunnett’s ¢ statistic 
(Dunnett, 1955) showed the chronic and acute 
schizophrenics to be significantly inferior to 
the normal controls at the .01 and .05 levels 
of confidence, respectively. The difference be- 
tween the nonschizophrenic and the normal con- 
trol group is significant at the .20 level. Dif- 
ferences between male and female subjects and 
the interaction of sex with groups failed to reach 
statistical significance. 

The results thus suggest that schizophrenics 
obtained lower CQs even though the various 
groups had been approximately matched for age 
and vocabulary. The fact that this impairment 
is much less marked for the acute schizophrenics 
is consistent with earlier findings (Lewinsohn 
& Riggs, 1962; Lomont, 1962) showing that 
acute schizophrenics manifest relatively little 
impairment on abstraction tasks which are 
emotionally neutral in content, as the S-H 
abstraction test is presumed to be. 

The mean RA scores of the four groups are 
also shown in Table 3. An analysis of variance 
of these scores yielded F and ¢ values very 
similar to those obtained for CQ scores. Since 
the groups were equated for age and vocabulary, 
this was to be expected. Negative discrepancy 
scores were obtained by 37% of the normal 
control subjects, by 70% of the nonschizo- 
phrenic patients, by 77% of the acute schizo- 


5 A full table of the analyses of variance has been 
deposited with the American Documentation Insti- 
tute. Order Document No. 7553 from ADI Aux- 
iliary Publications Project, Photoduplication Service, 
Library of Congress, Washington 25, D. C. Remit 
in advance $1.25 for microfilm or $1.25 for 
photocopies and make checks payable to: Chief, 
Photoduplication Service, Library of Congress. 


phrenics and by 83% of the chronic aoe 
phrenics, The advantage of the RA score os 
the CQ is that it is within the age and ken, 
lary ranges of the population of psychia he 
admittees used in this study, independent eT 
subject’s age and vocabulary score. In Me 
to this, interpretation of the CQ is confou 
by its correlation with these two variables. is 
The correlation between RA score an the 
Factor K was —.25 which is significant 3 nt 
01 level of significance. Thus the results ¢0 low 
support the second hypothesis and suggest si red 
but significant relationship between imp ally 
performance on an abstraction test an 
observed manifestations of conceptua 
organization. 


CONCLUSIONS 


on 
Interpretation of the Shipley-Hartford Cy 
ceptual Quotient is confounded by 145 vesidu 
tion with age and vocabulary level. At 
abstraction score, which is independent aie the 
two variables, was obtained by comput rained 
difference between the estimated an a he 
Shipley-Hartford abstraction subtest soa and 
mean residual abstraction scores of ome Be 
acute schizophrenics were significantly ee Jow 
those of the normal control group. f; pet were 
but significant correlation was obtaine m tine 
residual abstraction scores and interv?" jts av 
of conceptual disorganization. The T th 
interpreted as providing some supP 
assumptions underlying the use 0 
ancy between vocabulary and abs 
as an index of intellectual impairment- 


tractio® 
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OVERINCLUSIVE THINKING IN PROCESS AND 
REACTIVE SCHIZOPHRENICS 


THOMAS S. ELISEO 


Veterans Administration Hospital, Lebanon, Pa. 


Previous research has indicated that schizophrenics show more overinclusive 
thinking (ie, failure to exclude behavior unrelated to a task) than do 
normals.” 17 reactive schizophrenics, 29 process schizophrenics, and 21 chronic 
general medical-surgical patients, equivalent in vocabulary knowledge and 
education, were compared on the Epstein Inclusion Test. Contrary to the 
Prediction that the process schizophrenics would commit more errors, no 
significant differences were found between the groups. It is possible that 
©verinclusion is a characteristic of depression or chronic illness in general, 
rather than specific to the schizophrenic disorder. 


Ebstein's 
Meron’ 
over; 


study (1953) has supported 
includ, (1944) theory that schizophrenics 
Melugign - More than a normal group. By over- 
PPopriate. meant the failure to exclude in- 
; Seen, behavior to perform a task properly. 
k rding to Cameron the schizophrenic’s 
A performance is marked by the in- 
ig Stein’s of irrelevant ideas and objects. 
iE aper ational definition of overinclusion 
fop EC Sa and pencil test, which presents the 
lowed ith a list of 50 stimulus words, each 
yore.” Th, 5 response words and the word 
Sap response subject’s task is to underline all 
De. o ee words that are absolutely neces- 
ling licte i the stimulus words. Epstein 
(i eq E found that schizophrenics under- 
dig? Over, oSPONse words than the normals 
nlctencar cluded more) and that there were no 
indes ween normals and schizophrenics 
Nse SS of an insufficient number of 
Sth (underinclusion). k 
z ìzophrenics are classified as either 
or reactive (Re), according to their 


Con, 


prepsychotic adjustment, differences in their be- 
havior have been noted (Herron, 1962; King, 
1958). The Pr, in comparison with the Re, has 
a less favorable prognosis, shows a deficit in 
physiological reactivity, gives more “psychotic” 
Rorschach records, more organic signs and 
responses of a lower genetic level on the 
Rorschach, and interprets proverbs on a more 
concrete level (King, 1958). Additionally, the 
Pr shows early signs of and a gradual onset of 
the psychosis and an inadequate prepsychotic 
social, economic, and sexual adjustment, whereas 
the Re exhibits a marked precipitating stress, 
sudden onset of illness, fairly good prognosis, 
and a comparatively normal prepsychotic ad- 
justment (Fine and Zimet, 1959). Thus, previous 
research (King, 1958) suggests that it is likely 
the Pr would show greater deficit than the Re 
in conceptualization. 

The present study is concerned with the over- 
inclusive behavior of Pr and Re. It was hypothe- 
sized that Pr would overinclude significantly 
more than the Re, who would in turn make more 
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overinclusive errors than general medical-surgical 
patients suffering from chronic physical illness 
(GMS) and that there would be no differences 
in underinclusive errors between the groups. 


METHOD 


Tests. The Epstein Inclusion Test (Epstein, 1953) 
and the Word Knowledge Test, which is part of 
the Airman Classification Battery (Fruchter, 1949), 
were given to 17 Re, 29 Pr, and 21 GMS. Epstein’s 
procedure was followed in the administration of the 
Inclusion Test, from which overinclusion and under- 
inclusion scores were obtained. The Word Knowl- 
edge Test was given in order to have the subjects 
equivalent in Vocabulary level. Both tests were 
administered either individually or in groups of 
2 to 4 patients. 5 , 

Subjects. The schizophrenic subjects were selected 
from a group of newly admitted patients, 18 from 
each of the following Veterans Administration 
Hospitals: Tomah, Wisconsin; Battle Creek, Michi- 
gan; and Lebanon, Pennsylvania. The 3 hospitals 
were part of 32 Veterans Administration Hospitals 
participating in Project VI of the Veterans Adminis- 
tration Cooperative Studies of Chemotherapy in 
Psychiatry, the purpose of which was to examine 
the effectiveness of 6 phenothiazines.: The patients 
were newly admitted schizophrenics who had not 
received any tranquilizing medication for at least 
1 month before admission. The majority of the 
patients were service connected for a schizophrenic 
disorder. The mean number of hospitalizations for 
schizophrenic disorders was 3.90 for Pr and 3.29 for 

Re. Thus, they can be regarded as being chronic, 
although obviously not “backward” schizophrenics. 
All subjects were under 55, showing no evidence 
of organic damage or mental deficiency. A social 
worker rated the patients on the Elgin Prognostic 
Scale (Wittman, 1941), as modified by Becker 
(1959), on the basis of either an interview with 
the patient or an available social history. Scores 
on the Becker revision range from 0-50; the median, 
25, was selected as the separation point for Pr 
(scores of 24 or less) and Re (scores of 26 or 
higher) ; the subjects with scores of 25 were evenly 
divided between Pr and Re. Of the original 54 
schizophrenics, 18 were classified as Re, 35 as Pr; 
1 patient was not classified because of refusal to be 
interviewed and an inadequate clinical folder. Six 
of the Pr and one Re either refused or invalidated 


1 The author wishes to thank J. J. Laskey of the 
Veterans Administration Central Research Labora- 
tory for aid in obtaining schizophrenic subjects and 
the process-reactive classifications and background 
data for the subjects. Thanks are due also to D. 
Pearl, Veterans Administration Hospital, Battle 
Creek, and V. Pishkin, Veterans Administration 
Hospital, Tomah, for testing the 18 patients from 
each of their hospitals. The suggestions and criti- 
cisms of L. Aumack, H. Blackburn, and A. Felice 


of Lebanon were helpful. 
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the testing. Thus, the final testing involved 17 Re 
and 29 Pr. 

The GMS patients were from Lebanon Veterans 
Administration Hospital. All suffered from a chron’ 
physical disorder, including diagnoses of heart dis 
ease, cirrhosis, pulmonary disorders (e.g. asthma), 
arthritis, and gastric ulcers (Kessler, 1953). Some 
of these conditions are considered “psychosomatic i 
in the selection of the subjects, those having al 
concomitant or previous psychotic or neurologic 
diagnosis or showing signs of such were eliminata 
The mean for the total number of hospitalization 
for the GMS was 3.29. The difference 1m a 
number of hospitalizations for the three grot 
(Pr =3.90, Re =3.29, and GMS = 3.29) Wes o- 
significant. Comparison of the GMS and pa 
phrenic groups would provide a closer estimate a 
the effect of the schizophrenic thought disait 
itself, avoiding the confusion of a specific disoras 
(e.g schizophrenia) with the effects of a chro 
illness in general. Jedge 

The highest possible score on the Word Know ore 
Test was 30. The mean Word Knowledge S° 
was 22.29 for GMS, 20.53 for Re, and 
Pr, while the mean years of schooling was Pr. 
for the GMS, 11.00 for the Re, and 10.69 fOr sg 
The groups did not differ significantly with AA 
to vocabulary level and education. The GMS © Pr 
however, significantly older (p< .01) than thi for 
and Re. The mean age for the GMS was 4295r ant 
Re 35.41, and for Pr 37.71. The product-mom g, 
correlation of age and Word Knowledge WE es 
indicating little relationship between these varia 


RESULTS ne 
;, t 
Because of the unequal sample saajal: 


Kruskal-Wallis H test was used in the ana owl- 
The correlation between Elgin and Word Knove 
edge scores was low (r= .02). Contrary to oe 
hypothesis, there were no significant oie 1): 
between the groups in overinclusion (Teb aot 
The prediction of no differences in underine Y oy 
was supported, although a trend toward £ 
underinclusion by the GMS is apparent. wledg? 
was little relationship between Word Kn 

and Overinclusive Scores (r= —.07), tiles 
r of —.52 (p< .01) between Word Lona the 
and Underinclusive Score suggests t! clusivé 
higher a vocabulary level the less under!” 
errors. 


Discussion twee? 

Perhaps the lack of differences t i 
GMS, Re, and Pr can be attributed gt 
hypothesis that overinclusion is an al J 
depression or of chronic illness in ane the" 
is assumed and gross observation in! z ical ar 
persons suffering from a chronic phy pert 


order are usually depressed (Grinker $ on ÍS 
1954, p. 329). One effect of depress 
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TABLE 1 


OVERINCLUSION AND UNDERINCLUSION SCORES 
For GMS, Re, AND Pr 


GMS Re Pr Significance 
(V=21) (W=17) (V=29) test 
ess Overinclusion 
yom 25, 27 31 H = 48 
SD 30.71 38.76 = 41.93.80 > p > .70 
Ran 21.75 35.90 36.89 
&e 6-70 2-112 1-185 
n Underinclusion 
yia 5 10 13 H=48 
SD 18.19 12.00 16.52.10 > p > .05 
Range 9.97 7.02 9.69 
543 2-29 0-40 
ficulty ; Eii 
Such ten Concentrating and focusing attention, 
a task Aai effort is required to accomplish 
Peculate Teti, 1959, p. 425). Epstein (1953) 
Cause that schizophrenics overinclude be- 


Paire or a defect in attention, resulting in im- 
both de ‘Sctimination and generalization. Thus, 
hay, aoe and schizophrenic individuals may 
the ability in attending, thereby hampering 
ti tty to perform tasks requiring concentra- 
moderately depressed person would be 
wh to make more overinclusive 
ul silken a a severely depressed individual 
Teg T Not participate or if he did would 
hace nding to the task and would prob- 
“clude. In the present study no 
depression at the time of testing was 
Support of the depression explana- 
and Hirst (1957) found that depres- 
T ee Significantly more than a 
digg E, findi Ospitalized control group. 
s E that the GMS group did not 
Surface schizophrenics was surprising and 
© contradicts, if it is assumed that 
i: a normal group, most previous 
tt usual control groups in studies 
chants “Phrenic thinking have been college 
nic ae not suffering from a ani 
Vig. Cong, ct! affliction. Epstein’s contro 
tals foisted of nonhospitalized normal indi- 
Various social and occupational 
attends recreation groups, housewives, 
stan secretaries, and professional 
aon, the tye’, 1953; P. 385). Thus, whether 
SPhrenig Ought disorder was specific to 
Or to long-term illness was un- 
Present results suggest that the 
> Whether the symptoms are those 


g 
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» vi 


frem 


449 


COMMENTS 


of schizophrenia or physical, show some deficit 
in conceptualization. Whether the deficit is 
specific to overinclusion or is present in other 
aspects of thinking between depressives, schizo- 
phrenics, the chronically physically ill, and non- 
hospitalized normals remains to be investigated. 
In a related, as yet unpublished, study the 
present writer found that GMS, Re, and Pr did 
not differ in the interpretation of the figurative 
and literal meanings of words. Becker (1956) 
found that newly admitted male Pr and Re dif- 
fered in the ability to abstract proverbs (the 
Re made better interpretations than the Pr), 
whereas the females did not differ. No adequate 
explanation for this sex difference was found. 
Thus differences in conceptual deficit between 
male Pr and Re may be limited only to the 
interpretation of proverbs. 
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TABLE 4 


NUMBER OF CHILDREN IN CLINIC FAMILIES 


Source (in percentage) 


South- South- North- North- 


TABLE 5 \ 


CLASSIFICATION OF DOMINANT SYMPTOMATIC 
BEHAVIOR AT INTAKE 


Source (in percentage) 
ae ee 


Southern Northern 


Number of em erm ern ern th a 
children clinic census* clinic census® Dominant symptom clinic c 
1 13 42 15 35 Aggressive 18 z 
2 32, 31 41 33 Nonaggressive 28 9 
3 and more 55, 27 44 32 Somatic complaints 10 14 ( 
Mentally retarded 26 10 
Total 100 100 100 100 Mixed 17 4 
Unknown 1 
= lit ksonville. 
+ Consus Sguree refer to eroa ag ul eording to observed Total 100 100 
frequencies OF E efer to New York State. 
s failed 


As readily seen in Table 5 at the Northern 
clinic there was a marked incidence of children 
whose predominant symptom was aggressive 
behavior. That finding was said to be consistent 
with the cultural agents contributing to the 
caseload of that clinic. In the Southern clinic, 
however, the predominant symptoms were those 
of nonaggressive behavior such as withdrawal, 
daydreaming, silliness, and mental retardation. 
No inferences are drawn on the possibility of 
cultural agents contributing to these symptom 
pictures. 


DISCUSSION 


The results indicate that the Southern clinic 
and the Northern clinic serve somewhat dif- 
ferent populations. The Northern clinic serves 
mainly white, middle to high socioeconomic 
groups; the Southern clinic disproportionately 
serves whites of middle to low socioeconomic 
status. While in the Southern clinic, Protestants 
were underrepresented and Catholics overrepre- 
sented, the clientele is more representative of 
the three different religions in the community 
than was that of the northern clinic. However, 


both the Southern and Northern clinic 


. ly. 
to represent the Negro population adequately 
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METHOD FOR THE STUDY OF STAFF INTERACTION THROUGH 
RATINGS OF PATIENTS 


JEROME FISHER 


Langley Porter Neuropsychiatric Institute 1 


Ratings of degree of improvement of 58 hospitalized patients were obtained 
from a staff of 20 engaged in a therapeutic community type of program. 
Analysis of staff intercorrelations of improvement yielded an unexpected 
configuration in which the head nurse of the ward was found to be the 
star of a sociogram of the staff raters, indicating her importance and in- 
fluence in many different interactional and interpersonal spheres of ward, 
hospital, staff, and patient activities. The results of the method applied in this 
study corroborate earlier observations by Stanton and Schwartz (1954). It is 
suggested that continuous, systematic evaluations of staff patterns of interaction 
be made an integral part of therapeutic programs. 


Na a novel, One Flew Over the Cuckoo’s 
experience. (1962) writes about some of his 
atric hark asa nursing assistant on the psychi- 
ticular a of two California hospitals. Of par- 
ho ig mie is the character of a head nurse, 
À ed Big Nurse” and, through the eyes 
Psychotic patient, is described as follows: 


direc pe wields a sure power that extends in all 
Ye but ni hairlike wires too small for anybody's 
Web of Sie Bi see her sit in the center of this 
Wit] mech nke a watchful robot, tend her network 
Which The a insect skill, know every second 
Send Up t runs where and just what current to 
© get the results she wants [p. 26].* 

1 The cota 

influence incidence between the powers and 
and the 4 attributed figuratively to Big Nurse 
Which implications of the results of this study, 
Vated gp o5 Completed about 4 years ago* moti- 
Ritially mts of this paper. 

Staff rf » the purpose was to show how well a 
at thera O people, working together in a type 
1954 Pile pies community program (between 
Behera 1959) on a small psychiatic ward of 
Ree on ridiga] and surgical hospital, could 
8 Male one the degree of improvement of 
A lents, As the study proceeded, how- 
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Dems) Copy; 
a alssion sp Ent 1962 by Ken Kesey. Reprinted by 
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ever, a sociometric method was developed for 
investigating the interactional structure of the 
staff. It was from these sociometric findings 
that something approaching the Big Nurse 
phenomenon evolved. 


METHOD 


The patients were representative of veterans who 
had been hospitalized for various kinds and degrees 
of psychiatric disabilities. Their characteristics as 
to age, education, diagnosis, etc, are not relevant 
to this report and, therefore, are not presented. On 
the other hand, how well each staff member knew 
and could recall the patients was a function of how 
long the patients had been in the hospital. The 
range of hospital stay was from 1 month to 2.5 
years (the latter applied to only one patient). The 
mean was 5 months. 

Among the 20 staff members the following person- 
nel were represented: 2 clinical psychologists, 4 
clinical psychology trainees, 1 head nurse, 5 nursing 
aides, 1 occupational therapist, 1 psychiatrist, 1 
psychiatric social worker, 1 recreational therapist, 
and 4 staff nurses. 

The following rating scale was presented to the 
staff with instructions to rate the degree of improve- 
ment shown by each of the 58 patients: 0, Don’t 
know, don’t remember the patient; 1, Worse; 2, No 
change; 3, Some minor improvement; 4, Improved; 
5, Considerably improved. 

To minimize possible systematic effects of order, 
the names of the 58 patients were randomly ar- 
ranged on each of three lists. In turn, the 20 staff 
members were also assigned randomly to one of 
the three lists. 

After excluding all ratings of zero, a mean, and 
sigma of the staff’s ratings of improvement for 
each patient were computed. A minimum of four 
staff ratings for each patient was considered essential 
for purposes of studying interrater agreement. Four 
of the 58 patients did not meet this minimum and 
were eliminated. A subsequent review of their 
hospital charts revealed that they had been hospi- 
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talized a week or less when they left “against tients, it is possible to study, among other things, 
medical advice.” how well the staff agree or disagree among 
ee themselves. In addition, the data peril ie 
graphic and sociometric application. The latte 

The ratings of the 20 staff members on 54 could also apply to other scaling techniques, 
patients were intercorrelated and only those that Q sorts, etc. I 
reached the .001 level of significance were ac- The “star” of the sociogram, Figure 1, n 
ceptable. The more conservative probability level clearly BV, the head nurse. Her position ° 
was considered appropriate because of the fre- “influence” is underscored particularly by t° 
quency of small numbers of patients rated by finding that, of the other nine staff members, 
some pairs of raters. Each correlation was tested only four are on her nursing staff (ZM and 
against its own standard error derived for each HB—nurses; TB and DF—aides); the test 
pair of raters and based upon the particular belong to other disciplines (HN—occupation® 


N for that pair. therapist, LR—psychiatrist, CE—social workeh 


Figure 1 presents a sociogram which, with and BJ and FJ—clinical psychologists). Amor 
several modifications in data analysis and the latter five, there is considerable interagt™ 
: joini 


(1957) in his unpublished study of social psy- each of them. Vet, each agrees with BV, sU! ible 
chological characteristics of ward treatment ing the possibility that BV may be zespo 
s 


teams. It is based upon the statistically signifi- for some of their commonly shared po 
i 


cant intercorrelations which were obtained among view about the patients, Another finding OF | ist 
10 of the 20 members; each circle represents est is the sociogram position of the psychiatr y 
one of the 10. The lines between circles indicate (LR). It would seem that his sphere of profes 
those with whom each one “agrees” or shares sional and administrative influence iS byn 


graphic illustration, was suggested by Giedt ment as represented by the mutual lines J a 
gE! 


similar points of view. BV, the head nurse, means so extensive as that of BV even tho ht 
occupies the center position of the sociogram; as the only psychiatrist on the ward, it ! Ex 
note that there are nine lines that join her and have been assumed that in his role as © ief i 
other staff members. Nearest to her is HN, the the service, he would have been the star ©: 
occupational therapist, with seven lines. At in- sociogram. That he was not may have a 
creasing distances from BV are the other staff because he preferred the nonauthoritarian ra 
members, with DF, a nursing aide, showing only of a staff member who, along with the other 
a single line joining him to BV. shared certain prescribed responsibliti | 
to his or her own discipline, for the OP 
DISCUSSION the total therapeutic program. sey st 
By intercorrelating staff ratings of improve- ene o peg EN in me 
ment, ie., their phenomenological judgments predictive implicati ding the het 
about improvement, regardless of real or objec- p. ons regarding They: 
tive determinants of improvement of the pa- 


jque 
uniqu 
es, 2 of 


eration 


charge nurse with respect to this stu y donc) 
too, note what seems to be a genera 


to overlook her importance: ‘tal 
was the gui 


What was greatly underestimated Es 
role of the charge nurse as a coordinator <p wart?, 
as an intermediary (italics by Stanton an° "ise in 

| 


was strikingly necessary for a charge TY pec 
position such as we tidied It was esse nti offi 
a large part of her role was aS liaison e e 
through whom the activities pertaining to. i we 
were channeled. She served as a go-betwee as f 
various parts of the hospital and pa 
intra-ward staff coordinator. Fina 
central figure who either executed, evade Stanto® 
the arguments and decisions made 
Schwartz, 1954, p. 157]. staf 


tients, © he 


Daily morning meetings of moe t 
Fic. 1. Sociogram based only upon the intercorre- took place in a large room re event g's 
lations of those staff ratings which exceeded the 001 nurse’s office where she reported m i 


level of confidence. the preceding day, night, and early 
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petivities of each patient in the program, as 
as to her by the nurses and nursing as- 
ace turn she was in the best position to 
ee ee feedback” via impressions and attitudes 
Re Staff to the patients as well as to her night 
€s and aides. 

staf ne before this study was undertaken, the 
tion ad begun to examine its patterns of inter- 
group, T regular weekly group meetings. As a 
ae e members of the staff conceived of 
cratic Me Senerally as group oriented, demo- 
Policy, n their decision-making efforts about 
ing i patient-staff activities, etc. It is interest- 
a ba eae, to view the findings against 
i sibais ground of these meetings. Perhaps it 
He eid that the head nurse and/or one of 
ecause bt pees attended the meetings irregularly 
nursing k general ward pressures and patient 
Were [or ine But when they did attend they 
aides “hci silent observers in the group. The 
attended : also invited, but they rarely, if ever, 
h ths he Meetings. One might speculate that 
5 Parne g staff been present and active 
s Pp oes these data might have shifted in 
cally cresting and curious ways. Parentheti- 
Meetings general impression about these staff 
of Galio „Was largely supportive of the work 
ni, Adams, and Tallman (1953), namely, 


4 Bac; 
Was aides the Psychiatric ward, the nursing staff 
Joining nes Ponsible for the nursing care of an ad- 
high turngtt!sy ward of 20 beds. Shortages and 
del Over rates of nurses were constant person- 
by „the foe These were some of the reasons offered 
ce F nurse in explaining the irregular at- 
her and her staff. In light of these 
Circumstances, it is even more curious 
d some of her staff occupy the position 
e sociogram. 
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that treatment of patients is maximized by con- 
ducting therapy with the staff, concurrently and 
continuously. Along these lines, an additional 
note is suggested by the findings of this study. 
The phenomenological field represented by the 
staff is; in effect, the environmental field for 
the patients. The interpersonal and interactional 
relations among staff members are assumed to 
be important variables of therapeutic influence. 
It would seem necessary, then, that some form 
of evaluation of roles and activities of the pro- 
fessional staff be built into therapeutic programs 
so that appropriate assessment measures may be 
systematically and continuously applied. Results 
of such studies may prove to “actualize” treat- 
ment, as Galioni et al. found, as well to 
provide more comparative data regarding the 
study of interpersonal relations among staff 
members. Further study is planned of related 
variables, e.g., patient conformity and staff 
expectations, and their effects on improvement 
and likeability of patients as rated by staff. 
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USE OF THE TERMAN-MERRILL ABBREVIATED SCALE ON THE 
1960 STANFORD-BINET FORM L-M ON NEGRO ELEMENTARY 
SCHOOL CHILDREN OF THE SOUTHEASTERN 
UNITED STATES 


WALLACE A. KENNEDY, VERNON VAN De RIET, anb JAMES C. WHITE, Jr. 
Florida State University 


Consideration was given to the degree of precision which would have been 
lost had the abbreviated scale only been administered in a normative study of 
the 1960 Stanford-Binet Form L-M on 1800 Negro elementary school children 


from five Southeastern states (Kennedy, Van De Riet, & White, 


1961). 


A Pearson product-moment correlation of .99 was obtained between the 
mental age scores on the full and abbreviated scales for the 1800 Ss stratified 
according to age, grade, sex, socioeconomic status, and community size, and 
randomized within these limits. There was little variation from grades 1 to 6, 
with a mean IQ of 80.7, a SD of 12.4. This low IQ in a homogeneous 
population is one explanation for the high correlation. With these Ss little 
precision is gained from the use of the full scale over the abbreviated scale 


of the 1960 Stanford-Binet Form L-M. 


Since the 1937 revision of the test, an ab- 
breviated scale of the Stanford-Binet has been 
available, using only two thirds of the items 
and saving one third of the administration time. 
In large-scale research a saving of about 15 
minutes on each subject for individually ad- 
ministered tests is of considerable significance. 
Although Terman and Merrill (1937), in de- 
scribing the abbreviated scale, did not present 
any statistical treatment, they mentioned that 
an error of up to 10 points might be made 
in estimating the full-scale IQ from the ab- 
breviated one. 

Watson (1951), in his chapter on the Binet, 
reviews the work from 1937 to 1950, and this 
source is used by Terman and Merrill (1960) in 
the manual accompanying the 1960 L-M Revi- 
sion. In his review Watson (1961) says: 


Despite the high correlations found rather consist- 
ently, the Terman and Merrill short form . . . has 
slightly lower mean IQs than those found for the 
full scale. However, in no case is the mean dif- 
ference obtained between the short and the full 
scale statistically significant [p. 261]. 


Five of the six studies reviewed by Watson 
(Kvaraceus, 1940; Shotwell & McCulloch, 1944; 
Spache, 1944; Spaulding, 1945; Wright, 1942) 
represent highly atypical children mostly mental 
defectives, or, as in the case of Spache’s 100 
subjects, nursery and private school children 
with a mean IQ of 124. Brown (1942), on the 
other hand, studied 300 randomly selected kin- 
dergarten children drawn from 1 year’s enter- 


ing class of a large city. He reported a corre- 


ith 2 
lation between the two scales of .94; oi y 
13.2% of the population differing 1° 
more than 5 points. wld seem 


From a review of the literature it WO corre- 
that there is no large-scale test of the 
lation between the two Stanford-Binet Sking 


It is then little wonder that generally SPa 


on, 

Jy. 
to speak of it as an emergency expedient a 

In the fall of 1960, shortly after 
cation of the 1960 revision of the 
combined Form L-M, the writers C 
broad, normative study of Negro tes. 
school children in five Southeastern stding i 
sample of 1800 subjects, stratified acco! gm 
age, grade, sex, socioeconomic status, 22° its, 
munity size, and randomized within t pe the 
was tested by a team of examiners , ia tate 
Human Development Clinic of Reg cal 
University. In addition to the Binet, "nous? 
fornia Achievement Test and the G00 
Draw-a-Man Test were administered. 

The results of the normative study, 
elsewhere (Kenendy, Van De Riet,, yibuti 4 
1961), indicate a slightly skewed dis of ba 
with an IQ mean of 80.7, and 4 yittle © 
There were no sex differences in 1Q; ett 
ference in IQs of children from urbaani he 
politan, and rural areas; but a highly, $ con o 

; ject n 


relationship between IQ and soc 
; Qa with ages 


onducte 
elementa 


Status; negative correlation W1 on” 
correlation between grade and IQ. some joð 
In looking back over the project eci” 


sideration was given to the degree ° 
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TABLE 1 


Means, STANDARD DEVIATIONS, CORRELATIONS, AND STANDARD ERROR OF THE ESTIMATE FOR THE 
FULL SCALE AND ABBREVIATED SCALE MA OF THE STANFORD-BINET Fors L-M 


Ch i a Standard 
„Tonological age Full scale Abbreviated scale error 
Yea of the 
: Months N MA SD MA SD Correlation estimate 
5 
6 ee 19 614 3.8 60.2 41 93 35 
7 2-83 227 67.0 94 65.6 9.2 98 17 
8 84-95 243 748 10.8 73.2 10.7 98 14 
b oe 302 84.4 12.1 82.4 12.6 97 17 
10 oo} 19 281 93.3 144 91.5 15.5 98 23 
1 i ake 299 103.0 17.3 101.7 18.3 98 20 
12 i 2-143 279 110.0 18.5 108.6 19.2 99 10 
13 ta 109 109.0 18.4 107.6 18.5 98 35 
14 ee 167 30 98.5 18.5 96.5 19.5 99 52 
-179 9 106.2 13.9 105.3 12.4 98 1.00 
REFERENCES 
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scale oy have been lost had the abbreviated 
a progran een administered. With this in mind, 
MA on ins Written to compute the revised 
Was felt the basis of the abbreviated scale, It 
ant compu because of the common CA used 
might yiela = the IQ, a comparison of IQs 
3 Comparison ously high correlation and that 
Urate Dict n of MAs would give a more ac- 
Scale, ure of the power of the abbreviated 
Tab 
ati oi presents the means, standard devi- 
Stimate fee ations, and standard error of the 
Seen w the two scales. It can be readily 
Points wi the abbreviated MA mean is 2 
il than the full form, and that the 
subs eos Sg is 6 months lower. The cor- 
to Jets at wee a low of .93 for the 19 
u 5 Years to 71 months, which is equivalent 
-~ The PA e for the 279 subjects at year 
Subs COrelati correlation was .99. 
ty Jects ig bre = ‘99 on a sample of 1800 
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an error of more than 5 months 
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t m the abbreviated MA. 
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Seem then from this study that 
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fast, the advantage obtained in 
5 the full scale is slight and the 
ariability between the two IQs is 
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BRIEF REPORT 


SOCIAL DESIRABILITY: A REPLY TO WIGGINS? 


JERALD N. WALKER 2 


University of Washington 


A critique by Walker (1962) of an earlier study by Wiggins (1959) was 
subsequently challenged (Wiggins, 1963) on four main issues: “(a) the actual 
development of the SD scale, (b) the success of other desirability scales in 
both studies, (c) the similarity of other control scales in both studies, and (d) 
Walker’s failure to control for the test-retest ‘improvement’ effect.” This paper 
has demonstrated that these points are inappropriate or inadequate and that 
Walker’s results and conclusions must stand. 


Wiggins (1959) reported a study in which he 
attempted to evaluate the adequacy of a variety 
of purported measures of social desirability. 
Walker (1962) questioned his results, and, in a 
reply, Wiggins (1963) held that the critique was 
unsuccessful and that his original results must 
stand. This paper is a reply to Wiggins’ criticisms. 

The items of the SD scale were originally 
selected on the basis of perfect consensus among 
ten judges, rather than on the basis of social 
desirability scale values, because Edwards (1957) 
was interested “only in being able to select a 
set for which there was clear-cut agreement as 
to what the socially desirable response should 
be [p. 30].” 

There is no inconsistency between the fact 
that the SD scale was constructed on the basis 
of a role playing group, and Walker's criticisms. 
It must be obvious that in a role playing experi- 
ment, of two measures of social desirability, the 
one whose control mean is higher relative to its 
maximum possible score is bound to come off 
second best with a scale whose control mean is 
lower relative to its maximum possible score. 
(The control mean of Edwards’ SD scale was 
30.99, and has a maximum possible score of 39; 
the control mean of Wiggins’ Sd scale was 12.06, 
and has a maximum possible score of 40 [Wig- 
gins, 1959].) 


1An extended report of this study may be 
obtained without charge from Jerald N. Walker, 
P.O. Box 787, Presidio of Monterey, California, or 
for a fee from the American Documentation In- 
stitute. Order Document No. 7630 from ADI Auxil- 
iary Publications Project, Photoduplication Service, 
Library of Congress, Washington 25, D. C. Remit 
in advance $1.25 for microfilm or $1.25 for photo- 
copies, and make checks payable to: Chief, Photo- 
duplication Service, Library of Congress. 

2Now at Human Resources Research Office, 


Presidio of Monterey, California. 
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RELIABILITY OF MEASUREMENT UNDER VARIOUS 
EXPERIMENTAL INSTRUCTIONS? 


MARGERY H. KRIEGER anp CHARLES B. SCHULTZ 


Veterans Administration Hospital, Seattle, Washington 
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retest change can range from 0—6 points. Average 
change scores were calculated for each scale on 
each concept (N = 53), and used as scores to 
reflect amount of change. For the entire sample 
of scales (V = 19 X 6=114), these change scores 
followed a unimodal, symmetric distribution. 
Mean change score for each of the six concepts 
was less than one score unit, ranging from 
Self = .62 to Mental Patient=.89. The F 
(between mean square/within mean square) 
between these six means is 2.67 (p < .05). 
Duncan’s new multiple range test (Edwards, 
1960) indicates that Self, Instructor, and 
Kennedy versus Mental Patient, and Self versus 
Doctor are significantly different at the .05 level 
or beyond. The overall mean change score for 
combined specific concepts (.67) is siznificantly 
lower than that for nonspecific concepts (.81) 
(t = 3.11, p < .005). All specific concepts yielded 
lower mean change scores than did any non- 
specific concept. 

These results indicate that test-retest reliabil- 
ity of an instrument varies under different 
experimental instructions. Specific concepts tend 
to be more stable than nonspecific ones. The 
magnitude of any correlations between such 
concepts will be limited by their reliabilities. 
Thus, differences in correlations between experi- 
mental instructions (often interpreted as dis- 
similarity) or between occasions (often inter- 
preted as change) may be due to unreliability 
of measurement rather than to the effect of 
the independent variable. It should be empha- 
sized that these findings may be limited to the 
particular instrument used. Repetition of the 
study with other instruments is planned. 
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THE RELATION OF THE GORDON PERSONAL INVENTORY 
TO SEVERAL EXTERNAL CRITERIA? 


WARREN W. WILLINGHAM 
Harcourt, Brace and World, New York 


anp ROSALIE K. AMBLER 


United States Naval School oj Aviation Medicine, Pensacola 


The Gordon Personal Inventory is a self- 
rating personality scale which measures four 
traits: Cautiousness, Original Thinking, Personal 
Relations, and Vigor. The purpose of this study 
is to determine the relationship between these 
traits as measured by the inventory and peer 
judgments of the same traits. 

Eleven cadet sections in United States Naval 
School, Pre-Flight, served as subjects (N = 208). 
The Gordon Personal Inventory (GPI) and 
nomination forms were administered in a regular 
testing period in the fourteenth week of training. 
It is likely that testing approximated adminis- 
trative conditions since these cadets take similar 
tests which are used administratively. 

After completing the GPI, cadets nominated 
high and low men on each of four traits corre- 
sponding to those measured by the GPI. These 
traits were defined on the nomination blank by 
five positive and five negative statements taken 
verbatim from the GPI. Standard scores for the 
four traits were derived from the nominations 
by methods described by Willingham (1959). 
The number of delinquency reports (disciplinary 
infractions) each cadet incurred in training was 
also recorded on the supposition that this vari- 

able is related to cautiousness. 

The intercorrelations among the GPI scales 
were lower than those reported in the manual 


1 The opinions and conclusions do not necessarily 
reflect those of the Department of the Navy. 

An extended report of this study may be obtained 
without charge from Warren W. Willingham, Georgia 
Institute of Technology, Atlanta 13, Georgia, or 
for a fee from the American Documentation Insti- 
tute. Order Document No. 7625 from ADI Publi- 
cations Project, Photoduplication Service, Library 
of Congress, Washington 25, D. C. Remit in advance 
$1.25 for microfilm or $1.25 for photocopies, 
and make checks payable to: Chief, Photoduplication 
Service, Library of Congress. 
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On an a priori (face validity) basis it was 
decided that the scales of aggression, hetero- 
sexuality, and abasement of the EPPS were most 
relevant. ¢ tests were used to compare results 
from these scales and three significant ¢’s were 
found, all on the aggression scale. Women draw- 
ing males on both or either of the two adminis- 
trations of the DAP had a higher score on the 
aggression scale, i.e., had a need for, or showed 
more aggression. 

An analysis of variance was computed to 
determine whether there were differences in 
the profiles of the four groups on all the scales 
of the EPPS. A second analysis was calculated 
using only the two consistent groups (FF and 
MM). The only significant F ratio found was 
between scales. These results suggest that there 
is no consistent profile difference between 
groups. This finding, however, does not preclude 
the possibility of a difference on one scale, 
Phi coefficients were computed for test-retest 
reliability of the first figure drawn. Small but 
significant coefficients were found. 

In summary, the results of this study gave 
support to the hypothesis that the sex of the 
figure drawn on two administrations of the DAP 
may be indicative of certain personality traits, 
i.e., there was a greater need for aggression, as 
measured by the EPPS, found in females who 
drew males. This drawing behavior was elicited 
in a culture which, according to Lynn (1959), 
rewards masculinity, i.e., is androcentric. 
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NEED FOR SOCIAL APPROVAL AS REFLECTED ON THE TAT? 


RUSSELL P. NORMAN 
Queens College, Charlotte, North Carolina 


This study was designed to explore ‘the rela- 
tionships between need for social approval as 
measured by the Marlowe-Crowne (1960) social 
desirability (SD) scale, and responses to the 
TAT. Subjects were 38 female general psy- 
chology students who represented high (N = 21, 
M = 21.29) and low (N = 17, M = 6.09) groups 
on the SD scale. It was hypothesized that in 
comparison with low SD subjects, high SD 
subjects would overemphasize socially approved 
needs (Achievement and Nurturance) and under- 
emphasize socially disapproved needs (Sex and 
Aggression). It was also predicted that high 
SD subjects would tell shorter stories, and would 
recall fewer of the pictures after having taken 
the TAT. 

Ten TAT cards were administered by female 
examiners using standard instructions. Cards 
were presented in counterbalanced order to con- 
trol for position effects on the recall task. Fol- 
lowing the TAT proper, subjects were given 2 
minutes to recall as many of the pictures as 
possible. Number of pictures recalled and for- 
gotten and number of words in each story were 
tabulated for all subjects. The four needs were 
judged on a 4-point rating scale, ranging from a 
score of zero (no use of the need) to a score 
of three (maximum use of the need). Mean 
interrater reliability was .82. 

Data analysis revealed that predictions were 
upheld for the socially disapproved needs (Sex, 
P< 01, and Aggression, p< 05). Of the 
socially approved needs, only Nurturance was 
significant (p < .01), as Achievement failed to 


1 An extended report of this study may be ob- 
tained without charge from Russell P, Norman, 
Department of Psychology, Queens College, Char- 
lotte, North Carolina, or for a fee from the Amer- 
ican Documentation Institute. Order document No. 
7627 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress, 
Washington 25, D. C. Remit in advance $1.25 for 
microfilm or $1.25 for photocopies, and make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 
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was felt that the hypothesis related to "rh 
as expressed on the TAT was supported. ries, 
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although as predicted, low SD subjects rec 
more of the pictures (p < .01). cially 
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that one’s culture might dictate socially app ai 
outlets for these needs. Although low Pa with 
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SD subjects tended to avoid the vio ra and 
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“acceptable” stories, the recall value es sub- 
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ject had actively dealt with a age that 
interpretation. Also, it may have e stimuli 
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whether in the pictures themselves ke efense® 
fantasies of the subject, sets repressive ult 
into motion which make it more @ Y 
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low SD subjects were more actively 
at all stages of the task. 
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ingen infrequently used source of descriptive 
ita me el about a patient's adjustive behavior 
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ae ae the feasibility of using this informa- 
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unities a seemed to be the major functional 

underlying the descriptive bits. Nine 


expert ; 
sorted noe (psychologists and psychiatrists) 
Pool e bits into these categories. The total 
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GORDON OLSON 
Anoka State Hospital 


3. Emotional Reactions B: Covert and Self 
Directed. The behaviors that define this factor 
seem to reflect an effort to internalize and 
control emotion. 

4. Emotional Reactions C: Lack of Contact; 
Hallucinary, Delusional, and Bizarre Behaviors. 
This factor is represented by behaviors that are 
markedly inappropriate and lack any obvious 
direction toward either self or other. 

5. Interpersonal Relations. This factor seems 
to assess the patient’s willingness to, and poten- 
tiality for, relating well to others, both patients 
and personnel. 

6. Motivation for Rehabilitation. The be- 
haviors with sizable loadings on this factor reflect 
a patient who not only has a desire to leave the 
hospital, but who is also seeking positive means 
to effect his rehabilitation. 

7. General Mental Alertness. This factor char- 
acterizes a patient who is alert, interested, and 
comprehending of events and people in his 
environment. 

8. Responsible and Constructive Activity. The 
behaviors that define this factor reflect a patient 
who actively enters into the hospital program, 
particularly in maintaining a responsible interest 
in a work activity. 

9. Supervision and Care. This factor seems to 
be related to the amount of staff time or at- 
tention required by the patient, other than that 
resulting from an actual physical handicap. The 
factors extracted in this study seem to serve as 
an adequate summary of aides’ and nurses’ 
observations of manifest patient behavior as it 
relates to adaptation to the hospital environ- 
ment. Thus, if rating scales can be developed 
which reliably indicate a patient’s status with 
respect to these dimensions of behavior, we 
could to advantage distill and make use of the 
large heterogeneous pool of information about 
a patient that these personnel possess. 
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A COMPARISON BETWEEN TAYLOR’S AND FREEMAN’S 
MANIFEST ANXIETY SCALES 


IRIS BALSHAN GOLDSTEIN 
Michael Reese Hospital, Chicago 


Both the Taylor Manifest Anxiety Scale 
(T-MAS) and the Freeman Manifest Anxiety 
Scale (F-MAS) are utilized in the assessment 
of anxiety symptoms. In spite of this, there is 
no evidence that these instruments measure the 
same thing. Their intercorrelation of .32, re- 
ported by Alpert and Haber (1960), with 40 
subjects, leads one to believe that the two scales 
are indeed different. 

The intent of the present study was to investi- 
gate the relationship between the Taylor and 
the Freeman scales and also to see how they 
were related to the Guilford-Zimmerman Tem- 
perament Survey (G-Z). 

Eighty women from upper division courses in 
the Department of Education were given the 
T-MAS and the F-MAS. There was a cor- 
relation of .42 between the two inventories, 
accounting for only 18% of the variance. 

G-Z scores were available for 53 of the women 
and were correlated with each of the anxiety 
scales, Among the significant correlations with 
F-MAS were the scales of Personal Relations 
(—.32) and Friendliness (—.36). The signifi- 
cant correlations with the T-MAS were —.35 
with Sociability, —.55 with Emotional Stability, 
and —.55 with Objectivity. 

In general it appears that, although they are 
related, the T-MAS and the F-MAS measure 
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tained without charge from Iris Balshan Goldstein, 
Michael Reese Hospital, P & PI, Chicago 16, Illi- 
nois, or for a fee from the American Documentation 
Institute. Order Document No. 7628 from ADI 
Auxiliary Publications Project, Photoduplication 
Service, Library of Congress; Washington 25, D: C. 
Remit in advance $1.25 for microfilm or $1.25 
for photocopies, and make checks payable to: Chief, 
Photoduplication Service, Library of Congress. 


different aspects of anxiety. Important differences 
between them become apparent when one Be 
at their correlations with the G-Z scales. © 
high negative correlations of the T-MAS hi 
Emotional Stability and Objectivity indica? 
that the highly anxious person would be chari 


. er 
terized by mood fluctuation, worry, and hele 
sensitivity. With a low score on the Socia i 


scale, such a person would neither enjoy 
company of others nor seek social contacts: | 

A different group of G-Z scales are relate uld 
the F-MAS. A high score on the F-MAS pens 
be typical of the individual who had feel 
of contempt and dislike for others. F ons 
scores in Friendliness and Personal Rela 
would also show a tendency for him tO the 
belligerent and hostile. This agrees wit is 
F-MAS instructions, which state that one ? to 
judge the behavior of others in responding ue 
the questions, In accord with the Prole ings 
hypothesis, an individual with negative | ply of 
towards others would not think very hig 
himself. 

The two anxiety scales may C 7 
different G-Z traits partly because of di 
in item selection, validation methods, | i 
instructions. It may also be that anxie A two 
multidimensional phenomenon and that vxiety” 
scales tap different dimensions of ® asses? 
Rather than using one of the two tests t° wish 
levels of anxiety, the experimenter 7 
to utilize a cuir ware on both of them 
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AVOIDANCE OF INAPPROPRIATE SEX-TYPING BY 


YOUNG CHILDREN * 


WILLARD W. HARTUP,* SHIRLEY G. MOORE 


University of Minnesota» 
Anp GLEN SAGER 


John Tracy Clinic 


Í ae was developed for measuring avoidance of inappropriate sex- 
ages 3 Pa a grat by sex-appropriate preferences. 69 boys and 78 girls, 

a u 8, were observed while playing with sex-inappropriate and 

hian va oys. Latency of orienting to inappropriate toys was longer for older 

Spat wath a boys, but no age difference was found for girls. Percent of time 

Both sex 3 eE proprete toys was lower for older than younger children of 

Ihan pos Sa batehicies for boys, but not girls, were longer when E was present 

of m a absent; E's presence did not affect percent-inappropriate scores 

M-F set vos: Ot girls. These avoidance measures were not correlated with 

res on the It Scale for Children. 


, The 

tion Pend study is based on the assump- 

Children i appropriate sex role behavior in 

Goidance of i part, a function of acquired 

vach avoida appropriate sex role behaviors. 

ta both id nce may be acquired by the child 
€ntification with available models 


anq x 
dire 
A 
Dort liscrimination learning. The im- 


. t . 

i Point A mee 

fa boys & at however, is that masculinity 
Mi Ssumed to involve avoidance of 


Mini 
tos mt mle te 
| involve Le femininity in girls is assumed 
Most Oidance of masculinity. 


thi empiri 
| clldren wad studies of sex-typing in 
tay tOac r data in which the effects of 
Constance o o propriate-ser objects” and 
ese Aded, Fo appropriate-sex objects” are 
has Nteq chil ia example, Rabban (1950) 
hich Which en with a group of objects 
Tole ey Were fer Were masculine and half of 
Die, NOG e Rabban’s index of sex 
c sen T consisted of the proportion 
ale Ce for 2 the eid which were ap- 
Mey, Or Ch; sex. Brown’s (1956) It 
ate of dren (ITSC) is Aret 
de Con The role preferences which sim- 
of cies, e S approach and avoidance 
; ae ok from among pictures 
of S © aut, feminine objects are made 
angel Ors sat 
"a Ales, Mae to acknowledge the assistance 
thi, Thig 3 Stein a Akira Kobasigawa, Gay Meyer, 
had aup tty ia conducting this investigation. 
oe aha we Conducted while the first and 
d Deve t the Institute of Child Be- 
pment, State University of Iowa. 


by the subject for a figure in a drawing 
rather than for himself. Because of such 
confounding, most masculinity-femininity 
tests for children are not adequate devices for 
studying avoidance of opposite-sex behavior. 

The present investigation was designed to 
assess young children’s avoidance of inap- 
propriate objects unconfounded with prefer- 
ences for sex-appropriate objects. Specifically, 
avoidance of opposite-sex behavior was 
studied as a function of chronological age and 
the presence or absence of an adult examiner. 

Hypothesis 1. In children between 3 and 8 
years, the relation between avoidance of in- 
appropriate-sex objects and chronological age 
will be positive. This hypothesis is consistent 
with most theories of identification and also 
with the assumption that sex-typing stems 
from direct sex role differentiation by parents. 
For example, Freud (1933) hypothesized 
that superego development (which involves 
sex-typing) is contingent upon the resolution 
of the Oedipus complex at about 5 years of 
age. If school-aged children are indeed more 
identified than preschool-aged children, the 
former should manifest greater avoidance of 
inappropriate objects than the latter. 

The consequences of intentional sex role 
differentiation by parents should also be more 
in older than younger children. 
Presumably, parental demands to avoid be- 
havior typical of the opposite sex are main- 
tained or increased in frequency during the 


apparent 
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carly school years. It seems unlikely that 
such pressures would be reduced at the begin- 
ning of middle childhood. 

Hypothesis 2. In children from 3 through 8 
years of age, there will be greater avoidance 
of inappropriate-sex activities when’ an adult 
is present than when an adult is absent. 
During the early years, few children con- 
sistently self-impose the controls on behavior 
demanded by society. As Sears, Maccoby, 
and Levin (1957) suggest, “self-control is 
dependent on the immediate (or near future) 
presence of someone who can punish or re- 
ward [p. 364].” Although most children 
eventually become independently masculine 
or feminine, it is assumed here that early 
childhood is a transitional period in this 
regard. During this time the presence of an 
adult authority figure should stimulate in- 
creased conformity to sex role standards. 

Hypothesis 3. In children from 3 through 8 
years of age, the relation between avoidance 
of inappropriate-sex objects and appropriate- 
ness of sex-typing as measured by the ITSC 
(Brown, 1956) will be positive. This hy- 
pothesis is based on the assumption, men- 
tioned previously, that a portion of the 
variability in scores on M-F tests such as the 
ITSC derives from individual differences in 
avoidance of inappropriate-sex activities. 


METHOD 
Subjects 


The Ss were 69 boys and 78 girls. The boys, who 
were enrolled in laboratory schools at the Uni- 
versity of Minnesota, were divided into four sub- 
groups: a young nursery school group (YNS) of 
17 Ss, ranging in age from 3-2 through 4-4; an 
old nursery school group (ONS) of 18 Ss, ranging 
from 4-5 to 5-2; a young elementary school group 
(YES) of 17 Ss, ranging from 5-5 through 6-11; 
and an old elementary group (OES) of 17 Ss, 
ranging from 7-0 through 8-2. 

The girls, who attended laboratory schools at the 
State University of Iowa, were also divided into 
four groups: a YNS group of 17 Ss, ranging from 
3-2 through 4-2; an ONS group of 17 Ss, ranging 
from 4-3 through 5-4; a YES group of 22 Ss, who 
ranged from 5-6 through 6-11; and an OES group 
of 22 Ss, ranging from 7-0 through 8-3. 

The schools attended by the Ss typically enroll 
children of higher than average intelligence from 
upper middle-class professional and business homes. 
95% of the Ss were Caucasian. 

Within each of the eight age and sex subgroups, 
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o experi- 


Ss were randomly assigned to one of tw (EP) or 


mental conditions: experimenter present 
experimenter absent (FA). 


Equipment 


Laboratory rooms, 10 X 12 feet, equipped m 
one-way observation facilities were used for into 
serving the Ss. When the child was brought ctly 
the room he was led to a toy table located direct? 


so that he faced the window with his 
Male Ss were presented with an armare ig 
of feminine and “neutral” toys. The aE Ai 
consisted of a doll with extra clothes, @ nT ee 
play make-up, pearl beads, a purse, and some a 
up clothes consisting of high-heeled sl 


with four pegs, a field guide to butter t 
small number of pictures and a technical enildre 
beyond the reading level of any of the ch they 
four blocks with heavy duty snaps SO © 

could be combined in a cluster, and an cl iss 
wooden puzzle of a duck with one piece m 
The feminine toys occupied the right-han n the 
thirds of the table and were separated 


23% 
Sn Qa“ 


neutral toys by a small space. toy" 

Female Ss were also offered two groups “ sculit® 
masculine toys and neutral toys. The cae foo! 
toys consisted of a pair of boxing gloves: ® aed 


a. TS 


ball, a football helmet, a toy bulldozer; ‘ 
tank-transport, a catcher’s mitt and baseba E 
a razor, a pipe, and tobacco pouch. The 
toys used with the girls were the sa 
used with the boys. The 
arranged on the right-hand two thirds 0 1 toys: 
separated by a small space from the neutra oprt 
The neutral toys, in contrast to the inapP! imit? 
sex toys, were deliberately selected for their an 


ur! 

of 

me a5 "ye 

: mber t 
potential for play. They were fewer in nU rhus n 
was oF 

as y 

, 

dichoto™” pa! 


v 
r s W 
masculine 109° gpl 


were generally in disreputable condition- 
dichotomy inappropriate versus neulra 
founded with a multidimensional 


5 > n sumer gtr? 
tractive versus unattractive. It was ave h 
preference by a child for the unattrac active te 
attra ibet? 


toys over the more numerous an elt” y 
appropriate-sex ones would repress s“ 
propriate for S’s sex were available. made fo! 

The selection of toys for boys W3° toys ‘of 
bitrarily by the first two authors. The pasis at? 
girls, however, were selected On ix grade 
ratings by five preschool teachers, a of Fal 


students, five fathers and seven mol a toy? gy 
school-aged children. The nine masculin port 
with girls were rated “substantially of tdg" 
Propriate” for boys than for girls by ‘were 
raters; the four neutral toys used 


avoidance of the latter materials. a 
neutral by at least 19 raters.* ith "y 


2 js us rere 
3 Only three of the feminine toys Lae ie a 
Ss (doll, pearl beads, high-heeled 5” aged 
cluded in the ratings. All os 
feminine by at least 22 of the 23 rate” 


Ph ee were also asked to rate, on a 4-point 
hie ah interest value” of each of the toys. 
girls ean me toys selected for use with 
children tha judged to be of greater interest to 
mean interes any of the four neutral toys. The 
while the = value of the masculine toys was 1.6 
Was 2.9 a ae interest value of the neutral toys 
001, ‘Thus iference which is significant beyond 
founded aS, attractiveness and sex-typing were con- 
in the toy groups for girls as intended. 


D, 
» Procedure 
To begi 
egi 
at the abi the laboratory session, E positioned S 
Stoups, pposite the space between the two toy 


Sirls) then os was female for both boys and 


ere ar 
ley a of toys for you to play with. You 
yoi Play ee them anywhere in the room. When 
these on auth them at the table, try to keep 
Masculine ae Side (E points to the feminine or 
to the Oys) and these on this side (E points 
te gt tov. 
fie the Ep 
Whe While yo 
bein tts tim 
NB Dleasans 
bup © Plea: 


Ne 
“Ntacts 


condition E then said, “I'll sit over 
ure playing, and I'll let you know 
silly oe E then sat down and waited, 
Ver initnesPOnsive to the child’s attentions 
i X ng or deliberately sustaining 
A th, 

, e EA condition Æ said, 


bro Boing to : 
k o i n G H 
Eo n toy So right next door and fix this 


a p JUSt go S indicates material in her hand). 
ew Minutes Acad and play, and TIl be back in 


Cre 5 

Seconds qpserved by a concealed O once every 
timg llea light aà Period of 13.5 minutes. A time- 
lookin! © light Signaled O every 15 seconds, Each 
andin? what g ached, O recorded where S was 
neck; O did Ei touching, and where S was 
strag "8 was a at S between observations. 
th Was? or extrane, ed as feminine (or masculine), 
tly mop adged is depending upon the object 
Wa Sifieg ent of ai ne visually oriented toward at 
ik Servation, Touching was similarly 

the object, if any, that 


>, bein, ‘Pending ð 
W Orin n 
ing, Ptit andleq s 
y at the time of the observation. 


hig Neutra toys was scored as feminine (mascu- 
alkas ju oF extraneous depending vhere the 
tired to he pending on where 

S on the fi e standing in relation to 

atea apo A 36 X 40 inch feminine 

a was marked off in front of the 

© neutral area was 36 X 30 inches 
the Non-sex-typed toys. 

aleney S were derived from the 

a and percent-inappro- 

SS ei touching, and proximity 

> Op io, of the number of time 

ss clo ore S was observed to look at, 

5e to an inappropriate toy. Per- 


Ups 

ae ae 
scores 
Ooking, 
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cent-inappropriate scores consisted of the number 
of intervals in which S looked at, touched, or was 
close to an inappropriate toy divided by the 
number of intervals spent with all toys. 


Observer Reliability 


Two judges observed 41 boys (a total of 2214 
observations) and 19 girls (a total of 1026 ob- 
servations). Agreement (use of the same category to 
classify a given observation) ranged from 94% to 
97% for the latency scores and from 82% to 
97% for the percent-inappropriate scores. 


ITSC 


The ITSC was administered individually to 
each S$ within 4 weeks of the session with in- 
appropriate toys by an examiner who had no 
knowledge of» S’s behavior in the other session. 
Administration of the ITSC followed the procedure 
described by Hartup and Zook (1960). 

The ITSC scores were skewed toward the 
masculine end of the scale for both male and female 
Ss. This skewness was very marked for boys; in 
fact, so little variability existed in the scores of 
elementary school boys that it was decided to 
eliminate these Ss from all analyses involving the 
ITSC. 


RESULTS 


Intercorrelations among looking, touching, 
and proximity scores were computed sepa- 
rately for the latency and percent-inappro- 
priate components. The tau coefficients com- 
puted between the various pairs of latency 
scores ranged from .77 to .79 for boys and 
from .79 to .84 for girls. Product-moment 
correlations computed between pairs of per- 
cent-inappropriate scores ranged from .85 to 
91 for boys and from .78 to .86 for girls. 

Since the individual latency and percent- 
inappropriate scores were so highly inter- 
related, the main analyses of the data were 
completed using summary measures. The 
summary latency score consisted of the mean 
of the three individual latencies; the sum- 
mary percent-inappropriate score consisted 
of the mean of the three individual percent 
scores. 

It was assumed that the latency score 
would be a positive function of avoidance of 


inappropriate play materials. Further, the 
percent-inappropriate score was assumed to 
be an inverse function of such avoidance. 
Consequently, it was expected that these two 
summary scores would be negatively related. 


This expectation was confirmed, but the 
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TABLE 1 


SUMMARY OF EXTENDED KRUSKAL-WALLIS ANALYSIS 
OF VARIANCE FOR LATENCY SCORES: MALES AND 
FEMALES SEPARATELY 


Source H dj 
Males 
Among subcells aie vg 
Age (A) ale 3 
E’s presence-absence (B) 7.01 1 
AXB 1.47 3 
Females 
Among subcells 14,28* 7 
a i 5.83 3 
B 2.46. 1 
AXB 5.99 3 
* p <.05. 
** p <.02, 
=+ p <0. 
association is only moderate. For boys, tau 


— 48; for girls, tau = — 45. 
Both summary scores we 
rately by sex, 

statistical designs 
yses consisted of 
treatments were F’; 


re analyzed, sepa- 
using Treatments x Levels 
- The levels in these anal- 


the four age groups; the 


ysis of variance 


the latency 
nonnormal i 
effective tra: 
be found, t 
conducted 


TABLE 2 


MEAN Ranks For LATENCY Scores or Four AcE 
GROUPS WITH EXPERIMENTER PRESENT oR ABSENT: 
MALES AND FEMALES SEPARATELY 


Age group» 
YNS ONS YES OES 


Males 


Experimenter present 


34.0 37.3 46.8 48.2 

Experimenter absent 15.0 25.0 31.4 42.1 
Females 

Experimenter present 30.8 35.2 576 44.3 

Experimenter absent 37.7 261 36 433 


* YNS = younger nursery school Ss, ONS 
school Ss, YES = younger elementary school 
elementary school Ss, 


= older nursery 
Ss, OES = older 
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Kruskal-Wallis nonparametric analysis of 
variance.* 5 
Latency. The results of the analysis g E 
latency scores are presented in Table k 
First, for boys, the main effects of age wa 
significant (p < .05). The association a 
tween age and latency is positive, i.e. O i 
boys delayed longer than younger m 2). 
approaching the feminine toys (see Table A 
Mann-Whitney U tests between awe 
indicated the following: differences eee 
YNS and ONS and between YES and ter 
were not significant; latencies were shoi: 
in YNS than in either YES or OES, r4 “ONS 
and $ < .001, respectively; latencies in 
were shorter than in OES, p < .02. ice 
The main effects of Æ’s presence or abse 1). 
were also significant for boys (p< o 


TABLE 3 


a 5 ANALYSIS 
SUMMARY OF EXTENDED KRUSKAL-WALLIS ANA 


PRIATE 
OF VARIANCE FOR PERC NAPPROPRIA 


ee 
df 2 
Among subcells 15.89* 7 
Age (A) 9.67* 1 
E's presence-absence (B) BS 3 
AXB 2.71 
* p <05. 

Table 2 shows that for 


s 
all subgrouP 


3 se! 
latencies were longer when Æ was pres 


than when she was absent. ncy 

Next, for girls, the analysis of Jate™, 
Scores revealed that neither the effects of 
nor of E’s presence were significant nce? 
1). As can be seen in Table 2, age differ” 0? 
in girls’ latency scores were in the ts of 
direction as the boys’. H for main effec” ; 


Ha e 
age, however, fell short of significane 
< .20). 


ple 


priate scores revealed significant main © 
of age (p <.05, see Table 3). Young 
showed more frequent tna te 
than older boys. Mann-Whitney g 
tween subgroups, as summarized in 


a use“ 
n 

*As developed by Ray Hyman & 
Rosenblith (1959). 


| a TABLE 4 
Mean IKS Fi 
a Big FOR PERCENT-INApPROPRIATE SCORES 
“oa Groves with EXPERIMENTER 
RE OR ABSENT: MALE SUBJECTS 
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TABLE 6 
MEAN PrERCENT-INAPPROPRIATE SCORES FOR Four 
AGE Groups WITH EXPERIMENTER PRESENT OR 
ABSENT: FEMALE SuBJECTS 


r -\ge groups p Age group? 
enoe NS NS OES — YNS YES OES 
“xPerimente P zz = 

er prese 5 
Experimenter a ent 34.0 35.3 26.8 24.7 Experimenter present 76.5 62.0 37.0 33.0 
absent 53.0 17.0 34.6 23.9 Experimenter absent 62.3 O41 53.9 50.7 
| r nursery school < by older nursery a YNS younger nursery school Ss, ONS = older nursery 
unger elementary s, OES = older school Ss, = younger elementary school Ss, OES = older 


ary school Ss, 


indi 
ae nc differences between the two NS 
stint etween the two ES groups were 
Oys * Sig on. Rice groups of preschool 
minine significantly more time with the 
Sc n oys than either group of elementary 
Presence fa (p< .05). The effects of E's 
StOreg Was absence on percent-inappropriate 

he i Significant for boys, 

ere im oe scores for girls 
atiance a using parametric analysis of 
icant ne main effects of age were sig- 
dicated. T < .005, see Table 5). ¢ tests in- 
not iffer e two groups of preschool girls did 
“Mentar rom each other nor did the two 
8toups 5 y school groups: both nursery 
Nascy or significantly more time with the 
Schoo] e toys than either of the elementary 
Bs tee (see Table 6). The effects of 
for girls €nce or absence was not significant 


Rela; 
lna ation of ITSC Score to Avoidance of 
Omputeg tee Toys. Tau coefficients were 
Sum, ar between ITSC scores and the two 
Mtation Measures of avoidance. These com- 
S were completed separately for pre- 


8 TABLE 5 
Wty ae ABLE 5 


AN: 5 OF ney PERCENT- 
NAPPROp. NALYSIS OF VARIANCE FOR PERCENT 


RIATE Scores: FEMALE SUBJECTS 


Sourc, 

Ago Source af MS r 

by (Ay Sy eee een 

Drege 3 3174.2 32“ 

Cape) NCe-absence (B) 1 855.6 1.41 

D R (7) 

i 1.62 
lls (w) 3 982.9 5 
Wty of 608.3 
*e T e 


elementary schoo! 


school and elementary school Ss for two 
reasons: the foregoing analyses revealed age 
differences in the avoidance measures, and 
other studies (Brown, 1957; Hartup & 
Zook, 1960) indicate that ITSC scores are 
also related to age. 

For boys, positive correlations were ex- 
pected between latency and ITSC scores 
(since high scores on this scale are equivalent 
to high masculinity). Negative correlations 
were expected between percent-inappropriate 
and ITSC scores, Correlations in the opposite 
direction were anticipated for girls. Table 7 
shows that the direction of each coefficient in 
the table is as expected, but all are essentially 


of zero-order. 
DISCUSSION 


Age differences. The data confirm the hy- 
pothesis that, with increasing age, young 
children increasingly avoid inappropriate-sex 
objects. With no exceptions, the elementary 
school Ss spent less time with the inappro- 


TABLE 7 
NS BETWEEN ITSC SCORES 


KEN n RANK CORRELATIO: 
Te ‘AVOIDANCE OF INAPPRO~ 


AND Two MEASURES OF 


PRIATE SEX-TYPING 
E Avoidance 
measure 
Percent- 
inappro- 
Subjects Latency priate 
Subjects 
Girls k: p 
Nursery school (V = 25) : aal a 
Elementary school (V = 44 
- 07 —.07 


Nursery school (N =35) 


s were not given the ITSC because 


= sery irl 
a Nine nursery school gi school teacher. 


e examiner was their nursery 


th 
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priate toys than the nursery school Ss. Also 
without exception, the latency of initial 
orientation to the inappropriate toys was 
longer for the ES groups than for the NS 
groups. These differences were all Significant 
except for the latency scores of girls. Thus, 
increasing avoidance of inappropriate sex- 
typed activities appears to be characteristic 
of both sexes during early childhood, although 
manifestations of such avoidance may be 
more general in the behavior of boys. 

An alternate interpretation of the age dif- 
ferences obtained in this study is that “ap- 
proach to neutral toys” increases with age. 
The authors reject this interpretation on the 
grounds that the neutral toys used in the 
study do not ordinarily attract more interest 
from older than from younger Ss. The eight- 
piece wooden puzzle, the plastic pegs, and the 
four blocks are toys ordinarily provided for 
preschoolers, not children in the primary 
grades. The only exception to this generaliza- 
tion is the adult book which was the fourth 
neutral toy. The volume used, however, was 
beyond the capacities of any of the children. 

Also, the observers reported frequent in- 
cidents which suggest that persistent play 
with the neutral toys is interpretable as 
avoidance of the sex-typed toys. Comments 
such as “Those are for girls,” and “Those are 
boys’ toys” were frequent. Play with the 
neutral toys, particularly among the older 
Ss, ‘was commonly nonvigorous, repetitive, 
and interrupted on many occasions with side- 
long glances at the sex-typed toys and E (if 
present). A tendency to “approach neutral 
toys” may have been Operative in the be- 
havior of some Ss but, for the reasons given, 
It seems most plausible to regard the differ- 
ences obtained as indications of different 
degrees of “avoidance of inappropriate-sex 
toys” rather than “approach to neutral toys.” 

The findings with respect to age suggest 
that some factor or factors associated with 
entrance to elementary school operate to in- 
crease avoidance of inappropriate-sex activity. 
The percent-inappropriate scores of both boys 
and girls indicate: ES subgroups were more 
avoidant than NS groups, but neither the 
two ES groups nor the two NS groups differed 
from each other. 


Pressure to avoid inappropriate-sex activi- 
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ties is probably increased from a variety o 
sources about the time middle-class children 
enter elementary school. Parents ee = 
creasing concern about the adequacy a 
child’s sex-typing; teachers may also 1m a 
duce demands for sex-typed behavior. ae : 
routines are different in nursery and pa 
mentary schools. For one thing, ag rein 
and toilets are no longer shared by the an 
sexes. Also, changes in peer group intera 
may contribute to the increased avolda 

of inappropriate activities found amore tas 
children in this study. Sociometric ae i 
repeatedly show that an outstanding fea i- 
of peer groups in middle childhood (as Py 
pared with earlier or later) is cleavage k 
sex. Consequently, future investigation sng 
avoidance tendencies in relation to ean ds 
should include a study of children’s dema a 
on each other for appropriate sex-tyP 

behavior. iners 

Differences Associated with Exam nce 
Presence. The hypothesis that the pres¢ ae 
of an adult authority figure stimulates ys 
creased avoidance of inappropriate-sex er 
was confirmed for boys only. Both ole 
latencies and smaller percent-inapprop™ 
scores were found when male Ss ve fier- 
served with E present. This EP-EA atency 
ence, however, was significant for the la 
scores only. 

On the surface, the findings suggest ad 
difference in sensitivity to the presni a 
an authority figure. This difference ae 
variety of intriguing questions. Is the pe i 
of “internalization” so complete or S° ound 
mentary in young girls, as compared tO 1 t 
boys, that the presence of an adult poe rolê 
stimulate additional conformity tO e ex 
Standards? ® Or, is a permissive femé ultur? 
aminer an irrelevant stimulus in this © irls? 
for sex role conformity in young study 
Would the sex difference noted in this mal? 
be found if the examiners had bee? “and 
adults? Parallel effects on young POY? are 
young girls may not occur unless Æ re le, 
of opposite sexes. There is, fF Cainin 
strong indication in the literature P°™ 4¢ 

i rop” g 
jnapP’ usi" 
assess yet? 


a sex 


° The relative levels of avoidance of 
toys by males and females cannot be ses 
the data of this study since the pe” 
hot observed with identical toys. 


AVOIDANCE oF INAPPROPRIATE SEX-TYPING BY CHILDREN 


fren Dom gece nese effects with young 
ta whett hat performance varies according 
ot a E and S are of the same or op- 
Sien caer Gewirtz & Baer, 1958; 
le, 61). In any event, the effect of 
isowoEth a on children s sex-typed behavior 
male ph further investigation. The use of 
Mido needed before broad 
this ay ns can be drawn from the data of 
m a Avoidance Measures to ITSC 
ems 1 low correlations were obtained 
ures, Suci E scores and the avoidance meas- 
typing is h findings may indicate that sex- 
avis ai noe a single dimension in the be- 
Rested by Sak children, an hypothesis sug- 
and, in th orstelmann (1961). On the other 
are Dati he of girls, the present findings 
scores a = puzzling. The girls’ ITSC 
0 tasen i heavily skewed in the direction 
obtained mare In fact, 29 of the 44 ES girls 
Very imag masculine scores. It seems 
Suc mea that ES girls should obtain 
avoj the E ine scores at the same time they 
a greater — toys in the laboratory to 
cenil ent than the NS girls. — 

ing a questions have been raised con- 
Measure Vs adequacy of the ITSC as a 
example ma role preferences in girls. For 
Significant artup and Zook (1960) found 
Scale afier increases in femininity on this 
Sexlegg dra Preschool girls were told that the 
Was a o o Wing (for whom S make choices) 
girl. Kohlberg and Zigler (1962) also 


Cern 
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suggest that the procedures used in the ITSC 
may be inappropriate for girls. Consequently, 
the low correlations obtained in the present 
study may, at least for female Ss, result from 
deficiencies in the ITSC. 
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RELATIONSHIPS BETWEEN PERFORMANCE ON THREE 
TESTS FOR ORGANICITY AND SELECTED 
PATIENT VARIABLES * 


PATRICIA J. BRILLIANT axo MALCOLM D. GYNTHER 


Washington University 


This study compared the predictive accuracy of 3 commonly used tests a 
organicity: Bender Gestalt, Benton Visual Retention, and Graham-Kendi 
Memory-for-Designs. Each of the 147 patients admitted to the hospital Aung 
the experimental period was approached for testing. Complete data were 0 
tained from 120 patients later classified as Chronic Brain Syndrome, Acute 
Brain Syndrome, Psychosis, Personality Disorder, Chronic Alcoholism, and 
Other. Results showed that test performance is not related to race or sex, but 
is significantly related to age, IQ, and education. When the effects of these 
variables were statistically controlled, performance on all Measures predicted 
the clinical diagnosis of organicity at less than the .001 level of significance. 
The best single measure was the Bender which, scored by the Hutt-Briskin 
method, correctly identified 82% of all patients. 


Psychologists are frequently asked to assist 
in the diagnosis of organicity. Among instru- 
ments commonly used for this purpose are 
the Bender Visual Motor Gestalt Test 
(Bender, 1938), the Graham-Kendall Mem- 
ory-for-Designs Test (Graham & Kendall, 
1946; Graham & Kendall, 1960), and the 
Benton Visual Retention Test (Benton, 1945; 
Benton, 1955). 

Many investigators have attempted to 
demonstrate the usefulness of these tests. 
However, since few of these research workers 
have used more than one of these tests, little 
is known concerning the tests’ differential 
effectiveness with a given sample. 

Factors other than diagnosis which have 
been shown to influence performance on 
these tests are IQ (Aylaian & Meltzer, 1962: 
Griffith & Taylor, 1960), education (Pascal 
& Suttell, 1951), and age and length of 
hospitalization (L’Abate, Boelling, Hutton, 
& Mathews, 1962). However, studies which 
do not control for these variables are com- 
mon (e.g., Bowland & Deabler, 1956; Gar- 
rett, Price, & Deabler, 1957; Goldberg, 1959; 
Howard & Shoemaker, 1954; Mehlman & 
Vatovec, 1956; Niebuhr & Cohen, 1956; 
Robinson, 1953). 


1This paper is based on a Master’s thesis done 
by the senior author, under the direction of the 
junior author, at Washington University, St, Louis, 
Missouri, 1962. 


r , r esting 
Another important issue involves test 


the instruments by means of designs en 
relate to their clinical use. In other oe 
psychologists are usually asked to other 
entiate between organicity and some jn 
(frequently unspecified) psychiatric als of 
drome, not between organics and norm oi 
between organics and nonpsychiatrlt ise 
tients. However, studies which are othe sor 
well controlled do attempt precisely te i 
of atypical discrimination (L’Abate, 


: rker, 
man, Vogler, and Chused, 1963; Pa 
1954; Wahler, 1956). ability 

Another point concerns the generaliza ad 


of findings. If many restrictions are Pi to 
upon one’s sample, it is often not poss! 7 
generalize beyond the sample obtain’ rit 
an illustration, the Pascal and Suttell Pe ted 
procedure for the Bender (1951) is “oat one 
to subjects aged 15 to 50 with at ae pus 
year of high school. This procedure any 
inapplicable to data obtained from o 
psychiatric patients. Also, if samples ved J 
match base rates at the hospital ve joc! 
the study, generalizations even WÍ 
significance are difficult to draw. tigation 

The purpose of the present a mn 
was to use three tests for organiclty rictet 
heterogeneous psychiatric sample ‘a Bs 
as to age, intelligence, and edi sia 
determine the effects of these sgt 
well as those of race, sex, and dias” 
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TESTS FOR ORGANICITY AND PATIENT VARIABLES 


METHOD 
Subjects 


ree seo admitted to Malcolm Bliss Mental 
expetimn enter, St. Louis, during the 6 weeks 
a a period was approached for testing. 27 
Bue anes 20%) of the patients admitted 
tanie of s time were considered untestable be- 
fompi extreme belligerence, autism, and/or in- 
pleted geod The 120 subjects who com- 
females S test battery were composed of 50 
QIN (22 Negro and 28 white) and 70 males 
a ied and 49 white). The age range of the 
ig was 17 to 84 years with a mean age of 
with ms Education ranged from 0 to 16 years 
jects cou ft of 9.1 years. Nearly all these sub- 
r „be described as falling in the lower 
of tisks omic class with occupational classifications 
Skilled or semiskilled workers. 


Test 3q aterials 


Te i materials used in the present study con- 
Bender Vi standard sets of each of three tests: 
tion (p isual Motor Gestalt, Benton Visual Reten- 

orm C), and Graham-Kendall Memory-for- 


esi 

Beliy The Vocabulary subtest of the Wechsler- 
€, Form I, was also used. 

Dy 
£ rocedure 

Ev 5 

aye Subject was given the test battery within 
of h of admission in order to control for length 


of whip zation and degree of medication, both 
lect was May affect test performance. Each sub- 
Counterbalanen the three tests for organicity, in 
and then ‘anced order to control for practice effects, 
latter aaa the Vocabulary subtest. From this 
is wag” a verbal IQ estimate was obtained. 
S subsequently used to correct for the 
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influence of intelligence on the Benton and 
Graham-Kendall tests. 

After each test battery was given, the tests were 
scored without knowledge of the patient’s diagnosis. 
For the Benton and Graham-Kendall tests, the 
standardized objective scoring procedures which 
yield difference scores corrected for age and intel- 
ligence were used. Two scores were obtained for 
the Benton, a “number correct” and a “number of 
errors” score. To score the Bender, Hutt and 
Briskin’s (1960) system was used. Subjects were 
classified as “organic” in terms of Bender perform- 
ance if they made five or more of the errors of 


rotation, overlapping difficulty, closure difficulty, 
cohesion, perseveration, retrogression, angulation 
difficulty, fragmentation, collision, simplification, 


impotence, and motor incoordination, all of which 
have been described by Hutt and Briskin. This 
scoring procedure was selected over other methods 
because of its relative simplicity (with practice, 
about 3 minutes per test protocol) and because of 
the lack of previous published research on the effec- 
tiveness of this technique. 

After the test battery was scored, the psychiatric 
staffing diagnosis was obtained. This was used to 
assign each subject to one of five diagnostic cate- 
gories: Chronic Brain Syndrome (CBS), Acute 
Brain Syndrome (ABS), Psychosis (Psy), Person- 
ality Disorders (PD), and Chronic Alcoholism 
(CA). The diagnostic groupings and their respective 
frequencies are presented in Table 1. Technically, 
CA should have been included under the PD 
category, but it was felt that in these patients there 
might be subclinical organic involvement sufficient to 
confound the results if they were included in the 
PD category. Subjects with the diagnosis of Psy- 
choneurosis (N =4), Mental Deficiency (N=4), 
and Undiagnosed (W = 2) were excluded from anal- 
yses involving diagnosis because of the small number 


of subjects in each category. 


TABLE 1 
———— DIAGNOSTIC CLASSIFICATION OF ORGANIC AND NONORGANIC PATIENTS 
m Organics N Nonorganics N 
Acute brain disorders Psychotic disorder 
associated with: , y 
Affective reaction 7 
Intoxication 14 k ; r 
Schizophrenic reaction 26 
Chronic brain disorders hes 
associated with: Personality disorders nlf 
Circulatory disturbance 4 Chronic alcoholism 26 
Convulsive disorder 4 Mental deficiency 4 
Senile brain disease 3 Psychoneurosis 4 
Alcohol intoxication 5 Undiagnosed 2 
Disturbance of metabolism 1 
Unknown cause 3 
> Total 34 86 
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7 
TABLE 2 
MEAN TEST SCORES FOR THE DIFFERENT DIAGNOSTIC CATEGORIES s i one 
nc Benton Benton Graham- Sy 4 
Di Ay Bender (correct) (error) Kendall verbal IQ = 
ragno: - a Gi su 2 E 
© n 89.5 
o “BS 19 6.0 47 11.9 aa 
aus 14 5.3 3.4 6.4 oe 
Pay 33 3.2 2.7 5.2 02.5 
i 17 2.1 S 2.0 af 
CA 26 33 2.2 3.9 102.6 
2 2 : 
7 5 92.6 
All groups 110 3.0 27 5.6 39 


pa 


From the Annual Report of Malcolm Bliss, it 
was found that the admission rates for the year 
1960-61 were CBS, 15%; ABS, 15%; Psy, 35%; 
and PD, 17%. No base rate was given for CA 
as a group, since these were included under PD. 
By testing all patients entering the hospital during 
the experimental period, an attempt was made to 
match the experimental base rates with the actual 
base rates of the hospital. The success of this 
attempt can be judged by the observed base rates 
of the experimental sample: CBS, 17%; ABS, 12%; 
Psy, 28%; PD, 14%; and CA, 22%. Analysis of 
the difference between the actual and the observed 
base rates (excluding CA) showed that 
did not differ significantly (x? = 2.80; df=3; 
p> 50); therefore, for the Purpose of this in- 
vestigation, it could be said that the experimental 


base rates approximated those of the annual admis- 
sion base rates, 


the two 


RESULTS 


Table 2 presents the mean scores of the 
five diagnostic groups for each of the four 
test measures. On each test the two organic 
groups performed more poorly (i.e. 
higher scores) than the nonorganic 
This table also reveals that the separate 
classification of CA from PD was warranted 
since, on every test, the mean score of the CA 


, earned 
groups. 


TABLE 3 


CORRELATION COEFFICIENTS OF AGE, EDUCATION, AND 
IQ Witt Four DIFFERENT TEST 
MEASURES 


Benton Benton Graham- 
Bender (correct) (error) Kendall 


40 


Age 26 40 33 
Education — 41 ~~ 02 —.19 A 
1Q —33 A2 07 —.30 


Note. —For N = 120, an r of 18 is significant at the 


205 level; 
an r of 23 is significant at the O1 level. 


group was higher than that of the PD gon 
Combining these two groups would ‘the 
concealed the true differences between n 
two. Also included in Table 2 is the a 
verbal IQ estimated from the bins 
subtest scores for each diagnostic group. re 
mean estimated verbal IQ for the total samp 
was 92.6, nat 
The first step in analyzing the results Wa 
to consider the possible effects of sex Pra 
race upon test performance. Relations te 
between sex and test scores were found 2) 
negligible (r,»;, ranged from .02 to - a 
but those between race and Bender m 
Graham-Kendall scores were significant ee 
the .05 level (rp = .19 and .21. ae 
tively). Further inspection of the data r š 
gested that these significant relations P 
might be due to the influence of a ne 
factor, such as age or educational difference” 
between the races. When the effects of ie 
were partialled out, the test score-race 77). 
tionship became nonsignificant (71.28 ~~ ie 
As these analyses had shown that sex and i 
were unrelated to test scores, these data Y 
combined for all other analyses. tween 
Table 3 presents the correlations be x 
age, education, and IQ and test scores: | 
amination of these data reveals that es are 
the relationships between these variab ‘rising 
significant. This was particularly up the 
in the case of the Graham-Kendall am tests 
Benton, as scores derived from these 
are corrected for age and IQ. fect test 
Since age, education, and IQ af «ariance 
scores, adjustment by analysis of are 8 
was necessary. The three predictor Hiei s 


ce 
re 


pijeti 
talinn S 
problem presents many statistical Sabi es wat 
so reduction to two predictor vari 
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ae Nae ae IQ was the variable elimi- 
were. hi he grounds that education and IQ 
ie highly correlated (r= .67 for 118 df; 
highly oe ae that education was more 
w st ates in every case with test 
results of jA was IQ. Table 4 shows the 
should be he analysis of covariance. (It 
fiale ont np that adjustments were 
scores ot or age on the Benton correct 
wificant| Q and education were not sig- 
spection” Bags with these scores.) In- 
? ratios w this table shows that all four 
level. ere significant at less than the .001 
hie, indicates the differential discrim- 
Por scat ca of the various test measures. 
seems to p iagnosis of all cases, the Bender 
% ft the best single measure (i-., 
least fide ct). The Benton error score was 
number igo with only 66° of the total 
only the n cases correctly classified. When 
rectly dia umber of nonorganic patients cor- 
tion of eta was considered, the combina- 
Correctly wae Benton scores classified 98‘% 
diagnosed aei only those cases correctly 
as organic were considered, the 


Bend 
er x z 
“hiv ae the best measure with a 67° 
e. The most unsatisfactory result was 
Axan TABLE 4 
ALYSI > Co 
S or Covarian Test SCORES, AGE, AND 
— EDUCATION 
e “oe 
Eos Source dj MS F 
a a e ear oe at 
Ct Adjusted means 4 51.85 418* 
Within-groups 103 1.24 
B Total 107 
entor ” 
Correc) Adjusted means 4 34.50 13.3* 
Within-groups 100 2.60 
B Total 104 
Ento . 
(error) Adjusted means 4 266.90 79.9" 
Within-groups 103 3.34 
G Total 107 
taham- J 
Kendali Adjusted means 4 315.60 11.7* 
Within-groups 103 26.90 
in aa Total 107 
S00, ee re 


477 


TABLE 5 


DIAGNOSTIC ACCURACY (PERCENT CORRECTLY Diac- 
NOSED) OF TESTS FOR ORGANICITY 


Non- 

organic Organic 

diag- diag- 

nosis nosis Total 
Bender 92 67 82 
Benton (correct) 85 62 76 
Benton (error) 95 47 66 
Benton (combined) 98 62 81 
Graham-Kendall 88 03 78 


the number of organics correctly diagnosed by 
the Benton error score; only 4790 were cor- 
rectly classified. It should be emphasized that 
in every case the percentage of nonorganic 
patients correctly classified was much larger 
than the percentage of organic patients cor- 


rectly classified. 


Discussion 

The results of this study indicate clearly 
that performance on three different tests for 
organicity does predict clinical diagnosis of 
organicity at a very high level of significance. 
Although the present data show better dif- 
ferentiation of organic from other clinical 
groups than do the data of L’Abate, Boelling, 
Hutton, and Mathews (1962), they do not 
achieve the dramatic discrimination of organics 
from nonorganics reported by Garrett, Price, 
and Deabler (1957) who had reported an ex- 
tremely low misclassification rate. These dif- 
ferences in diagnostic accuracy seem to be a 
reflection of the differentiation attempted. 
In the study by L’Abate, in which organics 
were compared with schizophrenics, the ex- 
perimental task was more difficult than that 
usually encountered in a hospital setting: 
namely, diagnosis of organicity in a heteroge- 
neous psychiatric group. Garrett chose a 
control group of “normals” (i.e., hospital 
physicians and administrative personnel) to 
compare with the organics, a problem which is 
much simpler than the usual task of the 
clinician. 
For proper evaluation of the predictive 
powers of tests, the organic base rate of a 
hospital population must also be considered. 
In the present study, the base rate was 30%; 
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therefore, a nonorganic diagnosis would be 
correct in 70% of the cases. Using the 
Bender, however, one would expect to be 
correct 82% of the time, thus resulting in a 
12% increase in “hits” over what would be 
obtained by using the base rate strategy. The 
Benton error score, on the other hand, cor- 
rectly diagnoses only 66% of the patients, 
demonstrating that use of a “good’ diag- 
nostic instrument may lead to a decrease in 
diagnostic accuracy! 

The most vulnerable feature of the present 
investigation may be the use of the psy- 
chiatric staffing diagnosis as the criterion for 
organicity, although the reliability of this 
criterion may have been improved by com- 
bining psychiatric diagnoses into a few broad 
classifications. Unfortunately, alternative ap- 
proaches to the criterion problem (e.g., EEG) 
have proved to be as unreliable as psychiatric 
diagnoses. 

The present study might also be questioned 
for not following actual clinical practice in 
that objective scoring criteria were used for 
evaluating the tests of organicity instead of 
the usual “global” judgments. But Goldberg 

(1959) has shown that the global judgment 
method of clinicians is not very accurate. The 
present study has indicated that, with the 
Hutt and Briskin (1960) system, a high level 
of diagnostic accuracy can be obtained with 
the Bender. 

As a final consideration, a practical ques- 
tion may be raised, namely: what are the 
consequences of “overdiagnosing” or “under- 
diagnosing” organicity? A review of the 
present results shows that the four test 
measures diagnosed the following number of 
cases as organic: Graham-Kendall, 32 (59 if 
borderline cases are included); Benton cor- 
rect, 39; Bender, 45; and Benton error, 68. 
The comparable figure for the Psychiatric 
diagnosis of organicity was 34, 

If we assume that the true number of 
organics lies somewhere between these two 

extremes (i.e., 32 and 68), the test measures 
usually overdiagnosed organicity, whereas the 
psychiatrist underdiagnosed organicity, The 
consequences of these “errors” are as follows: 
the psychological measure will result in a 
higher percentage of valid Positives and a 
lower percentage of false negatives than the 
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psychiatrist’s; however, the psychiatrists will 
obtain fewer false positives than the p 
chological tests. In other words, the tests ka 
predict nonorganicity more precisely man 
the psychiatrists who, in turn, will predic 
organicity more precisely than the tests. 
Perhaps the psychiatrist is the “instrument 
psychologists have been searching for tO 
predict accurately the positive occurrence. 
The combination of psychiatrist to predict ol 
ganicity and psychologist to predict a 
organicity (both given equal weight) may i 
the most discriminating approach to the 
problem. 
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UNCONSCIOUS SELF-EVALUATION AS A FUNCTION 


OF AVAILABILITY OF CUES* 


RALPH ROTHSTEIN 
Institute jor Psychosomatic and Psychiatric Research and Training 
Michael Reese Hospital, Chicago 
anD SEYMOUR EPSTEIN 


University of Massachusetts 


20 college coeds rated how much they liked each of 4 people judging from 
the sound of their tape recorded voices Presented at 6 levels of distortion. 
Following this, they rated the voices for how similar they were to their 
own. Unknown to S, 1 of the voices was her own. At a later date, each S 
rated her voice after it was identified as such. It was found that Ss rated 
their own unrecognized voice as most likable when it was neither very 


distorted nor undistorted. Similarly they 
torted voice when the recognized similar 


reacted most favorably to their undis- 


ity was moderate in degree, Opposite 


to findings on unrecognized voice, Ss reacted unfavorably to their recognized 


voice, 


In comparison to the rather voluminous re- 
search dealing with the phenomenal self, 
relatively little experimental work has been 
done on nonconscious aspects of self-judg- 
ment. Wylie (1961) points out that those 
studies dealing with the unconscious or non- 
phenomenal self have typically been vague 
and incomplete and have not yielded a clear 
pattern of results. This she attributes, in 
part, to methodological problems and the 
difficulties inherent in establishing construct 
validity when working with unconscious 
phenomena. 

One approach to the study of unconscious 
self-attitudes which Seems promising surpris- 
ingly has been employed by only three in- 
vestigators, (Epstein, 1955; Huntley, 1940; 
Wolff, 1933; 1943). Introduced by Wolff in 
1933, the technique utilizes judgments by 
people of others’ and their own expressive 
movements. In all three studies it was found 
that Ss rated their own unrecognized self- 
productions differently from how they rated 
others. Wolff (1943) and Huntley (1940) 
reported that self-ratings in the absence of 
recognition tended to be extreme in both di- 
rections. They hypothesized that as disguise 
of an expressive movement increases, self- 
rating becomes increasingly extreme, The hy- 


1 The study was done while Rothstein was at the 
University of Massachusetts. 


pothesis follows from the assumption thar w 
conscious determinants play an AET TE] 
role as disguise increases. Epstein (1 er- 
found no evidence for a group of ig 
evaluators, and attributes Wolff's 
Huntley’s findings to a statistical alacant 
found the typical reaction to an unre S 
self-production to be one of ae the 
and that increasing the disguise decrease 4 
amount of overestimation. The studies ex- 
self-attitudes referred to utilized m o- 
pressive forms, e.g., voice, handwriting, P 
graphs, etc. presented at a single ley” 
disguise. It would thus not be aeterna ed 
what extent the judgments were influ form 
by degree of disguise apart from ae in 
in question. A more desirable APP" i jude 
studying the relationship between sell) jne 


how judgments change as self-related + ole 


expressive form. This is done in a 
study by presenting voice samples ab stiga" 
levels of distortion, In addition to ane unc 
ing “unconscious” self-judgments a makes 
tion of disguise, the present study ks was 
comparisons with conscious ratings- sags 1% 
selected because it offers important itself t 
forming judgments of others, lends readily 
experimental manipulation, and is = 
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Unconscious SeLr-EvALUATION 


recogni 
e Rizen when presented as a tape record- 
& Of someone else. 


: METHOD 
Subjects 


Tw | 

peat, female college students enrolled in the 

Massach Ory psychology course at the University of 
achusetts served as Ss. 


in 


Procedure 


In 
alure aT Dt to arouse suspicion about the 
Purpose of p study, Ss were informed that the 
Telationshi ba investigation was to determine the 
ality, The een quality of voice and person- 
ventory Was ae rita Multiphasic Personality In- 
convincing, Es ministered to make the instructions 
aving 5 = anplas of voice were obtained by 
Paragraphs i two simple and afiectively neutral 
Ings Could ae a microphone so that tape record- 
Pleteq e made.? When recordings were com- 
Serve as Was requested to return in 2 weeks to 
tecordings warase of the voices of others. Voice 
0 be described Presented at all levels of distortion 
Out knowin bed below, and S rated her own (with- 
favorablens it) and the voices of three others for 

Proxima ess” and then for “similarity recognition.” 
ey. tely 1 month later, S listened to and 


aluated 
er own, her own voice after being told it was 


Proc 
edur PP š š 
ure for Obtaining Stimulus Materials 


ai 
tortion y were obtained for three levels of dis- 
(Voice Fop ice Presented in the natural manner 
Which pia and for three levels of distortion 
m kward) me was played backwards (Voice 
© followin © obtain the Voice Forward levels, 
Voice ou Procedures were used: 
a oorde A (VF). The playback of Ss tape 
entron ce was made as clear as possible, using 
t Voice an 9T3C tape recorder. 
abe Was eara Moderately Distorted (VFD:). The 
f Other i €-recorded through a microphone onto 
delity j pe. As a result of the double recording, 
T ies T reduced, 
pa Same orward Considerably Distorted (VFD:)- 
i Ond time oeesure as in VFD, was repeated a 
fe Volume However, during the second re-record- 
tency m, Was maximally increased and range of 
fola Aly restricted. 
n the Ves procedures were employed to 
Dla Vice Bache: Backward levels: , 
repped in ward (VB). Voice recordings were 
sth, ductio z background direction, making the 
an, n unintelligible and sounding like a 


~E language, 
2 


Th 

a e 

verted (paragraphs read by the subjects were 

SE n ae Memory for Stories (The Wet Fall), 
m L (Terman & Merrill, 1937). 
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Voice Backward Moderately Distorted (VBD:). 
Beginning with VB, the same procedure was fol- 
lowed as for VFD. 

Voice Backward Considerably Distorted (VBD:). 
Beginning with VBD:, the same procedure was fol- 
lowed as for VFD:z. 

Preliminary to the experiment proper, voice levels 
were presented to a group of 10 graduate students 
who rated them on degree of distortion. There 
was perfect agreement for both Voice Forward and 
Voice Backward. 


Measurement of Unconscious Self-Judgment 


Two weeks after recordings were made, S was 
presented with four voices to judge, one of 
which, unknown to her, was her own. Following the 
procedure used by Huntley (1940), measures were 
obtained by the use of a graphic rating scale. The 
scale was anchored at 6 points with labels varying 
from “dislike extremely” to “like extremely.” In- 
structions were given to use the entire sensitivity 
of the scale by drawing a line across an appropriate 
point on the scale, rather than simply checking an 
internal. All four voices were presented so that S 
could develop a frame of reference before any 
ratings were made. The voices were then presented 
for S to rate immediately following each. A reminder 
was given to rate the person, and not the voice, 
per se. Since each S rated and was rated by four 
other Ss, the data could be arranged in a series 
of 4X4 matrices in which the rows represented 
raters and the columns voices. In one case, the self- 
rating was the first presentation, in another the 
second, in another the third, and in another the 
fourth, thereby controlling for position effects. VB 
preceded VF, and order of presentation was always 
from greatest to least distortion. The S was not 
informed that actually the same four voices were 
presented to her for ratings, under different con- 
ditions of distortion. 

It was decided at the beginning of the experi- 
ment that if S spontaneously recognized her voice 
while rating it for favorableness, her data would 
be discarded. That no such spontancous recognition 
occurred is probably a result of the strong set 
produced by informing Ss that they would rate 
the voices of others, as well as a result of the 


small work sample used. 


Measurement of Similarity-Recognition 


To help insure that only unconscious self-ratings 
were included, as well as to obtain information on 
the relationship of self-evaluation and level of 
recognition, ratings for “similarity recognition,” as 
described by Epstein (1955), were obtained. After 
completing ratings for favorableness, S was asked 
to rate how similar each voice was to her con- 
ception of her own. A rating scale anchored at six 
points was used for this purpose, the range of 
which varied from “opposite” to “identical.” It 
had been decided that if a rating of identical oc- 


CHANCE \EXPECTANCY 


RELATIVE FAVORABLENESS 
o 
> 


VBO, 


V0, 
<— DISTORTION 


Fic. 1. Relative (self minus other) mean favorable- 
ness scores for all levels of distortion. 


VB VFD, VFO VF 


curred, S would be questioned to establish whether 
she had recognized the voice when rating it for 
favorableness, Only one rating of identical occurred 
and this at the VFD, level. When questioned, S 
said that she had not recognized it previously, but 
did when directed to look for similarity to her own. 


Procedure for Obtaining Conscious Self- 
Judgments 


To determine whether there were any differences 
in Ss conscious and unconscious self-evaluations, 
approximately 1 month after unconscious self-ratings 
were made, S was told which voice was her own, 
and asked to rate it when it was presented in an 
undistorted manner. The Purpose of the month delay 
was to allow for completion of other aspects of the 


at some time prior t 
instructed to indicate 
then as well as what i 
five choices varying 
“very pleased” 


V judgments. 
Finally, S was asked to write a brief 


description of 


ce sample following identifi- 
cation, 


REsutts 
Favorableness of Unconscious Self-Ratings 


Graphic ratings were converted into scores 
by using a ruler containing 25 divisions to 
measure the distance along the scale to Ss 
judgment. The use of so refined a scale in- 
sured that limitations in the score would be 
restricted to the judgments themselves, and 
not to the selection of too coarse a unit of 
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measurement. While judgments are a 
pected to approach the precision of the . se) 
there is nothing to lose by using an w tee 
sively refined scale, so long as 5 eign 
required to make equally fine oe 
To obtain scores of unconscious sel ee, 
ment, which involves a relative omeni d 
between self and other ratings for eac = 
four voices (4 X 4 array of ratings), 
means of the rows and columns as well = an 
array were calculated. The Tommila wn ; 
this determination was: Self Rating a 
—M,—M,. Subtracting row and at $ 
means control for rating bias and quality 
voice, respectively. is- 
In Figure 1, it can be seen that me pee 
torted presentation of Voice Forwat to selli 
tends to elicit unfavorable reactions ith the 
that the ratings become favorable oe the 
first level of distortion, and neutral K ki 
second. The only significant depe "ar tesis, 
a mean of zero, as determined by nea 
occurs at the VFD, level, where eee a 
reached at the .05 level. There is con voice 
random variability in the ratings vproaches 
Backwards, no level of which et d 
significance. Plotting ratings for oti that 
others separately, as in Figure 2, P eases! 
as distortion increases favorableness € ‘Al is 
and this holds both for VF and VB. nfavor 
combined ratings for VB are more Ba jn 
able than for VF. Apparently, desP ing 
structions to rate the person and ! 


NEUTRAL 
RATING 


FAVORABLENESS —> 


VBD, 


VBD, 


<— DISTORTION a 


; in raw ^a 
Fic. 2. Mean favorableness ratings f distorti” 
self and others for all levels © 


r 
res PET 


Unconscious SELF-EVALUATION 


RES there is a tendency to confound the 
the aoe to judge the unfamiliar less favor- 
thay ne the familiar. It is of some interest 
ova e ratings of the least distorted level 
ee i more favorable than the most 
is ren et of VF, although the former 
Sail pails than the latter. Apparently, 
with; € frames of reference are established 
n each of the forms of presentation. 


Ext, 
remes of Unconscious Self-Rating 


mia vestigate extremeness of self-ratings 
an eles direction, the variance of the 
erent eS scores was compared at the dif- 
ake on of distortion. The higher the 
both fan the more the self-ratings differ in 
f to and unfavorableness from 

iNcteasin Ss of others. It is found that with 
“Uniform 8 distortion self-ratings became more 
tween 1 le., less extreme. Differences be- 
however, vols (VFD2—VF; VBDz:— VB) 
and 97 fall short of significance, p = .09 
Of self.’ respectively. When the variability 
Separate] and other-ratings were determined 
leve] ve One finds that at the undistorted 
both j °) ratings of self are more extreme 
tha; z favorableness and unfavorableness 
Other aes of others (Self o= 36.60; 
= 17.89). But again this difference 


Just f 
alls short of significance, p= 07. 


imil j 
arity. one 
ty-Recognition 


A 
Pre Oh 3 it can be seen that for VB 
the Snia Ss are better able to perceive 
to alp arity of their own and other voices 
tease, actual voice when distortion is de- 
sista and that self-similarity ratings are 
ty one though slightly higher than rat- 
While Others, For VF on the other hand 
OF n ere is a sharp increase in recognition 
g own voice, the perceived similarity 
an initign «ce to one’s own, levels off after 
3 ifica increase, Analysis by £ test reveals a 
Ps gs a difference between self- and other- 
vaq OF the VFD, (.05 level) and VF (.01 
i Presentations. 

Verg ` noteworthy that despite a general in- 
“itil Ationship between distortion and 
Similari Y recognition, VFD; obtains a lower 

Y recognition score than VB, which 
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| gae SELF-RATINGS —— 
7 OTHER-RATINGS----- 
ot a a ee a T E 

V&D, VBD, VB VFD, VFD VF 


SIMILARITY RECOGNITION —> 
5 


<— DISTORTION 


Fıc. 3. Mean similarity recognition ratings in raw 
scores for self and others for all levels of distortion. 


suggests, as with favorableness ratings, that a 
separate frame of reference is established 
within the two modes of presentation. 

In order to determine the relationship be- 
tween unconscious self-judgment and degree 
of perceived similarity of voice to one’s own, 
Ss were divided into three levels of similarity 
recognition for each voice level. Following 
Epstein (1955), a low similarity recognition 
group was composed of Ss whose self-ratings 
indicated “no” or “very little” similarity and 
who gave another voice a rating of higher 
similarity than their own, a high similarity 
recognition group was composed of Ss whose 
self-ratings indicated “much” similarity and 
who gave no other voice as high a similarity 
rating as their own, and a moderate similarity 
recognition group was composed of Ss not 
falling in either of the other groups. A simple 
randomized analysis of variance for favor- 
ableness ratings as a function of level of 
perceived similarity produced significance at 
the .05 level for the undistorted voice. At 
this level Ss rated their unrecognized voice 
most favorably when recognition was only 
moderate. In order of increasing similarity 
recognition, the mean scores for favorableness 
are as follows: — 2.50(N = 3), .72(N = 13), 
— 1.85(N = 4). For the VFD; level of pres- 
entation, no significant relationship is ob- 
tained; at all other levels, there were too few 
cases of high similarity recognition to allow 
for reliable comparisons. 
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Conscious Selj-ratings 


Eighteen of the original 20 Ss took part in 
this phase of the experiment. of the written 
descriptions of reaction to identified voice, 4 
were favorable, 10 unfavorable, and 4 neutral. 
Thus there is a tendency for Ss to react 
negatively to their recognized voices while 
reacting favorably to their voices when un- 
aware that it was their own. While it is not 
possible to make direct comparisons between 
mean ratings on the conscious and uncon- 
scious self-ratings, as the wording on the 
scales is different, the data can be dichoto- 
mized into favorable and unfavorable ratings 
on both scales. Of nine Ss who performed 
differently on the two scales, eight made 
more favorable judgments of unidentified 
than identified voice, which is significant at 
the .05 level. When the very first reaction to 
hearing one’s recorded voice is taken into 
account, there are eight Ss who differ on judg- 
ments of recognized and unrecognized judg- 
ments, and in all eight cases judgments of 
identified voice are less favorable than of 


unidentified voice, which is significant at the 
.01 level. 


Discussion 


The results fail to support the supposition 
of Wolff (1943) and Huntley (1940) that 
with greater disguise of a self-production, 
defenses are increasingly circumvented, and 
judgments of self become more extreme. 
Findings, while falling short of statistical 
significance, Suggest the opposite trend, as 
increasing disguise results in less extreme 
reactions to self, both of a favorable and 
unfavorable nature. It was found however 
that self-judgments were significantly more 
favorable than judgments by and of others 
at the level of moderate disguise. Further- 
more, for the undisguised presentation of 
voice, self-judgments differed from other- 
judgments in being more positive only when 
recognition of similarity to one’s own voice 
was moderate. The curvilinear relationship 
between disguise and personality revealing- 
ness is not surprizing, for while disguise may 
reduce defensiveness, it provides fewer cues 


which can stimulate reactions at any level, 
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i.e., with sufficient disguise, the stimulus be- 
comes meaningless. 

In ths respect, the findings in the present 
study are consistent with studies of pro- 
jective tests (Kenny, 1954; Kenny & Bijou, 
1953; Murstein, 1958) which have reported 
that more revealing material is elicited by 
TAT cards of moderate than of low or high 
ambiguity. The completely negative findings 
on VB suggest that this form provides too 
few self-related cues to be a useful stimulus. 
Significant results using this form in another 
study (Epstein, 1955) may relate to differ 
ences in the population of the two studies. : 

The findings on conscious self{-judgment are 
consistent with the frequent observation that 
people are disappointed when hearing me 
recorded voice. Yet the question posed T 
why one should react favorably to one’s ui- 
recognized voice but unfavorably to One®? 
recognized voice. One plausible explanation 
is that Ss are less guarded or inhibited apon 
expressing narcissistic, i.e., favorable feline 
about self in response to self-related te 
when they are unaware that the productio 
is actually their own. But at the undistorta 
level, recognition of similarity to ones pa 
voice increases to a point where it is a 
sible that Ss may have come to suspect ee 
inclusion. At this level, ratings become vi 
favorable. And when conscious recogniti, 
did occur, reactions to self were strikin ire 
unfavorable, Ss revealing much displea” i 
with their voices. The negative self-react 
associated with conscious recognition is mod 
preted as reflecting culturally acquired ™ ic 
esty about appearing vain and narciss 
In future work it would be desirable t° oth 
vestigate in greater detail correlates of cell 
favorable and unfavorable reactions ‘° pes? 
in order to clarify the meaning ° 
response tendencies. 
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DOGMATISM AND SENSORY ALIENATION: 
AN EMPIRICAL INVESTIGATION 


MARTIN F. KAPLAN 
University of lowa? 
axb ERWIN SINGER 
City College of the City University of New York, School oj Education 


This study was designed to investigate certain aspects of the concept of 
“alienation” as employed by Fromm. Using Fromm’s framework and sug- 
gestions offered by Schachtel concerning inherent autocentrism and allocentrism 
in various sensory modalities, it was hypothesized that: (a) dogmatism as | 
measured by Rokeach's scale and adequacy on sensory discrimination tasks 
would vary negatively with each other and (b) differences between dogmatic 
and nondogmatic individuals would be more pronounced in their performances 
utilizing predominantly ‘autocentric modes than on tasks involving essentially 
allocentric modalities. Sensory discrimination was assessed by requiring Ss 
to match and/or differentiate between stimuli in 5 areas, 13 dogmatic and 13 
nondogmatic Ss (controlling other relevant variables) served as populations. 
Data obtained supported the hypotheses at least at the 5% level. 


THEORETICAL BACKGROUND 


Since the days of Breuer and Freud (1957) 
the notion that psychopathology is character- 
ized by the patient’s strenuous attempts to 
remove from consciousness certain stimuli, be 
they internal or external, has been prominent 
in most theories which claim some affiliation 
to what is loosely referred to as “depth psy- 
chology.” From “repression” and the other 
“mechanisms of defense” through “dissocia- 
tion” to the concept of “estrangement” the 
common denominator is the assumption that 
humans block from awareness what they 
deem noxious. What has changed in the 
evolution of psychological theory are the con- 
structs concerning what is blocked, how the 
blocking takes place, and why humans 
engage in such a process of reduction of 
consciousness. Through the years the em- 
phasis has shifted, at least in certain quarters, 
from postulating man’s attempt to defend 
himself against libidinal incestuous and/or 
aggressive inner stimuli to the hypothesis that 
there exists a more or less prominent tendency 
to remove from awareness the fact of one’s 


1This report is based in part upon a Master's 
thesis by the senior author at the Graduate School 
of Education of the City College of the City Uni- 
versity of New York. The work was aided by a 
grant from the Max Gewirtz Fund. This support 
is gratefully acknowledged. 


<i} ility 
very being and the sense of responsibil 
implicit in the awareness of being or eX" 
The empirical research to be reporte 
addresses itself to some aspects of this 
by investigating certain operations P the 
ably underlying the latter position, 1°» 4 
position which suggests that the - 
problem to be studied is the degree to “ Fi 
some people attempt to minimize the aten 
awareness of themselves and to what cia 
this minimization is intricately 255°° 
with certain character trends. i 
As is well known, in discussing p ain 
pathology Fromm (1941, 1955) takes ns tio? 
of departure from the concept of alio nd 
as developed by Marx in his Econo" (4). 


cho” 


Philosophic Manuscripts (Fromm, prom”, 
Alienated or estranged man 1S to cam A 
essentially Kafka’s Mr. K. oF tis 


Stranger. The process of estrangemen trigh" 
to proceed via a diminution O" jf as 4 
elimination of awareness of the “ding ; 
active and separate organism, ie by PX 
sense of being which is characterize, ity w 


. . bi it 

sive drivenness and lack of respons or a 
action, be that intellectual, emotion. sal io 
physical activity. Alienated man * d 
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DocMATISM AND SENSORY ALIENATION 


cae E in his discussion of “ob- 
Perceptual n E what he calls the “two basic 
autocentric ps Set, the allocentric and the 
— ia es, He suggests that the vari- 
capable rf modalities are more or less 
Epto 2 unctioning in one or both modes. 
senseg p = that the olfactory and gustatory 
modes essentially in the autocentric 
jectifying age relatively incapable of ob- 
actions A ] e world but rather arouse Te- 
some nan feeling tones which bring about 
tribute ae vis-a-vis the stimulus yet con- 
P. 88)” clue at all about object structure 
modalities os the other hand, sensory 
Much mole en D hearing and vision are 
reactivity sel ao of objectifying. Inner 
Within the y but does not necessarily occur 
Modalities perceiver when these latter sensory 
rectly idini employed. A person may cor- 
Breen With ly a given visual stimulus as 

terms ae assuming a personal position 

the sti a pleasure-displeasure continuum 
i imulation. Schachtel points out that 


IS differens; 
.> differential functioning of sensory mod- 


alities tis: 
Natives buy ot a matter of absolute alter- 
ce of aa rather of the relative predomin- 
th Jectification [p. 98].” 
soni oe of the predominantly 
self-aware modes enhances man’s immediate 
aenea through attention to personal 
iectificatio, Addition to mere attempts in 
usive foc ony and conversely, if rather ex- 
tric mod using upon predominantly allocen- 
"Nee th alities minimizes subjective experi- 
Nature Tough emphasis upon grasping the 
Subjecticn the object without attending to 
that de reactions, then it would follow 
th ae ~alienated man would de-emphasize 
in aa ie., autocentric modalities, and 
n reness ¢ reduce subjective experience and 
tic mode of self. The predominantly allocen- 
$ e-em Sh however, should not undergo such 
Whil Phasis in the self-alienated person. 
(196,,° Employing different terms Fromm’s 
fe iscussion of idolatry is consistent 
i i ormulation. He states at one point: 
à atic pe tentally, also the psychology of the 
tden eg ae S empty; dead, depressed, but in 
Aa lin ace for the state of depression 
write, a eadness, he chooses an idol, be it the 
pakes dae a an idea, the church, or God. He 
OTE, Ria idol into the absolute, and submits 
n absolute way [pp. 44-45]. 
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If we pursue the implications of Fromm’s 
thinking further it would become reasonable 
to hypothesize that the fanatic in his “empty, 
dead, depressed” state would emphasize the 
allocentric modalities and de-emphasize the 
autocentric modes or to restate the proposi- 
tion in more empirically verifiable terms, 
that increasing fanaticism or dogmatism 
would be associated with decreasing auto- 
centric perception and experience. If the 
fanatic is self-estranged then his self-estrange- 
ment is likely to proceed primarily via 
alienation from those sensory modalities 
which bring in their wake self-awareness or 
perhaps require some basic readiness for 
self-awaretiess, the awareness of self as a 
separate, living, and immediately reacting 
unit. 
Before discussing the specific experimental 
procedures employed a theoretical point needs 
clarification. While fanaticism and dogmatism 
are not necessarily synonymous, they seem 
closely related, for the fanatic as well as the 
dogmatic are essentially individuals who are 
in the jargon of psychological description 
“rigid,” incapable of tolerating ambiguity and 
in Rokeach’s (1960) terms “closed-minded.” * 
But closed-mindedness in its conceptual 
meaning is not restricted to the “fanatic” or 
the “dogmatic” but permeates in one form 
or another all manifestations of psychopath- 
ology. For example, when one says that a 
person is relatively devoid of insight one 
proposes, in effect, that the individual in 
question is closed to or unaware of his feel- 
ings, tendencies, impulses, or reactions, is 
in the broadest sense of the word self- 
alienated so we return to our original theoret- 
ical point of departure, i.e the proposition 
that pathology is essentially characterized by 
a type of fanaticism or dogmatism which 
reflects itself in varying types of closed- 
akes a subtle distinction 


2 Even though Rokeach m 
ism, employing the 


between rigidity and dogmati 
er in relation to specific behavioral manifesta- 


the latter in connection with total belief 
systems or general orientations, this distinction is 
for our purposes unnecessary. We shall employ the 
terms dogmatism and rigidity interchangeably, both 
denoting a schematized and fixed outlook and 
orientation towards self and the world. Con- 
sequently, the term rigidity as employed here 
comes close to what Rokeach refers to as dogma- 


tism or closed-mindedness. 


form 
tions, 
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mindedness and rigidity, these rigidities ex- 
pressing themselves in different forms, but 
all of these forms detracting from self- 
awareness. 


HYPOTHESES 


In order to investigate this proposition 
empirically we hypothesized that the degree 
to which the tendency towards dogmatism is 
operant within a given individual to that ex- 
tent self-awareness should appear reduced, 
this reduced self-awareness operationally re- 
flecting itself in decreased emphasis upon 
the autocentric sensory modalities and rela- 
tively elevated emphasis upon the allocentric 
modes of perception. Schachtel’s discussion 
suggests a rather specific hierarchy or con- 
tinuum along the autocentric-allocentric di- 
mension. In ascending order of allocentric 
potential he arranges the sensory modalities 
in approximately the following order: 
olfactory, gustatory, tactile, auditory, and 
visual. Thus he suggests that the olfactory 
modality is hardly capable of bringing about 
objectification and is closely associated with 
the autocentric mode of perception, the visual 
modality is most capable of objectifying and 
therefore potentially most closely associated 
with the allocentric mode of perception 
(1941, 1955). 

On the basis of these considerations the 
following specific hypotheses were developed: 
la. Highly dogmatic individuals should ex- 
hibit significantly lowered sensory acuity 
when compared with individuals who are 
relatively free of dogmatism; 1b, As a logical 
corollary of the above hypothesis there should 
exist a negative correlation between dogma- 
tism and sensory acuity; 2. Finally, this 
differentiation in sensory acuity should be 
most prominent in the autocentric sensory 
modalities and less pronounced in the allo- 
centric modes. Put a bit differently, declin- 
ing sensory acuity is expected in all human 
beings on the allocentric-autocentric con- 
tinuum but this decline will be m 
nounced in highly dogmatic than 
tively nondogmatic individuals, 


ore pro- 
in rela- 


METHOD 


For purposes of this investi 


i gation dogmatism was 
operationally defined as the 


Score achieved by a 
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subject on the Rokeach Dogmatism Scale o 
pp. 71-100). Previous researches have establishe T 
usefulness of this device by finding satisfactory oa 
relations between scores achieved on the ie 
personality variables serving as criteria mea best 
By and large it may be considered a anes 
existing objective measure of closed-minde a 
and a host of studies support the reliability he 
construct as well as criterion validity of 
instrument (1960, pp. 101-108, 171-274). ade 
The assessment of the sensory acuity oF tle 
tivity to stimuli proved a more complicated os 
Two procedures were employed to assess agen 
Sensitivity. First five odors from Henning’s dhe 
Prism were selected. These odors were along ch 
dimension of spiciness at known distances from ice 
other. Three spices, one at a time, were RA are 
under the nose of each subject while his eyes us 
closed and following MacDonald’s suggestion (Woo 
worth, 1958) the subject was asked to compare the 
second stimulus presented with the first and ost 
third and to judge which of these resembled pre 
closely the second stimulus. Each set of three ae 
stimuli was presented in systematically rota x 
order so that each spice became at least once s 
comparison stimulus. In the second olfactory iii 
three toilet waters were employed., Each scent m 
presented, one at a time and then one of the tne 
was re-presented, the subject being asked to indie 
with which of the original stimuli he was now ne 
fronted. Systematic rotation of order of Rise 
tion was observed and care was taken to make ca 
scent a comparison stimulus, igi, 
Three sets of salt, sugar, and vinegar ea m 
each set containing a heavy, a medium, anc 565, 
slight concentration of the respective addi 
served as the stimuli for the gustatory task. ehh 
concentration in the “medium” solution, hone er 
was always closer to one of the extremes rat “h 
than exactly in the middle between them. ae 
Set of the three solutions was presented for he 
trials, presentation varying from trial to ls oui 
subject was asked to sip each solution ET 
swallowing and after each sip he was to r 
mouth with warm water. Each trial consiste! and 
the presentation of two of the three stimuli and 
the subject was then given the third solution R vo 
was asked to indicate which of the preceding 
the third solution resembled more closely. | á 
Three sets of swatches of cloth, each 6 S 0 
square, each set containing three swatches for 
similar texture, were employed in tent Th 
tactile acuity. The texture of one set wae "Evon 
of the other smooth, and of the third silky. was 
though the nature of the materials in each pa sub- 
rather similar they were of different weave given 
jects were asked to avert their eyes and ae rub 
one swatch at a time to touch. They were wed t0 
one hand over the material but were not ni uF 
lift or to pinch it, 10 seconds were mion on o! 
each swatch. A trial consisted of the presenta e Te 
all three materials constituting a set an ing asked 
presentation of one of them, the subject bel 


hes 
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to jj P 
Aa n of the three he had received for 
ing Sie -— Each set was presented employ- 
making ae bes rotation procedures three times, 
the test e Ea trials in all. Care was taken that 
Presentati ulus appeared at varying points in the 
Roe ation procedure. 
Föne ee task a record, “The Measure of 
the Bauer poe Performance,” manufactured by 
Several binds ompany was secured. The record has 
requency it each with a different tone of known 
Sets, each i ‘ae bands were grouped into three 
Tequencies $ containing three tones. Again, the 
than to the Senp tones were closer to cach other 
mes, varyin ird tone. Each set was presented three 
c subject be the order of presentation, and after 
Which of th ad listened to two tones he was asked 
closely, e two the third stimulus resembled more 

Fi z 

nally, for the visual task sample paint cards 

tards, rege! were obtained. There were 12 
each trial ifferent tint of green, pink, or yellow. 
Presented two tints of the same color were 
(without ne the subject was then asked to judge 
Which of eing shown the original stimuli again) to 
Since the the two the third tint seemed closer. 
Order of aie company had ranked the tints in 
Was an ame from the primary color, there 
tect or aas basis for judging a response cor- 


n ord 
er . fs . 
tasks to make scores achieved on the various 


C 

Constant parable the number of trials were held 

. | foreach ma to task, allowing one point for 
the ba entification or judgement. In this 

or ach ssible range of scores remained the 

Ema sensory modality. 

tuleets ee nn pool of 40 randomly selected 
sm © had been given the Rokeach Dogma- 


th cale 
hos » those 13 who had scored highest and 


e 

ki 3 who ha 

e all T had scored lowest on the Scale were 

tie tasks, Pr, above-described sensory discrimina- 
ocedures for both comparison popula- 


s, 
tela, ? Person F 
ley, Terns carrying out the testing and similar 
ables were controlled and kept constant. 


Üt RESULTS 

wi 

Nypothen be recalled that we suggested in 
t sis la that highly dogmatic individ- 


S wo ray 
Or date exhibit significantly lowered sens- 
pho on y when compared with individuals 
Avir relatively free of dogmatism. Data 


ne |} : 

sented ; Dearing upon this assertion are pre- 
é In Table 1. 

of data speak for themselves. On five 


e six 
i x measures evaluated the low dog- 


atic py 
dogmai > proved superior to the highly 
Me c individuals to a degree which 


a Rég A 

Mtihan differences observable rather un- 

ask ig q ble to chance. Only on the visual 
he difference observed, although mov- 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF ALL MEASURES 
EMPLOYED For THE Hicu DOGMATIC GROUP AND 
tue Low Docwatic GROUP 


Low High 
dog- dog- 
matic matic Difer- 
Measure N=13 N=13 ence i 
Dogmatism scale M 12792 174.62 
—46.72 10,06*** 
SD 5:99 13.23 
Olfactory task M 5.31 3.70 
1,61 1s" 
SD 98 1.47 
Gustatory task M 6.00 446 
1.54 2,79" 
SD 1.36 
Tactile task M 5.46 
° 1.46 2.70%* 
SD 1.51 
Auditory task M 6.61 
1.15 1.93* 
SD 1.30 
Visual task M 6.77 6,07 
70 1.24 
SD 1.25 1.49 
Total sensory M 30.15 23.69 
6.46 54t 
SD 3.23 2.55 


es are for one-tailed tests. 


ing in the predicted direction, relatively small 
and hence likely to have been the result of 
chance factors and artifacts. The data must 
therefore be considered as highly suggestive 
that Hypothesis la represents a valid assertion. 

In Hypothesis lb, a logical corollary of 
Hypothesis 1a, the above findings are simply 
expressed in different statistical fashion. A 
Pearson product-moment correlation of —.61 
was calculated between our measures of 
sensory acuity and scores achieved on the 
Rokeach Scale. This value is attributable to 
chance factors less than 1:1,000 and, as al- 
ready stated, expresses in different fashion the 
observation that with increasing dogmatism 
sensory acuity decreases. 

It was also predicted that the differences in 
sensory acuity between the two groups would 
not be uniform from modality to modality 
but much more follow a gradient of differen- 
tiation from relatively minor differences in 
the primarily allocentric modes to marked 
differences in the primarily autocentric modes 
(Hypothesis 2). 


Once again the results seem rather un- 


490 


equivocal in support of the hypothesis. The 
gradient of differences is apparent even 
though neither group exhibited the exact pat- 
tern of performance predicted. It will be 
noticed that both groups did better on the 
gustatory task than on the tactile task but 
this is irrelevant for it should be recalled that 
absolute performance on the tasks is less 
important than the gradient of differences in 
performance predicted and in this respect 
the data are evidently conclusive. The par- 
ticular order of differences observed is at- 
tributable to chance only on a 1:120 basis, or 
maximal for the present design.* 


Discussion 


At this point we wish to mention certain 
methodological difficulties. It is well nigh 
impossible to construct tasks in varying 
sensory modalities which may be deemed 
equal in difficulty. We have no way of tell- 
ing whether the visual tasks were comparable 
to the gustatory tasks. Of course, it would 
be possible to systematically eliminate items 
here and add items there until a series of 
tasks is evolved on which a criterion popula- 
tion would exhibit equal competence from 
task area to task area and having established 
such a scale, administer it to various pre- 
selected populations. But we believe that 
this would be begging the question, for we 
still would not know whether the tasks were 
comparable; we would only know that a given 
Population obtained identical scores on all 
tasks. We therefore decided not to concentrate 
so much on the development of items of 
illusory comparability in difficulty but much 
more upon the investigation of differences be- 
tween groups on various types of task. 
the question of whether these diff 
would vary in accordance with certai 
retical propositions. Our f 
in this direction and we 
closed and dogmatic person 
or alienated from his sensory impressions and 
this estrangement is more significantly ob- 
servable in the predominantly autocentric 
modalities than in those modalities which 


s and 
erences 
n theo- 
ocus of inquiry was 
conclude that the 
appears estranged 


® We are indebted to Jacob Cohen of New York 
University for suggesting the appropriate statistical 
treatment of this datum. 
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lend themselves more readily to what Schach- 
tel has called “objectification.” y 

Our data suggest two additional ings 
tions and comments. First of all they 
to make one doubt the frequently made A 
tion that there exists an inherent conum = 
tion between the so-called “objective aa 
“subjective” approaches to pcg als 
the contrary, our low dogmatic indivi a 
did not only perform in superior a 
the predominantly allocentric tasks MEEN 
on the predominantly autocentric test 1 hile 
and vice versa, the highly dogmatic, W on 
doing almost as well as the low dogmati “tor 
the allocentric items, did a remarkably in on 
job on the autocentric procedures. oe 
words closed-mindedness and apam e 
ness are not phenomena which appear 1n i in 
areas of an individual’s living and wvasive 
others, but much more they are ar only 
phenomena, the close-minded is no highly 
closed-minded vis-a-vis personal and ‘nae 
autocentric experience, he is also er 
vis-a-vis objectifiable stimulation. its this 
versely, the open-minded person exhibi : eri- 
orientation vis-a-vis both autocentric oa o 
ence and objectifiable data. As a mât that 
fact the idea suggests itself pointedly kes 
the very openness to experience which rant 
it possible for a person to be au p ade- 
with his sensations also enables him weston 
quately objectify such sensory imp" ger ap” 
And this openness to sense impression" eas; 
parently runs parallel to openness to an 
willingness to examine them critically, : 
careful analysis of thought. This wees 
be too surprising for the history of ra $- 
civilization is replete with incidents tribu d 
amples in which those who have Sir show” 
most significantly in one area have el gee 
themselves open to the pursuit of ot cienti ts 
ingly opposite endeavors. Leading pe peti? 
have been remarkably receptive tO ‘atuit 
experience. eager to follow fantastic ri 3 
hunches, and have exhibited genuine manis 
in humanistic intellectual pursuits; | creati” i 
tic intellectuals and men prominent 1 toundin’ 
artistic endeavors have exhibited “A E 
interest and insight in matters a: js 10” 
“two cultures” which Snow oer 
of talking about may not be as p ent 
each other as one might suspect 
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involvement in either may very well require 

a personal quality essential for both. 
Je ter aah, if neurosis is viewed as dog- 
and ri E teja restriction of horizons 
Parties 1 oo along certain lines, these 
retical “ad ines depending upon the theo- 
ate inhi of an author, then our results 
estrange y supportive of the hypothesis that 
ees or alienation is markedly as- 
ciation wath such rigidification, be this asso- 
ten, a ase concurrent, or resultant. The 
Upon thi ne present study throws little light 
Speculate question of sequence. We can only 
at the and suggest that the observation 
T th, a modalities which appear 
carly e Snel more prominently at 
type of ee do not lend themselves to the 
Order to ‘ectification our culture demands in 
that ah. along.” It is readily conceivable 
Wake a cultural demands bring in their 
towards a or less_dogmatic orientation 
xisting, an orientation which, as we 


have shown, also entails sensory alienation. 
Further research bearing upon this sequen- 
tial issue is sorely needed. 
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THE DIMENSIONALITY OF THE MMPI CLINICAL SCALES 
AMONG NORMAL SUBJECTS 


PAUL DEMPSEY 
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Scales 1-9 and K were tested for dimensionality in separate samples of 40 
males and 40 females. The test used was based on contextual analysis, a 
method that yields an explicit factor score for each S, thus permitting a 
practical coefficient of dimensionality: ra, the Pearson r between scale scores 
and contextual scores derived from the same items. Results were highly 
similar in both samples, only Scale 5 (Mf) showing a difference in dimen- 
sionality between sexes. Differences between scales were marked. Scales 1 (Hs) 
and 7 (Pt) showed excellent dimensionality; 3 others, 8 (Sc), 9 (Ma), and 
K, were fairly satisfactory; the rest showed coefficients too low to warrant 
treatment as interval scales among normals. The best scales can profitably be 
shortened; the better scales are improvable. The poorer appear suitable mainly 
for nondimensional uses in configural work and profile interpretation. 


One striking gap in MMPI research is 
the absence of any explicit investigation of 
the dimensionality of scale scores on the 
clinical scales. There have been several fac- 
torial studies of the intercorrelations of scale 
scores, many of which have yielded similar 
results (Welsh, 1956), but these have per- 
force assumed the dimensionality of the scales 
they were investigating. There has also been 
a set of factorial studies of the intercorrela- 
tions of the items on single scales (Comrey, 
e.g., 1958a, 1958b, 1958c), but while these 
have not made the a priori assumption of 
the dimensionality of the scales, neither have 
they provided any basis for testing it. In 
short, apart from the descriptions provided 
by McKinley and Hathaway on the original 
construction of the scales (Welsh & Dahl- 
strom, 1956), there is available no direct in- 
formation on this important question. 

The need for filling this gap in research 
arises primarily because current use of the 
MMPI requires of the clinical scales that they 
make consistent differentiations among normal 
subjects and among hospital patients as well 
as that they distinguish normal subjects from 
hospital patients, and both of these from 
specific criterion groups. In a word, current 
use of the MMPI calls for scales that make 
dimensional distinctions within groups, al- 
though the clinical scales themselves were 
constructed by methods that insure such 
distinctions only between groups. As Hatha- 
way (1956) describes the general Process of 
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5 2 ] on 
scale construction, all items were selecte¢ 


the basis of group comparisons, in his aori 
“by contrast of criterion groups with pee 
clinical groups and with various ine 
groups [p. 104].” The process of crossvali iis 
tion also rested on group comparisons, 12 m 
case on the extent to which the megna s 
distributions of scale scores in samples o. m 
criterion abnormal type were separated pa 
the frequency distributions shown by Ot us 
kinds of samples. In brief, although a, 
groups were defined, no attempt was mares 
during scale construction to get within-gre a 
differentiation that reflected systematic 1” 
vidual differences. 

The success of the configural apPr° m- 
(Meehl & Dahlstrom, 1960) and the union e 
ity of factorial results (Welsh, 1956) ae 
positive if indirect evidence that at least nti 
of the scales are making consistent em z 
tions within at least some groups of SU ner 
at least some of the time. Nonetheless; g ex 
can be considerable advantage in making the 
plicit the degree of dimensionality = ups 
various clinical scales within the cial Pog is 
or subpopulations in which the MM mons 
commonly used to make distinctions a 
persons. As a first approximation, this C2" ind 
taken to mean within normal samp t mis 
within samples of hospital patients "pe ja 
cellaneous diagnoses. The present Pat he 
concerned with the dimensionality 
scales among normal subjects. 

To test the dimensionality of a” 


ach 


rer 
scale 
y $ 


Ti 


DIMENSIONALITY 


quires both a definition and a method. An 
evaluation of the various indexes of dimension- 
ality outlined by Lumsden (1961) indicates 
that historically there has been relatively lit- 
tle basic disagreement in definition but con- 
siderable variation in method. Unfortunately 
si of the methods has been really satisfac- 
ba whether because of limitations in logic, 
i need for excessive calculations, or diffi- 
ulty of interpretation, no one of the sug- 
a indexes has recommended itself for 
A enl use. The index of dimensionality used 
i os Present study was devised by the writer 
WAS yy et area over existing indexes. It 
terrat e possible by the availability of con- 
of a ey eg a recently developed method 
detlying S the major dimension or factor un- 
subjects the responses of a given sample of 
1962. on a given set of stimuli (Dempsey, 
on empsey & Baumhoff, 1963). For 
ing ch Purposes, the important distinguish- 
ae AMC a of contextual analysis is 
factor yields specific numerical positions or 
only ne for all subjects, scores that exist 
an Sang in the centroid analysis of 
stores fe these scores, called contextual 
major’ order the subjects with respect to the 
their ese individual differences in 
iia ormances and are unaffected by 
make z not related to those differences, they 
ition I ie a simple and convenient defi- 
Perfect , a perfect unidimensional scale. A 
heré ORES TOR scale is one in which 
Scores a k perfect correlation between scale 
Seip oe contextual scores derived from the 
patible ms. This definition is logically com- 
Y Lo with the general definitions suggested 
Lord rate (1947), Lazarsfeld (1950), 
Si 52), and Lumsden (1961), as well 
ith Guttman’s definition of a quasi-scale 

val Pepe 1950). It also states one extreme 
Dretable _4 very simple and readily inter- 
called the of dimensionality. That index, 
Fis te = major coefficient of dimensionality, 
pee nine the Pearson product-moment 
content 1on coefficient between scale scores and 
tet scores. The major coefficient, whose 
interpr on is from .00 to + 1.00, is to be 
lens s ed like any Pearson r. Its availability 
inal A esting the dimensionality of the orig- 

MPI scales eminently practicable. 
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METHOD 
Subjects 


Two samples of 40 Ss each were used. All 
members of both samples were classifiable as normal, 
although there was considerable variation in ade- 
quacy of adjustment. The Ss were selected so that 
there was homogeneity within samples with re- 
spect to sex, age, and educational level, but im- 
portant differences between them in all three vari- 
ables. One sample was composed of male graduate 
students with a median age of 27 years. The other 
was composed of female undergraduates with a 
median age of 19 years. The samples are called 
the male and female samples on the assumption 
that sex is the most important distinguishing 
variable. 


Procedure 


The MMPI protocols obtained from the Ss were 
scored on Scales 1-9, and also on K. The entire 
analysis was carried on within samples. A total of 
20 separate tests of dimensionality were made, one 
in each sample for each of the above scales. The 
index used was the major coefficient of dimen- 
sionality ra, described earlier in this paper. The steps 
followed in making the tests were identical for all 
scales and both samples. Brief descriptions of the 
chief steps are given below. 

1. Preparation of Agreement Matrix. Paired com- 
parisons were made of the response patterns of the 
Ss to all items of a given scale, each possible pairing 
receiving an agreement score equal to the number 
of identical responses. These scores were then as- 
sembled in a matrix in which both rows and 
columns designated persons. 

2. Contextual Analysis of Agreement Matrix. This 
technique has been described in some detail else- 
where (Dempsey & Baumhoff, 1963). It is basically 
an iterative summing process that quickly con- 
verges on a unique solution for any matrix in 
which there are systematic differences among agree- 
ment scores. The solution provides for each S a 


dimensional position or factor score that is 
maximally consistent with the major systematic 
differences appearing in the agreement matrix. 


These scores are called contextual scores. 

3. Calculation of the Coeficient of Dimensionality. 
The Pearson product-moment method was used to 
correlate the scale scores of the Ss on a given scale 
with the contextual scores derived from the same 
items. The scale scores were raw scores, and no K 
correction was added. Tests of significance were 
made where appropriate. 


1 The writer wishes to express his appreciation 
to Donald W. MacKinnon, Director of the Institute 
of Personality Assessment and Research, University 
of California, Berkeley, for permission to use the 
protocols of these subjects. 
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RESULTS 


The 20 coefficients of dimensionality cal- 
culated for the 10 MMPI scales in the two 
samples of normal males and normal females 
are shown numerically in Table 1. The. range 
of the 7, coefficients for the male sample is 
from a high of .96 for Scale 7 (Pt) to a low 
of .11 on Scale 3 (Hy). For the female sample 
the range was approximately the same, the 
highest ra coefficient being -91, also on Scale 
7 (Pt), and the lowest being .01 on Scale 5, 
the Mf Scale. On 7 of the 10 scales the co- 
efficients for the male and female samples are 
within .08 of one another, on 2 the difference 
is .22, and on 1 scale, Scale 5 (Mf), the male 
sample shows a coefficient of .61 and the fe- 
male sample .01. This last difference is signi- 
ficant (p < .01). Since the male sample shows 
a slightly higher coefficient on each of the 
other scales as well, there is a small but sig- 
nificant sex difference in the overall height of 
the coefficients of dimensionality. 

For two of the scales, Scale 7 (Pt) and 
Scale 1 (Hs), the ra coefficients in both sam- 
ples exceed .90. On these, scale scores account 
for more than 80% of the variance attribut- 
able to systematic (dimensional) differences 
in the Ss’ performances. Three other scales 
show coefficients high enough to account for 
50% of the dimensional variance: Scale 8 
(Sc), where ra = .84 and ry = .79 in the male 
and female samples, respectively; Scale 9 


TABLE 1 


COEFFICIENTS OF DIMENSIONALITY OF MMPI 
Scares IN Two NORMAL SAMPLES 


Coefficient of dimensionality, ra 
Male 


Female 
Scale sample sample Difference 
K 81 -13 .08 
1 (Hs) 93 -90 03 
2(D) 67 45 22 
3 (Ily) at 10 01 
4 (Pd) 36 14 22 
5 (Mf) 61 01 .60* 
6 (Pa) 43 «41 02 
7 (PI) 96 91 05 
8 (Sc) 84 79 05 
9 (Ma) 82 75 07 


Note.—N = 40 in each sample, 
*p <.01. 
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(Ma), where ra =.82 and ra= .75, E 
tively; and the K Scale, where ra = .81 ana 
males and 74 = .73 among females. The 


i Frat wer 
maining five scales show coefficients at a l0 
level. 


DISCUSSION 


In general these results are in line es 
the reputation of the various scales: Fie 
commonly thought of as “good” scales cee 
emerged with high ra coefficients, and t a 
thought of as “poor” scales with lower err 
There may perhaps be some surprise a 
the coefficients shown by Scale 2 (D) m 
not higher, particularly among females, of 
that those shown by Scale 9 (Ma) are E 
lower. The outstanding characteristic of ion 
results, however, is the enormous varien 
in the size of the r4 coefficients from ing: 
scale. Two of the scales show encellent Si 
mensionality, three more are fairly satis nts 
tory, but five of the scales show coefficien s 
too low to warrant treating them as inter na 
scales, at least among normal subjects. if 
bearing of these differences on the ine e 
interpretation of the various scales wi 
briefly considered. 1 

On the two most highly dimensional B 
Scale 7 (Pt) and Scale 1 (Hs), scale s€ 


eS; 


and normal females, respectively. thes® 
normal subjects obtain similar scores 0” 
scales, the probability is very high tha suc! 
have given similar performances. nifo 
similarity does not in itself justify g does 
inferences as to psychological angen i pro” 
Provide a consistent behavioral basis tudes: 
file interpretations and correlational “if n it 
including factor analysis. In this conne 
should be noted that while Scales ae orally 
highly dimensional, they are also “tert 3 
complex. Comrey’s analysis of the p 
Scale 1 yielded eight identifiable £4 i 
did his analysis of Scale 7 (Comrey 


tors) 
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va As may be expected from the high 
ys Pca of the scales, many of Com- 
Rae fa orsiare very small, contributing little 
os i random variation to overall scale 
aN ; iminating these small factors would 
Konali, y a slight improvement in the dimen- 
FN Pil of the scales; the chief gain would 

5 m making the scales shorter. 
ing eine less satisfactory, but still show- 
Scales A erate degree of dimensionality, are 
Stale: O (Sc) and 9 (Ma) and also the K 
each i aha, 50% of the variance in scores on 
single di hese scales can be accounted for by a 
in this ote in the normal samples used 
Score A y. While the relationship between 
ih tte formance here is less predictable 
high “mek nt dimensional scales it is still 
scales as gh to justify treating these three 
esctibed — scales. As with the scales 
of thes above, factor analyses of the items 
Se show many small but identifiable fac- 


tor. N 
OTA or Scale 8 Comrey and Marggraph 
the z identified 12 factors; for Scale 9 and 


and 3 Scale, Comrey (1958ab) identified 14 


ens . . . . 
wae » however, their dimensionality 
nation e improved considerably by the elimi- 


items. a a small number of cross-cutting 
quire an Iminating such items would not re- 
Pretation. change in the character or inter- 
Serve to a the scales. Rather it should 
curate, ake existing interpretations more 
Th contrast to the foregoing are Scales 2-6, 
Coefficient which shows a dimensionality 
Dough t among normal subjects high 
Scale, Sir o justify treatment as an interval 
Somewhat. each of these five scales presents a 
Pretation different problem of use and inter- 
, each will be considered separately. 

a whi five, Scale 2, the Depression Scale, 
Ore Mee have most in common with the 
dimension ly dimensional scales. While its 
jects ar nality coefficients among normal sub- 
a Tea low, they are relatively closer to 
ar able level than those of Scales 3-6. 
Writer h samples of hospital patients, the 
Or Sean obtained dimensionality coefficients 
to Wie 2 of .86 and .87, sufficiently high 
e gross dimensional distinctions. To 
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permit consistent differentiation within the 
restricted range of talent represented by nor- 
mal subjects, however, requires the elimina- 
tion of many error items. The writer has 
developed a modified version of Scale 2 that 
appears to meet the strictest dimensional 
requirements, yielding coefficients of better 
than .90 in normal samples. A paper on this 
modified scale is in preparation. 

Scale 3, the Hysteria Scale, on the other 
hand, shows no evidence of dimensionality 
among normal subjects. The two coefficients, 
ra= .11 for males and ra = -10 for females, 
indicate that scale scores account for only 
1% of the variance in the Ss’ positions on the 
major underlying dimension or factor. Two 
normal subjects who obtain the same score 
are no more likely to have given similar 
performances than are two subjects with 
widely different scores. Treating such a 
congeries as an interval scale makes very 
little sense, and it is not surprising that 
factorial studies of normal subjects show 
inconsistent results for this scale (e.g. 
Kassebaum, Couch, & Slater, 1959; Wheeler, 
Little, & Lehner, 1951). Making a dimension 
of Scale 3 would be an arbitrary procedure, 
for there is no dimensional nucleus or core 
that can be made more prominent. Yet ob- 
viously Scale 3 has proved very useful in 
profile interpretation and configural scoring 
(Welsh, 1952; Meehl & Dahlstrom, 1960). 
Indeed, the scale seems to show just the sort 
of nondimensional but consistent relationships 
that the configural approach was devised to 
tease out (Meehl, 1950). A given score on 
Scale 3, say a T score of 70, for example, is 
likely to arise from an entirely different set 
of items when a subject’s Scale 2 score is low 
than when it is high. Presumably such grossly 
different performances can appropriately be 
given different interpretations. In short, the 
psychological significance of Scale 3 scores 
among normal subjects appears to be a 
matter of complex contextual interpretation, 
and the assumption of dimensionality is an 
error to be avoided. 

Most of what has been said about Scale 3 
is also appropriate to Scale 4, the Psycho- 
pathic Deviate Scale. In the original paper 
on this scale, McKinley and Hathaway 
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(1944) describe it as deliberately mixed in 
factor content and see its value as best il- 
lustrated in the diagnostic evaluation of the 
psychopathic personality in the clinic. Scale 
4 has, however, also been reported as useful 
in making distinctions among normal subjects 
(e.g., Hovey, 1953). As with Scale 3, im- 
proving the dimensionality of this scale may 
actually impair its usefulness by reducing its 
validity in configural scoring and profile inter- 
pretation. This, however, is a question that is 
subject to test. T 

Scale 5, the Masculinity-Femininity Scale, 
has played a relatively unimportant role in 
the development and use of the MMPI 
(Hathaway, 1956). It is of interest here in 
good part because it is the only scale that 
shows a significant sex difference in dimen- 
sionality among normal subjects. While it is 
tempting to interpret this difference as sup- 
porting the hypothesis that femininity is not 
the mirror-image, or even the approximate 
mirror-image, of masculinity, no interpreta- 
tion is justified at this stage, Among males, 
where ra= .61, it should be possible to 
construct a highly dimensional scale from the 
items of Scale 5, but among females, where 
rq = .01, such a scale could not be developed 
without changing the direction of scoring of 
many of the items. 

The last of the scales showing low coeffi- 
cients of dimensionality, Scale 6, the Paranoia 
Scale, is similar to Scale 5 in the sense that 
it seemed weak to its originators at the 
time of its publication and has played a 
relatively unimportant role in the subse- 
quent use of the MMPI (Hathaway, 1956). 
Among normal subjects of both sexes the 
variance of scores is small, and it is not 
surprising that dimensionality coefficients of 
only .43 and .41 are found for male and 
female samples, respectively. An item analysis 
conducted by the writer indicates that the low 
dimensionality of Scale 6 is to be attributed 
in good part to subtle items that consistently 
draw more frequent endorsements from low 
than high scorers among both normal and 
hospital samples. Since Scale 6 has not dis- 
tinguished itself by the accuracy with which 
it separates out persons with Paranoid symp- 
toms, there may be considerable merit in 
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F ; by 
improving the dimensionality of the pa 
reversing the direction of scoring the 
consistent of these subtle items. 
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DEFINITIONAL ASPECTS OF THE CRISIS CONCEPT 


BERNARD L. BLOOM 
Harvard School of Public Health > 


8 expert judges reacted to a series of 14 miniature case histories of stress 


experiences which had been constructed so tha 


differential effects of 5 variables u 
of “Crisis.” The variables studied 
of a precipitating event, 


Among psychologically stressful experi- 
ences, those particular events referred to as 
crises are receiving increasing attention by 
mental health workers. Interest in the crisis 
has arisen partly because it is felt that the 
consequences of the management of such 
periods of stress often include significant 
and long-lasting changes in level of adequacy 
of mental functioning, Many workers who 
apply public health concepts to the field of 
mental health believe that good mental health 
is in large measure the result of a life history 
of successful crisis resolutions; and, there- 
fore, by Providing therapeutic intervention 
to people while they are in crisis, the in- 
cidence of subsequent mental 
these persons may be significan 
Clinical studies of crises precipitated by 
specific events have appeared in recent years. 
Bereavement has been studied by Lindemann 
(1944); reactions to surgery have been 
studied by Janis (1958); and reactions to 
the birth of a Premature child have been 
studied by Caplan (1960) and by Kaplan 
and Mason (1960). These studies are related 
to earlier investigations of combat reactions 
and responses to natural disasters, although it 
is only within the past several years that the 


concept of crisis has been given focal im- 
portance, 


disorder in 
tly reduced. 


1 Now with National Institute 
at the Denver Regional Office of 
Public Health Service. 


of Mental Health 
the United States 


t it was possible to study the 


pon judgments of the presence or absence 


» (d) evidence of 
of resolution. The results in- 
d as an episode beginning with 
either no discernible reaction or 


f the in- 
During the exploratory phase 0 jg 4 
vestigation of many new concepts, earn: 
danger that definitions can be too a an¢ 
The boundary between what is include cep 
what is not included in a particular colibe 
might appropriately be kept fluid anhi the 
erately vague so as not to restrict a 
span of inquiry. Precise definitions qwances 
cepts are required, however, as one aon stind 
beyond initial exploration toward the ae. 
of hypotheses which have been adv rs 
involving the concepts. This phase appesi 0 
have been reached regarding the con Si 
crisis. Caplan (1961) has suggeste et 0 
example, that “during the period of UPS ing 
a crisis, a person is more susceptible OL ative 
influenced by others than at times of 'ferring 
psychological equilibrium [p. 13].” R196) 
to this increased susceptibility, Caplan jatit 
Suggests that “from a preventive PSY’ apre”? 
point of view, this is a matter of R elpi”É 
importance; because by deploying sis 2 
services to deal with individuals in me um 
small amount of effort leads to a ma what 
amount of lasting response [p. 82 this pe 
ever else may be required to test pts mus 
vocative hypothesis, relevant life eve crises tr 
be unambiguously sortable as eua attempt? 
not crises. The success of studies © tcome d 
to intervene and influence the oute on 
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‘ eee 
N 


Tue Crisis CONCEPT 


oe to explore the nature of agreement 

Olden to the crisis concept among 

m ies highly skilled in both crisis theory 
mmunity mental health practice. 


METHOD 


ies poe noncrisis state the existing 
is denne responses available to an individual 
occur, A e to meet and solve problems as they 
self aa occurs when an individual finds him- 
Problem eae deal effectively with an emerging 
shading Rae at vary in severity, with mild crises 
Solving “ied the us al ups and downs of problem 
Uunctionin, avior Which characterize the ordinary 
literature g of an individual. Review of the relevant 
identification nn that the crucial elements in the 
den. of the crisis-state appear to be (a) 
ividual jg precipitating event of which the in- 
Cognitive a Eye (b) significant subsequent rapid 
Particular fai affective disruption unusual for that 
‘Uption ae ividual; and (c) duration of the dis- 
ese ee least several days. With regard to 
cernin Ta a number of questions may be raised 
Presence 8 the behavior of clinicians in judging the 
Sequence a absence of a crisis. Is a reaction 
Oa prë iewed as a crisis if there is no awareness 
himself eating event either by the individual 
Ment? Ho by significant persons in his environ- 
Order to ow rapid must behavioral changes be in 
Person p Suspect that a crisis is present? Does a 
comfort eed to have some awareness of inner dis- 
Crisisp 7°? tension in order to be judged to be in 
there a S a person considered to be in crisis if 
ny re no external manifestations of disequilib- 
Stressful oes the rapidity of the resolution of the 
the expe experience have any bearing on whether 
ete re is judged as constituting a crisis? 
to consis instrument utilized in this investiga- 
ns Word ted of 14 brief (averaging approximately 
ist ties a each) case histories. These fictional 
p, Which moe described a single stressful event 
Ree ea central character was named Mr. Jones. 
palis te ax ustory was constructed to appear as 
ioe & on Possible, yet each contained information 
Preci (a) the awareness or lack of awareness 
Use ipitating event; (b) the rapidity of the 


i o F 
plema = disruption; (c) presence or absence of 
viio qo mtort; (d) external evidence of be- 
Ki isruption on the part of the presumed 


E e ‘Gun crisis; and (e) rapidity of resolution 
Ry e ict. Starting by dichotomizing each of 
istiare v ponents, 13 of the stories were so 
caneated in that the five variables under study 
hae other eae or less random conjunction with 
tor” in wee for example, Story B was com- 
eo low ich a known precipitating event was 
ny tort an sudden onset of both internal dis- 
th follow: external manifestations of disequilib- 
We Conflict ing which there was slow resolution of 
Bradual Story M was composed in which there 

o onset of externally visible disorganiza- 
awareness of internal tension, no known 
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precipitating event, and rapid resolution. The four- 
teenth story (Story N) involved a known precipitat- 
ing event followed by no symptoms of any kind, 
neither internal discomfort nor behavioral dis- 
organization. A group of professionally trained 
mental health workers served as judges during the 
construction of these stories, and a story was in- 
cluded in the final test instrument if there was 80% 
agreement by the judges as to the presence or 
absence of all five of the components intended to 
characterize each story. As examples of the fourteen 
stories, Stories B, M, and N are herewith presented. 


Story B 


Mr. Jones slammed on the brakes but not in 
time to avoid hitting the boy who had dashed 
out in the street in front of his car. Before he 
could even open the door of the car, he felt 
nauseated and very frightened. He managed, al- 
most blindly, to reach the front of the car but 
found himself unable to do anything to assist the 
moaning teen-ager who was badly cut and 
bleeding. When help arrived, Mr. Jones was in 
a dazed condition, unable to talk coherently about 
the accident but aware that he was tremendously 
disturbed. Fortunately, the boy’s injuries seemed 
much more severe than they actually were, and 
he recovered with no permanent injuries. Although 
Mr. Jones was found not to be legally responsible 
for the accident—and the boy whom he hit fully 
admitted that it was not Mr. Jones’ fault—it 
was months before Mr. Jones was able to talk 
about the accident and drive his car comfortably. 


Story M 


Mr. Jones’ peculiar behavior had begun very 
gradually. At first he seemed to become forgetful, 
then simply “off on another planet” most of the 
time. When Mrs. Jones would try to call it to his 
attention, she found him completely unaware of 
his actions. Then one day he just disappeared. 
There was simply no trace of him for nearly a 
week, when he was found by a United States 
Customs officer returning to New York on a 
flight from Europe. He was arguing with the 
officer about paying duty on a guitar he had 
bought in Spain, and he had attracted quite a 
crowd. He was very belligerent and the officer, 
suspecting he was not well, called the police— 
who in turn called an ambulance. In the ambu- 
lance Mr. Jones suddenly seemed to realize who 
and where he was and, after satisfying the 
medical authorities that he was well, was released. 
He was home again the next day, virtually back 
to normal. 


Story N 


The day after Mrs. Jones was killed in a plane 
accident, Mr. Jones reported for work on time 
just as he had done more or less regularly for 
the past several years. The office staff were rather 
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TABLE 1 


ENUMERATION OF THE COMPO) 


ENTS OF THE TEST STORIES 


Behavioral 
Internal disorgan- 
Onset of tension ization Rese ition 
à j = = — Resolutt 
Precipitating event symptoms a ae af eoniiict 
- Un- Sud- Grad- Recog- recog- Visi- visi- ~ SIAN 
Story Known known den ual nized nized ble ble Rapi 

7 A X X x X X x 
B x X X X N E 

€ x x x x : x 
D X X X X ” 

E x x xX x < x 

F xX X x x x 
ces x X xX x i . 
H x X X X X 

I x X X . X x 

J x X X X x 
K x X X X . 
L X X X x : 

X X X X < wanit 
x >a None x bg Not relevan 
surprised to see him, particularly since Mrs. RESULTS 


Jones’ death had been so tragic—and they had 
been so much in love. The staff remarked to 
each other how well he was reacting to the 
tragedy and how no one could ever have guessed 
that so terrible a thing had happened so recently. 
Mr. Jones’ boss was particularly understanding 
and tried to induce him to take time off, but Mr. 
Jones seemed willing to work and was as friendly 
and relaxed as ever. Weeks later Mr. Jones re- 
marked that his wife's death had been a blow to 
him, but life had to go on. He said that he had 
never felt any serious discomfort or any real 
difficulty in carrying on his normal activities, 


Enumeration of the components of the 14 test 
stories is presented in Table 1, As will be seen, 
there are considerably more stories in which a 
precipitating event is Present than in which no 
Precipitating event is known, Except for this vari- 
able, the stories distribute themselves nearly equally 
with regard to each of the components under study. 

The test instrument individually adminis- 
field of crisis theory, 
e Community Mental 
ard School of Public 
on a separate sheet 


a S questions. The first 
asked, “On the basis of your understanding of the 
concept of ‘crisis, did this event constitute a 


crisis for Mr. Jones?” The judge was asked to 
check one of the three choices, namely, 
or “do not know.” The second question 
sentence or two, why did you check this al 


“Yes,” “no,” 
was, “In a 
ternative?” 


+o the 
e 
The measure employed to determine g 


sydge 
extent to which each event was jue tota! 
constitute a crisis was to divide the -pe 


number of “yes” judgments by the sum Story 
“yes” and “no” judgments on each À 
This measure treats the “don’t know 
ments as qualitatively different RoT ii o 
“yes” and “no,” and a separate estimat” pe 
uncertainty was made by calculating tory 
Proportion of all judgments to each ory" 
which were in the “don’t know” C3 crisi? 
In only 5 of the 14 stories were the 
judgments unanimous, In the case rans", 
remaining 9 stories, “yes” percentage yes 
from 20 to 86. In 4 of the stories, t “pm 
percentages were between 40 and 63- 0 I 


. n 
know” percents ranged from 0 Ei oi 
only 5 stories were there no “ao ar! g 
judgments. The judges were thus ° PE 
unanimous in their judgments. o we 

In the analysis of the judgment © „nde 


as a function of the five componenoatedly 
study, the set of 14 stories was all stori 
subdivided so that judgments tO nen COs 
containing each particular comp® a stor is 
be contrasted with judgments to analy? 


oe ent 
not containing that same compo” 


| 
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nae relationship of these components to 
oe of crisis is presented in Table 2. 

ae = variables, the crisis judgment 
is pa ated to two. The judgment of crisis 
ka = significantly more often when there 
precipitati precipitating event than when the 
significanthe event is unknown—and IS made 
resolutio y more often when there is slow 
eee 7 as contrasted with rapid resolution. 
ficult, S of crisis are made with particular 

oa yen the precipitating event is un- 
high lev poe this circumstance there is a 
judges ae uncertainty on the part of the 
precipitati ith respect to knowledge of the 
ing event, it should be pointed out 
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appear to be unrelated to variations in the 
other three variables. 

A rapid resolution was defined as one which 
took place within a week; a slow resolution 
was one which required between 1 and 2 
months. In the case of this variable, episodes 
in which there is slow resolution are judged 
significantly more often to be crises than 
episodes in which the resolution is rapid. In 
view of the fact that episodes with known 
precipitating events tend to be judged as 
crises and episodes with rapid resolution 
tend not to be judged as crises, it might be 
anticipated that histories characterized by 
both known precipitating events and rapid 


that o È 
judg the six individuals who made a crisis resolution would be particularly ambiguous 
si ‘ai to Story N are unanimous in con- to the judges. Five stories share these two 
tease it a crisis—although in this story no attributes, and “yes” percents to these par- 
Ons follow the event. Crisis judgments ticular stories range between 57 and 72, 
TABLE 2 
at Crisis JUDGMENTS AS A FUNCTION OF THE EXPERIMENTAL VARIABLES 
Judgments 
ieee No. of = = 
s “perimental variable stories Yes No Dk Yes Dk% 
Precis — PN 
Cipitars 
Dating event Known i 88 65 16 7 80 8 
Unknown 3 24 3 11 10 21 42 
Difference X P i 9 
S.E. Dif. è Eo) 
CR 5.0* 32t 
Ray: eR 
Midity g 
1Y of onset Sudden 7 3% n 7T B 13 
Gradual 6 20 14 8 05 17 
Difference 8 4 
S.E. Difi. 9.8 7.0 
CR. ns ns 
terng 
l tension Recognized 8 o4 36 18 10 67 16 
Not recognized 6 48 32 9 7 78 15 
Difference 11 1 
S.E. Diff. 9.1 6.9 
CR ns ns 
Be i IR: 
Vior = 2 
tisora Visible 9 4 igs 70 15 
8anization Not visible 5 40 25 9 6 74 15 
Difference i 0 n 
S.E. Diff. 9.5 7. 
CR ns ns 
Dee, nis 
f resolution Rapid 7 56 27 20 9 37 16 
Slow 6 48 35 7 6 83 13 
Difference 26 3 
x 9.3 6.9 
2.8* ns 
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indicating extensive disagreement among the 
judges. , 

The results of the analysis would seem 
to indicate that in practice, a crisis is defined 
primarily in terms of knowledge of a precip- 
itating event and secondarily in terms ofa 
‘slow resolution. Known precipitating events 
are generally judged to lead to crises if (2) 
there is no reaction or if (b) there is a reaction 
of any kind and resolution requires a month 
or more. The judges’ comments suggest that 
situations in which the resolution is rapid 
are commonly viewed as episodes illustrating 
appropriate responses to reality situations. 
Reactions of any kind which appear when 
there is no known precipitating event are 
likely to be considered psychiatric disorders 
rather than crises. Since in practice inter- 
vention ordinarily takes place after the 
identification of some precipitating event 
which might lead to crisis but before resolu- 
tion of any resultant stress, the data were 
reanalyzed in order to determine whether, 
within the group of 11 episodes in which a 
precipitating event was known, crisis judg- 
ments were related to the nature of subsequent 
reactions. Crisis judgments in the case of the 
histories in which there was a known pre- 
cipitating event and sudden onset were con- 
trasted with crisis judgments in which there 
was a known precipitating event and gradual 
onset. Similarly, comparison was made be- 
tween judgments in instances of known pre- 
cipitating events and recognized internal ten- 
sion and judgments in instances of known 
precipitating events and unrecognized internal 
tension. The final analysis contrasted crisis 
judgments in cases of known precipitating 
events and visible behavioral disorganization 
with judgments in cases of known precipitat- 
ing events and no visible beha 


vioral dis- 
organization. In no case was a significant 
difference found. 


Discusston 

In order to test the efficac 

at times of crisis, 
crisis must be 

identification, one 


y of intervention 
a sample of people in 
identified, Following this 
could contrast outcome in 
a subgroup who had been exposed to inter- 
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vention procedures with another sub f 
who had not been exposed to such proce E. 
Outcome in both subgroups could be Ea 
pared with that in a noncrisis Kom this | 
whether or not a controlled study ° e the 
kind is undertaken, the identification k E 
crisis subgroup should be sulficlen y ieh 
ambiguous so that it is the intervention © o 
is clearly the subject of study. Pae at- 
intervention procedures should not, state. 
tributable to misdiagnosis of the anai ne 
The single most contributory varia resen 
volved in the crisis judgment in the p 
sample appears to be the existenc F 
precipitating event. On this basis, A, K 
simply define the crisis state as ine js a 
following certain specified events. T | 
appealing operational solution to the in that 
definition, but judges do not maintal e 
crises inevitably follow certain events: -a 
does not define crisis solely by the G tual 
of some event of the life of the person, | 


ae oup 
practice would dictate that within a g" pould 


be able to distinguish between those I state 
for whom the event results in a on cris 
and those people who seem not to be e t 
as a consequence of the event. The Pr“, ig 


serimi 
findings suggest that this kind of dist!” per 


a t a] fur 
tion may not now be possible. Until 5 ake™ 
refinement of the crisis concept is nn pyentio" 
assessment of the effectiveness of inte 


efforts will be difficult, 
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THE EDWARDS SOCIAL DESIRABILITY SCALE AS A 


SHORT FORM OF THE MMPI 


MAURICE 0. ELVEKROG * ann NORRIS D. VESTRE ? 


Veterans Administration Hospital, St. Cloud, Minnesota 


prediction of MMPI scale scores from 


This study evaluates the accuracy of 


the Edwards’ SD scale. SD-predicted MMPI scores were obtained according to 


a procedure presented by 


for 1 normal and 2 psychiatric samples. 
predicted and obtained scores via pro 


significance of the differences between the means, 
the predictive efficiency of the Edwards’ 


with different MMPI scales and, to a 


predictive errors indicates that 
procedure shows striking variability 


lesser extent, with different population samples. In genera 
SD scale as a “short MMPI.” 


to support the use of the Edwards 


a tremendous amount of work that has 
Psycho been done on “response styles” in 
aly Bec tests has culminated in the 
the ane, Edwards (1962) that at least for 
Rents innesota Multiphasic Personality In- 
lla G (MMPI), the variance of the test 
abilit 1S SO largely a function of social desir- 
a y that an individual’s MMPI scores can 
meet simply on the basis of his Social 
the Sp ility (SD) score, and that, in fact, 
(Edw scale is “a short form of the MMPI” 
See & Walker, 1961). While there are 
k to methodological problems involved in 
ent ie ag offered for this claim, the pres- 
wheth udy is confined to the question of 
is of er the procedure suggested by Edwards 
of the wh practical utility as a short form 
Bie IMPI. Holloway and Gocka (1962) 
Beg Ned a table based on the Edwards 
ion equation. In the present study 

fe dicted MMPI scores were obtained 
value rhe table which gives the predicted 
of the or the usual clinical and validity scales 
(0 e MMPI for all possible SD scale scores 

to 39), 


5 METHOD 
ample 


Th i 
a E different samples were studied since Edwards 
alker (1961) assert that prediction may be 


es 

1 
N : : 
Near OW at Veterans Administration Hospital, Min- 


Dolis. M; 
2 i Minnesota. 
W at University of Minnesota. 


Edwards and were compared with actual scores 


Analysis of the relationship between 


duct-moment correlations, testing of the 


and a tabulation of the 


1, the results failed 


more accurate for some groups (with high SD 
scores) than for others. Sample A was composed 
of 76 chronic schizophrenics who had been hospital- 
ized continuously in a Veterans Administration 
neuropsychiatric hospital for approximately 4 years. 
Their mean age was 39 years. Sample B consisted 
of 102 consecutive admissions to the same hospital. 
These had a mean age of 38 years. Fifty-eight per- 
cent had a psychotic diagnosis (most often schizo- 
phrenia) ; 30% had a neurotic diagnosis; the re- 
mainder had a personality trait disturbance, char- 
acter disorder, or brain syndrome as a diagnosis. 
The admissions group, being more acutely disturbed, 
was expected to have more elevated MMPI profiles 
and lower SD scores than the chronic group. Group 
C was composed of 78 student nurses affiliated with 
this hospital. This group, whose mean age was 20 
years, was expected to have higher SD scores than 
the patient groups and to resemble more closely the 
college students which have been the subjects of 
many of the earlier studies on this topic. 


Procedure 


The MMPI (Booklet Form) was administered in 
the usual way and scored for the Edwards SD scale 
in addition to the usual clinical and validity scales. 
For each subject, the Holloway and Gocka table was 
entered with the individuals SD score and the 
predicted MMPI scale scores were read. These SD- 
predicted scores were K-corrected, where required, 
and converted to T scores for comparison with those 
obtained from the scoring of the actual MMPI 
records. 

Three analyses were done: (a) Product-moment 
correlations between predicted and observed scores 
were compared and tested for significance. (b) The 
means of the differences between observed and pre- 
dicted scores were evaluated by means of a £t test. 
(c) A table of cumulative frequency percentages of 
absolute differences between predicted and observed 
scores was made. This may be the most relevant 
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TABLE 1 


CORRELATIONS BETWEEN OBSERVED AND SD-Pre- 
DICTED MMPI Scare T Scores 


Chronics Admissions Nursts 
Scale (N = 76) (N = 102) (N = 78) 
I —.30** —.10 —.15 
F 4 iaig F- ag .46** 
K 74** Kri iua 165%% 
Hs 25* .66** —.01 
D m” Py saad .50** 
Hy 07 Ey a 10 
Pd 00 At =21 
Mf —.27* —.02 O4 
Pa .69** .58** 19 
Pt LIS .48** 37 
Sc .82** 59%% =13 
Ma 46** .20* —.06 
Si .73** 71 65** 
*p <05. 
+ > S01, 


consideration for clinical Purposes, as from it can 
easily be determined the frequency with which 


predictive errors of specified magnitudes existed in 
cach sample. 


RESULTS 


Product-moment correlations between SD- 
predicted and observed MMPT scores are 
presented in Table 1. As can be seen, there 
are considerable differences in level of correla- 
tions obtained for different scales and differ- 
ent samples. Consistently high correlations are 
produced in the case of the K and Si scales 
and, to a lesser extent, with the F, D, Pa, Pt 
and Sc scales. It is worth pointing out, how- 
ever, that of the 39 coefficients in Table 1, 
nine ate negative (only two of these nine are 
significant) ; also, six more are not signifi- 
cantly different from zero. All of the remain- 
ing coefficients (24 in number) are positive 
and significant. In the case of the L, Hy, Pd, 
and Mf scales, in particular, prediction is 
quite inaccurate. The correlations associated 
with these scales are either uniformly, or al- 
most uniformly, negative or nonsignificant. 
The correlations for the student nurse sam- 
ple are lower, on the whole, than for the 
other two samples. The average y values for 
the chronics, admissions, and nurses are .38, 


41, and .18, respectively. It will be noted that 
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there are a number of instances wherein a 
student nurse sample stands out by we A 
disturbing the pattern of coefficients. a 
example, for the two psychiatric pare 
positive and significant coefficients obtain om 
the Hs, Sc, and Ma scales while for be 
student nurses low negative correlations W 
found on these three scales. 

Table 2 indicates that for most of the x 
there are systematic errors in prediction a 
are statistically reliable. Predictive aerial 
could, therefore, be improved by the ad i 
or subtraction of a constant appropria ai 
the scale in question, although this eo we 
ment would be limited by the size of the ew 
relations reported in Table 1. There ees ai 
be consistency between these three samp ee 
whether a scale is overpredicted or sen ot 
predicted, but there is one obvious or A 
to this rule. This exception is the one 
which is underpredicted for the two alee 
tric samples and overpredicted for the a 

Another procedure which would snak 
offer meaningful data by which to Se la 
the accuracy of SD-predicted MMPI ances 
scores is a tabulation of absolute dieren asi 
between observed and SD-predicted T ative 
In Tables 3, 4, and 5 are presented gene 
frequency percentages of predictive 


TABLE 2 


FER- 
ANAtysis or MMPI T Score MEAN pA 
ENCES (OBSERVED MINUS SD-PREDICT a 


Chronics Admissions Nura 
Scale (N =76) (N=102) (V= 
L —2,99* —7.56** 
E —10.88** —27.45** 
K —=2,33** —4,93** 
Hs —4.03* —8.86** 
D 2.08 3.38* 
Hy —4.50** —3.32** 
Pd 5.11* 1,37 
Mf —16.41** —11.97** 
Pa —6.21%* —11.47** 
Pt —3.69** —2.94* 
Se —10,14** —20.77** 
Ma —8.89** —10.10** 
Si 3.21** 4.34** 
ned For. 
Note-—Where mean difference is positive, Vite hig! 


i ii + 1 scl 
are higher; where it is minus, $D-predicted sc 
* 


p S .05. 
*+b S01. 
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TABLE 3 


CUMULATIVE FREQUENCY PERCENTAGES INDICATING DIFFERENCES BETWEEN OBSERVED 
AND Prepicrep MMPI T Scores For CHRONIC SCHIZOPHRENIC SAMPLE 


Ws 


76) 


Absolute differences between observed and predicted T score 


Scale <5 <10 <15 <20 <25 <30 <35 <40 
L 42 55 83 87 97 100 
F 21 47 61 76 91 93 97 99 
K 43 80 91 100 
Is 26 58 71 79 83 89 93 93 
D 14 39 68 83 93 93 96 97 
Hy 42 62 78 84 91 93 95 97 
Pd 25 39 54 70 78 87 93 95 
Mf 9 20 42 58 82 92 97 100 
Pa 24 51 72 88 96 99 100 
Pl 37 71 83 92 97 99 100 
Se 24 39 63 82 88 93 97 99 
Ma 22 53 67 83 92 96 100 
Si 42 80 91 97 100 


ia re chronic schizophrenic, psychiatric hos- 
ie admissions, and student nurse samples, 
aden Although this type of analysis 
test Ittedly lacks the elegance of a statistical 
free it may be argued that it is quite relevant 
« determining the practical utility of a 
Short MMPI.” 


A few aspects of Tables 3, 4, and 5 should 
be noted. The scale which is most accurately 
predicted over all three samples is K. For the 
total V of 256 (76+ 102 + 78) there are 
only three instances where K is missed by 
more than 19 T score points and for over 75% 
of these subjects the prediction error is not 


TABLE 4 


CUMULATIVE FREQUENCY PERCENTAG: 


ES INDICATING DIFFERENCES BETWEEN OBSERVED 


AND Prepictep MMPI T Scores For NP Apuissions SAMPLE 


A (N = 102) 
= = an 
Absolute differences between observed and predicted T score 
Scale <5 <10 <15 <20 <25 <30 <35 <40 
L 25 35 69 83 99 100 
i 12 23 27 31 42 47 61 70 
K 39 74 87 97 100 
Hs 16 34 61 72 81 87 96 o7 
D 22 46 70 83 89 93 95 08 
Hy 27 60 75 89 95 96 98 99 
Pd 20 42 61 70 80 88 93 96 
Mf 11 31 57 76 88 94 99 100 
Pa 27 47 60 72 88 95 96 98 
Pt 27 50 78 89 95 98 100 
Sc 12 22 33 47 53 64 75 82 
Ma 16 38 58 76 87 94 08 09 


Si 32 68 91 98 
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TABLE 5 


CUMULATIVE FREQUENCY PERCENTAGES INDICATING DIFFEREN 


AND PrEDIcTED MMPI T Scores 


(WN = 


‘ 


ES BETWEEN OBSERVED 
FOR STUDENT Nurses SAMPLE 


78) 


Absolute differences between observed and predicted T score 


Scale <5 


<10 <15 <20 <25 <30 <35 <40 
f 13 27 73 85 100 ô 
I 6 17 4 58 73 86 90 
K 4 73 90 100 
Hs 40 65 86 99 100 
D 51 77 90 97 100 
Hy 45 79 94 100 
Pa 28 65 77 92 96 99 100 
Mf 19 37 67 83 94 99 100 
Pa 17 45 62 81 95 97 97 100 
Pi 44 82 97 100 
Se 23 41 55 72 83 94 97 97 
Ma 24 47 65 82 90 95 97 100 
Si 29 69 90 97 100 
—— 


greater than 9 T score points. Most of the 
other scales are not as accurately predicted. 
In particular, it might be noted that the pre- 
dictions for F and Sc are quite inaccurate. 
In the case of Sc, approximately one-third of 
the total N of 256 are missed by over 19 T 
score points and approximately two-thirds are 
missed by over 9 T score points. Admit- 
tedly, the evaluation of an “acceptable” 
amount of error for practical purposes is a 
matter of judgment, but it would certainly 
appear that the amount of error involved in 
the prediction of Sc, F, and certain other 
scales could not be countenanced. 


Discussion 


The various analyses comparing SD- 
dicted and observed MMPI scale sc 
the three samples indicates consideral 
ability in predictive efficiency depend 
scale and, to a lesser extent, on sam 
of the observed correlations are ind 
a statistically reliable relationship between 
predicted and observed scores, but for some 
scales the relationship is extre 


mely low and, 
for others, even negative, A Statistically 
reliable difference between the means of ob- 


pre- 
ores for 
ble vari- 
ing upon 
ple. Most 
icative of 


tained versus predicted scores holds for neatly 
all the MMPI scales. A table of predicted $ 
Score errors of various magnitudes indier 
that these errors are indeed sizable for ha 
scales and, thus, that the Edwards predici 
procedure could hardly serve for purpos ate 
Practical prediction. This does not V! on- 
the theoretical significance of the dena 
strated relationships between the SD scale 
many of the MMPI scales. 

Results for the student nurse sample 
in a number of respects from results oe 
patient samples. This difference appe? ange 
be largely a function of a restriction © puss 
in MMPI scores for the nurse sample. 7) is 
correlations obtained tend to be lower now” 
group. It also appears that the Ho thin 4 
Gocka table tends to give predictions Ya ins 
restricted range and when a sample © rane” 
Profiles almost all within the normal T gay 
there is not the possibility of a5 in PX 
erroneous predictions as might gar s It 
tients groups having more deviant n result? 
is clear also that some differences i sample 
occur because the scores of the n bn Le ie 
tend to cluster at a different ore is ext 
scales than the patient scores do; th! re the 
plified in the case cited above whe 


differ 
r the 
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scale is underpredicted for the patient samples  Evwarps, A. L, & Water, J. N. A short form of 
and overpredicted for the student nurse the MMPI: The SD scale. Psychol. Rep., 1961, 8, 


485-486. 
sample. Hotroway, H., & Gocxa, E. A table for obtaining 
expected scores of MMPI scales from a social 
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Artistic excellence correlates with evaluat 
from drawings of human figures. In this 
part, as the accuracy of proportional re 
6 comparable sets of drawings varying 
accuracy and 6 attributes inf 
psychologists ranked drawings 


suggest that adjustment, artistic merit, 


difficulty are all strongly related to pro 
aggression is not. It is proposed that proportional 
artistic values be considered as an underlying di 


ment of human figure drawings, 


Whitmyre (1953) argues cogently that the 
artistic excellence of drawings of human 
figures is a very powerful influence upon the 
clinician’s evaluation of emotional adjust- 
ment. Indeed one is left with the impression, 
after studying his results, that artistic excel- 
lence is perhaps all that is used as a basis of 
judgment. Sherman (1958) reported a study 
similar to Whitmyre’s with essentially the 
same results and conclusions. The two studies 
help strengthen each other to the point where 
the present authors feel the combined results 
are an important assault upon the clinical use 
of figure drawings. Yet as conclusive as the 
two studies are, questions still remain. 
Neither Whitmyre nor Sherman attempts a 
definition of artistic excellence. They are wise. 
Furthermore both investigators limit them- 
selves to the one attribute of psychological 
adjustment. In this study we approach both 
problems by Proposing an operational defini- 
tion of an aspect of artistic excellence and by 
extending the categories to be judged. 

To avoid sinking into the morass of defin- 


1 This work was done when th 
the District of Columbia Gener 
Georgetown University. 

? Now at Saint Elizabeth’s Hospital. 


e author was at 
al Hospital and 
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Maryland 
ions of adjustment made by clinicians 
study artistic excellence is defined, in 
lationships among parts of the body. 
along the dimension of proportional 


erred from drawings were selected. 36 clinical 
in accordance with the attributes, 


The results 
» intelligence, and sexual 
ccuracy. The attribute of 
accuracy rather than creative 
mension in the clinical judg- 


dependency, 
portional a 


ing artistic excellence, we have selected on? 
obvious element of the concept which we fee 
is crucial in the judgment of figure drawin 
— proportional accuracy. The morphology i 
the human figure contains relatively sab? 
relationships among parts, The graphic phe: 
for example, may be accustomed to conceiving 
of and drawing the male figure as seven and å 
half head-lengths tall; shoulders are = 
head-lengths wide, arms are three ome 
lengths long, etc. (Bridgeman, 1928). pae 
length of the head provides an easy unit oa 
statement of ratios among body parts. Tt ns 
to the accuracy of these ratios that clinic 
and artists respond, we hypothesize, whe 
they judge the drawings of human figures. gè 

But does the clinician respond tO Pe 
portional accuracy as he judges a wide wang 
of personality characteristics? The omer 
both Whitmyre and Sherman suggests ast 
Possibility of an affirmative answer at nN 
for adjustment and artistic talent. Ewe ajed 
ies by Albee and Hamlin (1949, 1950) cept 
lend support to a “global judgment” cor ith 
of the adjustment dimension. In a it 
children’s drawings, Coltharp (1961) t ule 
clinicians able to predict intelligence ons 
able to predict either sociability or em? 
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adjustment. Based upon the authors’ own 
experience and the foregoing studies, three 
characteristics to be judged were isolated— 
adjustment, artistic merit, and intelligence. 
A count of traits from Machover’s (1949) 
standard work suggested three other charac- 
teristics—dependency, aggression, and sexual 
difficulties. Having selected six characteristics 
which seem possible to judge from drawings 
of human figures we hypothesize that pro- 
portional accuracy has a strong influence 
upon clinicians’ judgments of both general 
and specific characteristics of personality. 


PROCEDURE 


fone clinical psychologists were asked to 
the order separate sets of drawings for each of 
5 six selected characteristics. Every drawing had 
a known proportional accuracy score. The develop- 
ment of six comparable sets of nine drawings each 
Proceeded as follows: 
a Three hundred and sixty drawings were avail- 
i a study done by Minsky and Levy (1961). 
si ch drawing had been scored along seven dimen- 
ons of proportional accuracy as described in a 
eo by Levy and Minsky (1962). The seven 
of en; were added together for an overall estimate 
Proportional accuracy after each distribution 
= been normalized and placed on the same scale 
using the T Scale conversion. 
me The distribution of overall proportional ac- 
T cy scores was plotted. Using the distribution as 
Stee 54 drawings were selected using the 
ma E criteria: (a) Six drawings had to be 
he at each of nine different levels of accuracy. 
iden of each of the six drawings had to be 
of ee as homogencous as possible. (b) Each 
to ¢ he nine groups of drawings had to be selected 
yet pias the widest possible range of accuracy and 
(c) AG reasonably equal units between them. 
damea A of the drawings selected could be un- 
SKON, » have missing limbs or have sexual organs 
usta oo through clothing. This latter criterion was 
th to avoid stimuli so salient and misleading to 
e clinician that their good sense might be aban- 
doned, 
ies After the six sets of nine drawings were 
cura, ed an analysis of variance of the overall ac- 
As A scores resulted in an F of .01 between sets. 
cone ar as homogeneity of scores within sets is 
4 erned therefore, they are comparable. 
- Looking at the array of selected drawings 


how, x 
ohm their qualitative heterogeneity became 
lous. Consequently 10 judges were asked to 


Pree the six drawings at each accuracy level ac- 
ing to how good the drawing was. The averages 
to these ranks were taken and drawings assigned 
thog tS accordingly. Thus Set I had all nine of 

Ose drawings ranked best, Set II had those ranked 


next best and so on with Sct VI having all of 
the nine drawings ranked lowest by judges. An 
analysis of variance of the accuracy scores of this 
new arrangement of drawings resulted in an F of 
.01 between sets. 

5. Completed the development of the stimuli. 
Six sets of nine drawings were selected each with 
a wide range of proportional accuracy yet each 
with comparable levels of accuracy represented. In 
addition, an attempt was made to make each set 
qualitatively homogeneous. 

Thirty-six subjects were selected to judge the 
drawings. Each subject is a clinical psychologist 
with experience in the interpretation of drawings of 
human figures. The mean of their experience is 
6.7 years, There were 20 males and 16 females in 
the sample of judges. Judges were asked to rank 
order cach set of drawings according to a different 
characteristic. A 6X6 Latin square provided the 
presentation order of the six sets of drawings; the 
six traits were randomized and then assigned to 
the sets in the same order for each of the first 
block of six judges. The Latin square and assigned 
traits were then inverted for the second group of 
six judges, then used in the original order for the 
third group of judges and so on. This procedure 
provided a random and counterbalanced order of 
presentation. Finally the nine drawings in each set 
were randomized for presentation to the judges; 
this random order was also reversed for every 
other judge. The directions to judges were: 


I am going to ask you to participate in an 
experiment on the clinical evaluation of drawings 
of human figures. In each of these six folders 
are nine drawings, each of which was done by a 
different person. I would like you to rank order 
the drawings in each folder according to the one 
of six dimensions which I will describe later. 
Thus, there will only be six rankings. Indicate 
your rank order by arranging the drawings in 
sequence. Brief descriptions of the end points will 
be provided. In cases of a tie among drawings, 
treat the ties as paired-comparison judgments. 


Two cards containing brief descriptions of the 
“good” and “bad” extremes of each trait marked 
the end points of the ranks. At the completion of 
every ranking, the overall proportional accuracy 
score for each drawing was entered in the rank 
position given the drawing by the judge. 


RESULTS 


Analysis of variance is used: one category 
of analysis, judged assigned ranks, provides 
special problems of experimental dependence 
and correlation across ranks since a single 
judge ranked the nine drawings in each set. 
Consequently a conservative model for anal- 
ysis is required. Edwards’ (1950) model for 
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TABLE 1 


Variance Ratios For Two Sources oF VARIANCE 
AND Eacn CHARACTERISTIC JUDGED 


Sources 
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TABLE 2 


CORRELATIONS BETWEEN PROPORTIONAL ACCURACY | 
AND AVERAGE RANK FoR Eac CHARACTERISTIC 
JupcED 


WN = 54) | 


Interaction 


ranks 
Ranks X sets 
Characteristics (df =8) (df = 40) 
Adjustment 20.70** 1.56* 
Aggression 3 1.38 
Artistic merit 25.20** 3.04** 
Dependency 9.36** 69 
Intelligence 38.64** VPY aaa 
Sexual difficulty 2.90* 1.57* 


05. š 


repeated measurements is used in this study. 
Rank is the source of variance of interest: 
the interaction, sets by ranks, is the most 
conservative error term for testing that effect 
and consequently is the one used. 

Table 1 contains a summary of the anal- 
yses. These results are dramatic. Proportional 
accuracy is apparently a compelling cue 
utilized by clinicians in their judgments of 
general and specific personality character- 
istics. The only exception to the foregoing 
conclusion is the characteristic of aggression. 

The results are further elaborated in Fig- 
ure 1. The mean overall proportional ac- 
curacy score for each rank is computed by 
combining all scores across sets and dividing 
by the number of judges. Thus each mean is 
based upon 36 rankings. The mean accuracy 
scores for each of the six characteristics is 


165 Artistic 


Merit 


H = Intelligence 
—— Adjustment 
AVERAGE 125 Dependency 
OVERALL Sexual 
PROPORTIONAL 105 Oitticutty 
ACCURACY 
SCORE 
as 
65 Aggression 
45 
25 
4 5 6 7 
RANK " 3 
Fic. 1. The relationship between 


proportional 


accuracy and the rank assigned human figure draw- 


ings by clinical psychologists, 


Characteristics Correlations 


Adjustment .76* 
Aggression —.06 

Artistic merit .85* 
Dependency 03" 
Intelligence .84* 
Sexual difficulty 52" 


*p <0. 


plotted against rank. Every point, therefore, 
is the mean of the overall proportional ac- 
curacy score of drawings placed in that rank. 
In general, the curves for the five significant 
characteristics rise slowly until the middle 
rank and then accelerate sharply. This result 
is an artifact of the selection of drawings- 
Although attempts were made to keep dis 4 
tance between levels of accuracy constant, m 
fact the distances between levels became 
larger as overall accuracy decreased. The 
shape of the curves in Figure 1 when COn- 
sidered with the progressive variation in siz 
of intervals between drawings suggests that 
judges make fine discriminations less easily 
than coarse ones, 

Table 2 contains the results of a correla” 
tional analysis. Each of the 54 drawings }§ 
ranked by 6 judges for each characteristic: | 
An average of the 6 ranks is related to ° | 
overall proportional accuracy score to provide 
a more rigorous statement of the relationship 
implied in Table 1 and Figure 1. 


TABLE 3 


FOR 
ASS RELIABILITY CogrFIcIENTS 


AVERAGE ĪNTRACL. 


D p 

EACH CHARACTERISTIC Juncen anp ror Eactl : 
Characteristics I mom y Y : l 

si 

Adjustment 650 66 83 122 
Aggression OF ee 3 a h 16 
Artistic merit 93 92 (94 92 BF 4S 
Dependency +10 54 Z a 20 af 
Intelligence 93 9293 1904 
Sexual difficulty 3600780 25 = 2 

* Based upon 6 judges and 9 drawings. r ct- 

» No relationship, error term greater than main effe 
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Although reasonable degrees of interjudge 
reliability and reliability of means can be 
inferred from the results of the analyses of 
variance reported in Table 1, a more exact 
opportunity to test these reliabilities is de- 
scribed by Ebel (1951). The data easily 
lends itself to such analyses. Table 3 con- 
tains the results of this exploration of the 
stability of the mean ranks. The analyses are 
based upon groups of 6 judges each of whom 
ranked the same set of drawings for the same 
characteristic. Ranks rather than overall 
proportional accuracy scores are used. 


DISCUSSION 


The results reported above seem to sup- 
port the initial hypothesis in the main. 
Whether in or out of awareness, the clinician 
1S responding to the proportional accuracy of 
the human figures he interprets. That pro- 
Portional accuracy is basic to judgments of 
Specific as well as general characteristics is 
not as easily documented. In the first in- 
Stance, the attribute of aggression has no 
relationship to proportional accuracy. But the 
Tesults of the intraclass reliability computa- 
tions strongly suggest that judgments of this 
attribute are not reliable. For example in 
Sets II and IV, the variability of rankings is 
So extensive that it transcends any implica- 
tion of central tendency for drawings. The 
Ta remaining coefficients are quite low. It 
ch Possible that aggression is too general a 
oo Fractionating it into extropuni- 

veness and intropunitiveness might have 
epee more reliable judgments if not a rela- 
lonship to proportional accuracy. 
i, Of the remaining five characteristics judged, 
ttelligence, artistic merit, and adjustment 
ay by virtue of the degree of their relation- 
et to proportional accuracy and reliability, 
DE be rather clearly associated with the ex- 
rimental variable. Although the correlations 
oe proportional accuracy and depend- 
bd and sexual difficulty are highly sig- 
E seen and reasonably high, their intraclass 
Cliabilities suggest some limitations upon 
fen talization. Table 3 does not reveal a sys- 
Se ei pattern of reliabilities by sets. It is 
at ough a “groups of judges X sets X char- 
eristic” interaction is at work. Some groups 


sil 


of judges seem to respond differently to some 
sets of drawings and, in addition, respond 
differently depending upon the characteristic 
they are judging. It may be that central 
personality dimensions such as dependency 
and sexual difficulty are still too diffuse as 
concepts to enable reliable judgment. Per- 
haps too, sets with greater intervals of pro- 
portional accuracy between drawings might 
have resulted in more stable judgments. 

A comment about artistic value as an ex- 
planatory variable in figure drawing judg- 
ments is appropriate here. If one considers 
the highly personal conception of the human 
figure revealed in paintings by contemporary 
artists such as George Cohen, Nathan 
Oliveira, and Jacob Landau to name a few, 
the notion of esthetic merit in its creative 
sense ought to be abandoned as an explana- 
tion of clinicians’ judgments of drawings of 
human figures. Paintings by these artists can 
be found in the Museum of Modern Art 
exhibition catalogue, Recent Painting USA: 
The Figure (1962). This “new”? image of 
the figure gives it primacy in the painting, 
emphasizes it and minimizes context. The 
figures are often distorted, featureless, and 
unclothed. They are painted with assertive 
strokes and usually without regard to anatom- 
ical precision. Human figures drawn in the 
spirit of these artists might easily be called 
“sick” by a clinician. This study suggests 
that one might better substitute proportional 
accuracy for esthetic merit as an underlying 
variable in clinical judgment of drawings. 
Indeed the .85 correlation coefficient between 
the two dimensions suggests that they are 
close to being interchangeable. 

Although the results of this study seem 
conclusive, caution should be observed. Pro- 
portional accuracy is only one aspect of 
representational accuracy. Within the latter 
domain, variables such as symmetry, anatom- 
ical detail, clothing detail, posture, facial 
expression, etc. are all akin to proportion. 
However, since one expects all representa- 
tional variables to be highly intercorrelated, 
proportional accuracy is probably as reason- 
able a representative of the domain as one 
can find, particularly because of the ease 
with which it can be measured. 
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THE RELATIONSHIP BETWEEN NEUROTICISM, STRESS, 
AND QUALITATIVE PORTEUS MAZE PERFORMANCE* 


ARISTOTLE A. ANTHONY ? 


Teachers College, Columbia University 


The results of this experiment are the first empirical demonst 
ety and Q score. Specifically, this research 


(a) the Q score of neurotic adolescents is 
dolescents (p <.001), (b) the Q scores 
dolescents are higher under interfering than 
p<.01), and (c) the difference in Q 
erfering test conditions is greater for 
dolescents (p <.001). For the experi- 


existing relationship between anxi 
tested the following hypotheses: 


higher than the Q score of normal a 


of both normal and neurotic a 


under noninterfering test conditions ( 
scores between interfering and nonint 
neurotic adolescents than for normal a 
ment, 30 neurotic and 30 normal adolescents were 
interfering test conditions, and then under 
groups were equated for age, intelligence, and p: 


me: igi of this study was to investi- 
qualit e relationship between anxiety and 
a ative aspects of , performance on the 
Vestia Maze test. This relationship was in- 
e by comparing the qualitative 
iea E Maze performance, operationally de- 
adole y the Q score, of normal and neurotic 
eti scents under interfering and noninter- 
T test conditions. 
i = Q score was developed by Porteus 
iffer Ponse to his observation that qualitative 
indivi in maze performance Occur. Two 
Store uals may earn the same quantitative 
of th or mental age, but the style or quality 
1942 T performance may differ widely. In 
stud ’ orteus and Honzik began a systematic 
Were 3 qualitative errors in execution which 
Qent aracteristic of the responses of delin- 
Score s hese responses became known as Q 
sidered he characteristics of performance con- 
161-4 in the Q score (Porteus, 1950, pP- 
of t 62) are: (a) an error in the first third 
iy maze; (b) an error in the last third 
hey maze; (c) a cut corner; (d) a crossed 
the? (e) a lifted pencil at any point except 
exit; (f) a wavy line; (g) a wrong direc- 


lpp: 
thins paper is based upon a doctoral disserta- 
Partm, Teachers College, Columbia University, De- 
TI aeS of Psychological Foundations and Services. 
diserta are due to Joel R. Davitz, Chairman of the 
Millie ation committee, and Paul E. Eiserer and 
for th Almy, members of the dissertation committee, 

2,6 invaluable assistance and guidance. 
Ow at the Ridgewood Public Schools, 


» New Jersey. 


Woo Ridge- 


ration of an 


tested, first under non- 


interfering test conditions. The 2 


avental socioeconomic level. 


tion; (%) each qualitative error in either the 
5 or the 6 year maze. 

Porteus (1942, p. 20; 1945, p. 101) found 
that the Q score significantly differentiated 
delinquents from nondelinquents. His results 
were confirmed in other studies by Wright 
(1944), and by Docter and Winder (1954). 
Thus, the Q score appears to be consistently 
successful in distinguishing between delin- 
quent and nondelinquent samples. 

Although Porteus developed the Q score 
with special reference to the behavior of delin- 
quents, clinical observations suggested that an 
evaluation of qualitative aspects of perform- 
ance on the Porteus Mazes may be a par- 
ticularly sensitive measure of anxiety. For 
example, a person who is seen as anxious 
from a Clinical point of view sometimes shows 
considerable muscular tension, and in extreme 
cases, observable muscle tremors. Thus, while 
the anxious person may successfully traverse 
the maze, smooth motor coordination during 
the performance may be disturbed by his 
muscular tension, and this disturbance would 
be manifested by wavy lines, cut corners, or 
crossed lines, the kinds of errors or behavior 
characteristics considered in Porteus’ quali- 
tative scoring. Another observation some- 
ical administration of the 


times noted in clini 
maze tests is the anxious person’s perception 
” That is, if a 


of the maze as a “set of traps. 
wrong turn is taken during the maze tracing, 
the result is entrance into a blind alley; the 


subject is “caught,” or “trapped,” and the 
J 
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anxious person’s tension tends to be increased. 
Thus, he may become impulsive, try to escape 
from the situation in one way or another, and 
in his impulsive efforts to escape, make errors 
in the relatively simpler mazes, or at the 
beginning or near the end of more ‘complex 
mazes. 

This study was designed to test the gen- 
eral hypothesis that there is a positive rela- 
tionship between anxiety and qualitative as- 
pects of performance on the Porteus Maze 
test. Anxiety was considered both in terms 
of the relatively long-term, chronic anxiety 
represented by neurosis, and temporary or 
acute stress induced by test interference dur- 
ing the experimental procedure. It was as- 
sumed that the effects of both kinds of tension 
would be manifested in behaviors measured 
by the Q score procedure. s 

Though there is disagreement regarding 
the etiology of neurosis, there is consensus 
that the neurotic is anxious and fearful (Dol- 
lard & Miller, 1950, p. 222; Freud, 1933, pp. 
114-115; Hoch, 1950, p. 106; Horney, 1937, 
p. 23; Maier, 1949, p. 1950; Masserman, 
1943, pp. 67-68; Mowrer, 1950, p. 537; Sul- 
livan, 1953, pp. 313-316). In the present 
study, a sample of adolescents who had been 
diagnosed as neurotic was compared to a 
group of presumably normal adolescents, and 
it was hypothesized that the Q score (the 
number of qualitative errors) of the neurotic 
adolescent is higher than the Q score of the 
normal adolescent. It is generally agreed that 
under interfering test conditions or conditions 
of stress, the neurotic reacts less adequately 
and with more anxiety to the test situation 
than the normal (Mandler & Sarason, 1952; 
Sarason, Mandler, & Craighill, 1952). In this 
study, test interference, introduced in the 
Second presentation of the mazes to both 
groups, was comprised of ins 


tructions sug- 
gesting subject 


inadequacy and test pro- 
cedures involving distraction, conflicting buz- 
zer stimuli and time Pressure, factors ac- 
knowledged to be stressful (Lazarus, Deese, & 
Osler, 1952). It was hypothesized that under 
interfering test conditions designed to in- 
crease tension or anxiety, the Q scores of 
both normal and neurotic adolescents are 
higher than under noninterfering test condi- 
tions. Furthermore, it was hypothesized that 
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under conditions of test interference designed 
to increase tension or anxiety, the nae 
Q score for the neurotic adolescent is ee 
than the increase in Q score for the nor 
adolescent. 
In summary, the following hypotheses A 
tested: (a) The Q score of the wage ri 
lescent is higher than the Q score both 
normal adolescent. (b) The Q scores of ! her 
normal and neurotic adolescents are a z 
under interfering than under noninter ee 
test conditions. (c) The difference in Q p. 
between interfering test conditions and A 
interfering test conditions is greater a al 
neurotic adolescent than for the no 
adolescent. 


METHOD 


„scent DOY 
The subjects in the study were 30 adolesce 


r 
from the Ridgewood Public Schools, the n0 
group, and 30 adolescent boys from The rs 
Schools, the neurotic group. The procedure a 
two presentations of the mazes to the eae 
lescent and the neurotic adolescent, first, S 
interfering test conditions and, second, un 
fering test conditions. indivi 

The normal subject was defined as an Ler y 
in “good standing” in the Ridgewood Pubie to the 
that is, a person who had not been referre or ac% 
school psychologist for behavior problems scho! 
demic difficulties. The individual’s Co Va ip et 
record did not present adverse teacher comm ic PO 
did it present extreme variability in academ 
formance from year to year. r dividu 

The neurotic subject was defined as an in 5, 
who was not brain 
chotic, 


D evercu* 
volvi 
| ado- 

non” 
inter 


qual 


damaged, mentally ent ance 
or delinquent. The indiyidual’s per it ne 
in school and behavior in society had made } 
sary for him 
society, and 
Schools, whi 


4 
equa’ gpi 

0)» . 
age, intelligence (Porteus, 1950, pp- 151-160) pe 
parent socioeconomic level (Edwards, 1938) P clus d 
The subjects w] A 
in the normal and the neurotic groups Tether 
to participate in a study to determine erences 
mazes were sensitive to individual dis ate i 
the subjects who were asked to particip? 
study agreed to do so. 


* ns 
5 uctĖtO 
Experimental Procedure and Inst! 


me 

ions 
ation’ ur 

The procedure involved two present: e ne 


mazes to the normal adolescent and 


th 
5 
mal 
ved 
i 
js 
hi Ü 


| 


| 
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sblescent, first, under noninterfering test conditions 
at second, under interfering test conditions. 
ig oaitieets were seen individually and introduced 
ae oe procedure with the following state- 
7a want to find out how well you can find 
in all ay ARONEN these mazes. There are 10 mazes 
a, he one somewhat more difficult than the 
ikte ne: Here is the first maze.” The standard 
with ter ee was then followed, beginning 
lottas goustructens. to the Year Five maze 
Was nies 950, pp. 155-157). However, each maze 
Bean to the subject until it was completed 
ceeded re even though the number of trials ex- 
e tim e limit set in the standard procedure. 
oF ae ae for the completion of each maze 
eset = i The first presentation of the mazes, 
a ed, is the noninterference presentation. 
reide conclusion of the testing, the examiner 
© the pas sa to score the mazes, indicating 
as Well Pig in the process that he had not done 
Was told he might have done. The subject then 
by the examiner: 


Wi 


Pro 


R PE" 

Gin fo did fairly well that time, but perhaps you 
Ae abl tter this time. This time the faster you 
e to go through the maze, the higher your 

Pe will be, is 3 S y 
es der to help you go at a faster rate, I will 
toa pl e buzzer once, like this, whenever you come 
Place in the maze where you might go wrong. 


tenti: F 
hie to the buzzer should help you to avoid 


Also, 


Points” in order to keep you from losing time 


fea will sound the buzzer twice, like this, 
through that you are taking too much time going 
are e the maze, If more than two time warnings 
ecessary, your paper will be taken from you 
Ga will receive a score of zero. 
s luck, If you try hard, you should be able 
Prove your score. 


Th 
buzzer buzzer pattern for each maze was double 
The p Single buzzer, single buzzer, double buzzer. 


doube e single buzzer stimuli followed the first 
resentation stimulus, and were completed before 
Timing HO" of the second double-buzzer stimulus. 
Present for the buzzer presentation began with the 
Ceede, ation of the maze to the subject and pro- 
according to the following schedule: 
1, 
doy mazes Year 5 through Year 8, the first 
doug, buzzer was sounded at 3 seconds, the second 
i à buzzer at 6 seconds. 
do ble f, mazes Year 9 through Year 10, the first 
doube puz2er was sounded at 3 seconds, the second 


i © buzzer at 10 seconds. 
Touble p mazes Year 2 through Adult I, the first 
double Bee was sounded at 3 seconds, the second 


Puzzer at 15 seconds. 


hi 
Presong azes were rotated 180 degrees on the second 
to red ation, This procedure was followed in order 
the uce the facilitating effects of practice during 


r z 5 
St presentation of the mazes. The time required 


TABLE 1 


CHARACTERISTICS OF NORMAL AND 
Neurotic Groups 


Characteristics Normal Neurotic 

Mean age 16.42 16.06 
Standard deviation of age iJ 1.74 
Mean intelligence quotient 99.60 96.73 
Standard deviation of 

intelligence quotient 11.96 7.33 
Socioeconomic level of parents 

Professional 8 8 

Managerial 18 18 

Skilled workers and foremen 3 2 

Clerks and kindred workers 1 1 

Semiskilled workers 0 1 


for the completion of each maze was recorded. The 
second presentation of the mazes, as described, is 
the interference presentation. 

The mazes were presented to each subject twice: 
first under noninterfering conditions, and then under 
interfering test conditions. The subject’s performance 
under the two conditions was compared. 


Modifications in Q Score 


Two modifications of the errors contributing to Q 
score were made for this study in order to control 
the number of mazes used by cach subject and 
two modifications of Q score were made as a result 
of clinical observation of the maze performance of 
normal and neurotic adolescents. 

The number of mazes used influences Q score 
since the greater the number of mazes used, the 
greater the probability of Q errors. In order to 
compare the @ performance of normal and neurotic 
adolescents without the effects of the variable, 
Number-of-Mazes-Used, the 10 completed mazes 
were scored in determining Q score rather than 
both completed and incompleted mazes. 

The use of the 10 completed mazes for Q scoring 
made it necessary to exclude first third and last 
third errors from the Q score, since these errors only 
occur on incompleted mazes, 

Previous clinical observation revealed that a break- 
down of motor coordination, particularly under test 
interference or stress, was a characteristic of the 
maze performance of the anxious person. The break- 
down of coordination was reflected in the gross 
error: the cut corner with white space showing, 
and the broken crossed line. The gross error was, 
therefore, included in the Q score used in this study. 

In summary, the errors comprising Q score for 
this study were modified by eliminating the first 
third and last third error, and further modified by 
adding the gross error: cut corner with white space 
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TABLE 2 


Q SCORE UNDER INTERFERENCE AND NONINTERFERENCE 


ANALYSIS OF VARIANCE OF NORMAL AND NEUROTIC ADOLESCENT 
I T.99 
ee 


' Degrees 
Source of Sum of of Mean 
variation squares freedom square 
Between 
Normal and 
Neurotic 250.56 1 250.56 79.98** 11.97 
Between 
Interference 
and Noninter- 
ference 48.22 2 24.11 6.83* 4.98 7.76 
Individuals 
within groups 549.41 58 9.47 2.68* 1.79 2.09 
Interaction 204.61 58 3.53 6.78** 3.41 
Note The Q score is the sum of weighted errors on the 10 completed mazes divided by 10. 
> < 001, 
showing (two points) and broken crossed line (four RESULTS 
points). The Q score used in this study is the sum of ee Q 
the weighted error points on the 10 completed mazes The first hypothesis stated that the 


divided by 10. Two independent judges correlated 
.98 for reliability, scoring 120 mazes representing each 
maze age level, each subject, and each test condition. 


TABLE 3 


FREQUENCY DISTRIBUTION oF NORMAL AND Neurotic 
ADOLESCENT Q Scores UNDER I 
NONINTERFERENCE 


Normal Neurotic 
non- 
inter- 

ference 


non- 
inter- 
ference 


Normal Neurotic 
: inter- inter- 
Scores ference ference 


0= 9 
1.0- 1.9 
2.0- 2.9 
3.0- 3.9 
4.0- 4.9 
5.0- 5.9 
6.0- 6.9 
7.0- .79 
8.0- 8.9 
9.0- 9.9 
10.0-10.9 
11.0-11.9 
12.0-12.9 
13.0-13.9 
14.0-14.9 
22.0-22.9 


NOUCoNFEN 
me NWA DANW 


a oe eNe 
SOME RWOWe 


Note.—The Q score is the sum of wi 


aR eighted er: 
completed mazes divided by 10, Tors on the 10 


her tha? 


score of the neurotic adolescent is hig ple 1 
the Q score of the normal adolescent. $4 ores 
presents the analysis of variance of jects 
by normal and neurotic adolescent SUP) 


under test conditions of interference 2° pe 
interference. As indicated in Table “> qo} 
hypothesis was supported beyond ihe * 
level. pe 0 
The second hypothesis stated that rents 
scores of both normal and neurotic ado ations ‘ 
are higher under interfering test © as 
than under noninterfering test condit 
indicated in Table 2, this hypothes!® 
supported beyond the .01 level. Ke dif 
The third hypothesis stated that a test 
ference in Q scores between interfer! jon 
conditions and noninterfering test neo fo! 
is greater for the neurotic adolescent me 
the normal adolescent. As indicated |; th? 


significance of the interaction tern rable 


: Ss. + x 
jon wa? 


analysis of variance summarized 1” the o0! 
the hypothesis was supported beyo” 
level. jstrib 


An examination of the frequency oa od 
tion of Q scores by normal and ne ene r 
jects under test conditions of interfer" sy 
noninterference presented in Table rotics 
relatively small overlap betwee? men nd Í” 
normals under both noninterfering * 
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fering test conditions. For example, under 
noninterfering test conditions, 23 normal ado- 
lescents obtained a Q score of 3.9 or less while 
only six neurotic adolescents did so. Under 
interfering test conditions, 22 normal adoles- 
Cents obtained a score of 4.9 or less while 10 
neurotic adolescents scored 4.9 or less. Thus, 
rs some overlap existed, Q score clearly 
ifferentiated normal from neurotic maze 
etmang under both noninterfering and 
hoes ering test conditions, the neurotic adoles- 
ñ committing more Q errors than the 
ay adolescent. Additional findings of this 
y were: 


brace neurotic adolescent spent less time 
cent ae the mazes than the normal adoles- 
ime 50, Q score tended to increase as 
iz equired for maze completion decreased; 
Maz, It seems that the use of less time for 
onset em by the neurotic adolescent 
oti uted to the higher Q error of the neu- 

€ adolescent. 
neg Q error performance of the neurotic 
Siena was more variable than the Q 

t ee the normal adolescent under 
ions i and noninterfering test condi- 
Neuroti lso, the Q error performance of the 
Was men adolescent under test interference 
the Senn variable than the Q performance of 
i a iiit adolescent under noninterfering 
existe eee while homogeneity of variance 
two tst or the normal adolescent under the 
ance conditions. (In order to make allow- 
t or the heterogeneity of variance on 
orm of the F distribution [Lindquist, 
»D. 83], the level of significance for this 
Os dp set at the .01 rather than the 


M = Q error categories which contributed 
he oie | to the greater number of 
der i errors, the greater number of errors 
ease E interference, and the greater in- 
Ace y n neurotic errors under test interfer- 
line er ere the cut corner error, the crossed 
in Baa and the gross error. The greatest 
dep >C in the number of neurotic errors un- 
trop i interference occurred in the gross 
„Category. 

ang The first third error, the last third error 
? Particularly, the perseverative error in- 


SA7 


creased under conditions of neuroticism and 
test interference, thereby contributing to the 
increase in the number of mazes used under 
conditions of neuroticism and test interfer- 
ence. The power of the perseverative error 
was illustrated by an increase of 7.1 times in 
the neurotic mean perseverative error under 
conditions of test interference. 


DISCUSSION 


This research began with the clinical ob- 
servation that the performance of anxious 
persons differed from that of nonanxious 
persons in certain aspects of behavior during 
administration of the Porteus Maze test. The 
apparent differences were usually not reflected 
in the quantitative score but, rather, in the 
kinds of performance characteristics described 
by Porteus in his development of the Q 
score. Thus, this study was designed to test 
the validity of the clinical observation that 
Q score provides a sensitive and reliable 
measure of anxiety. The results of the re- 
search clearly support the initial clinical ob- 
servations. The results also are the first 
empirical demonstration of an existing rela- 
tionship between anxiety and Q score. 

Studies concerning muscle tension, motor 
coordination, and control functions provide 
a framework within which the increase in Q 
errors under conditions of neuroticism and 
test interference can be understood. There is 
considerable evidence that muscle tension is 
characteristically associated with anxiety 
(Jost, 1941; Jurko, Hill, & Jost, 1952; 
Malmo, Shagass, & Davis, 1950), and that 
this tension interferes with relatively fine 
motor coordination (Arnold, 1942; Malmo, 
1950, pp. 169-180). This study found that 
the greatest increase in qualitative errors un- 
der conditions of neuroticism and test inter- 
ference occurred in the cut corner, crossed 
line, and gross error categories, categories 
particularly susceptible to poor motor coor- 
dination. Several studies also suggest that con- 
trol of psychomotor functions tends to 
deteriorate under conditions of neuroticism, 
anxiety, and stress (Jost, 1941; King, 1954, 
p. 144; Malmo & Shagass, 1949). Thus, 
past and present research tends to support an 
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interpretation of the results of this study 
(more Q errors under neuroticism and stress) 
in terms of increased tension, disrupted mus- 
cular coordination in relatively fine motor 
tasks, and decreased control of motor impulses 
as a function of stress and anxiety. 

Previous research suggests that variability 
of performance (Eysenck, 1944; Jost, 1941; 
Jurko, Hill, & Jost, 1952; Malmo & Shagass, 
1949) and perseverative responses (Symonds, 
1958, p. 15; Maier, 1949) are characteristic 
of neurotic behavior and neurotic performance 
under stress. This experiment demonstrated 
that Q score, in addition to clearly differen- 
tiating normal adolescent from neurotic ado- 
lescent maze performance under both nonin- 
terfering and interfering test conditions, also 
revealed performance characteristic of neu- 
roticism, stress, and anxiety; namely, vari- 
ability and perseveration of responses. These 
findings seem to indicate that use of the Q 
error as defined in this study under both 
noninterfering and interfering test conditions 
provides a unique method for the study of 
performance under varying conditions of 
anxiety. Furthermore, the results of the study 
seem to indicate that the Q error is a sensi- 
tive and reliable measure of anxiety, and may 


be used as a measure of neuroticism and per- 
formance under stress. 


SUMMARY 


The results of this experiment are the first 
empirical demonstration of an existing rela- 
tionship between anxiety and Q score. Specif- 
ically, this research tested the following hy- 
potheses: (a) the Q score of neurotic adoles- 
cents is higher than the Q score of normal 
adolescents (p < :001); (b) the O scores of 
both normal and neurotic adolescents are 
higher under interfering than under noninter- 
fering test conditions (p < :01); and (c) the 
difference in Q scores between interfering and 
noninterfering test conditions is greater for 
neurotic adolescents than for normal adoles- 
cents (p < .001). 
For the experiment, 30 neurotic and 30 
normal adolescents were tested, first under 
noninterfering test conditions and then under 
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is s 
interfering test conditions. The two go 
were equated for age, intelligence, 
parental socioeconomic level. 
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TWO MEASURES OF ANXIETY: A VALIDATION? 


WILLIAM D. WINTER, ANTONIO J. FERREIRA, anv ROBERT RANSOM 
San Jose State College 


19 undergraduates were subjected to 2 conditions of low anxiety, 2 of experi- 
mentally produced high anxiety, and 2 of examination anxiety. During cach 
situation, the Ss were administered the Affect Adjective Check List (AACL) 


and the Palmar Sweat Index (PSI), 
findings were: (a) both the AACL and PSI 


and later the Taylor MA scale. The 


validly reflected anxiety; (b) 


both yielded stable scores characteristic of the S; (c) PSI did not cor- 
relate with AACL; (d) MA scale correlated with AACL, but not with PSI; 
(e) PSI and AACL responded differently to examination anxiety; (f) on the 
PSI, certain fingers seemed to sweat more than others. These findings indicate 


a lack of congruence between these verb 


Anxiety has been a core concept of every 
dynamic theory of behavior, and the measure- 
ment of anxiety has become a major task of 
scientific psychology. There have been two 
popular approaches to measuring anxiety: 
one involves the assessment of an individual’s 
physiological functioning, or changes therein 
under stress; another, his verbalized self- 
description in interviews or written question- 
naires. A good deal of effort has gone into 
developing adequate measuring instruments 
in these areas and into analyzing the theoret- 
ical meaning of their complex interrelation- 
ships (Martin, 1961; Sarason, 1960). One 
simple technique used to obtain a physiolog- 
ical correlate of anxiety is the Palmar Sweat 
Index (PSI), which has given promising 
results in a number of studies (Mowrer, 
1953). Among the verbalized self-description 
techniques, the Taylor (1953) Manifest 
Anxiety (MA) scale, the semantic differential, 
and adjective check lists have been very 
popular. Recently, Zuckerman (1960) de- 
veloped an Affect Adjective Check List 
(AACL) that permits a quick self-descrip- 
tion of degree of anxiety felt. He presented 
evidence of the validity of this technique, 
stating, for example, that the anxiety meas- 
ured “today” on the AACL of a group of 
college students increased on the day of an 
examination, and that the increase was re- 
lated to fear of failing the examination 
(Zuckerman, 1962). 


1 This study was partially supported by National 
Institute of Mental Health grant MH 06560-01, The 
authors wish to thank Charles E. Smith for his 
invaluable technical assistance on this project, 


al and physiological anxiety measures. 


The general purposes of the present 
were to determine whether these two we 
ent measures of anxiety, the PSI and AA na 
reflect experimentally induced anxiety, aen 
to what extent they overlap in what they & 
measuring. 


METHOD 
Materials and Subjects 


s 
The procedure used for obtaining the PSI t 
based on Mowrer (1953) and on our own ters 
vestigations of this technique (Ferreira and Wie 
1963). Briefly, cach S received a 3 X 5 inch Pio 
of Whatman #1 filter paper impregnated with ne 
tannic acid, and a bottle with a dropper cone he 
ing a solution of ferric chloride in acetone- ver 
Preparation of the chemicals followed the Mow jn 
description. The Ss were instructed as a STOUR ge 
the use of these materials, with suitable demon cit 
tions, They painted each of the four fingers of ride 
own left hands with a drop of the ferric ale 
solution and spread it over the finger tips wit® 
aid of the dropper. They waited 30 seconds 


. inte! 
the solution to dry and then placed their Pa” 


tw? 


For each finger print, the area of maximum prec? , 
tation (caused by sweating) was measure vre! 
specially built densitometer following onat 
design. The S’s PSI score for each experi ings 
condition was the sum of the densitometer Te% 
for his four finger prints. in the 
The AACL was administered and scored 1960): 
standard manner described by Zuckerman ctives 
except that the S was asked to check the å : This 
which described how he felt “right now opriate 
alteration of the instructions seemed APP cponse 
since we were primarily interested in the Sia pach 
to immediate, experimentally induced ansie"? 
S was also given the Taylor MA scale. ra wales 
The Ss in this experiment were 19 under ted in * 
(almost all sophomores and juniors) ent? 
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course dealing with elementary research methods in 
Psychology. The Ss appeared well motivated and 
cooperative. There were 13 boys and 6 girls, all 
but one in their early twenties. 


Design and Procedures 


The Ss were exposed to six experimental conditions 
designed to differ in the amount and content of 
Provoked anxiety. These were not meant to be 
equidistant along a scale of predetermined anxiety. 
The six conditions consisted of two conditions of 
low anxiety, two of examination anxiety, and two 
ol experimentally provoked high anxiety. The six 
experimental conditions were given 3 to 14 days 
ig Two weeks after the last session, the Ss 
äl the Taylor MA scale. Except for an occasional 
ti sentee, all Ss were tested in a group at the same 

me of day by the same Es. 

Briefly described, the six experimental conditions 
Were: 
tua High Anxiety (Instructions). The Ss were 
ng og in the PSI technique in a very formal, 
t ae ing manner by an E who was a stranger to 
are real danger inherent in spilling the ferric 
and ik solutions on thcir clothing was stressed, 
wa he PSI was described as a test to see how tense 
fells (eon they were. This approach worked quite 
peni he Ss seemed very sober and worried while 
Sr g to manipulate the PSI materials and were 

cerned whether their own prints were too dark. 
wane Anxiety (Neutral). E was friendly, con- 
n ated the class on handling the materials with 
ip accidents in the previous session. The actual 

Pose of the study was explained and stress was 
aired on our interest in the effects of the experi- 
ži ale Procedures on the PSIs of the group as a 
not nh stated that the individual PSI of a S did 
ing ef ect his personality, and that we were repeat- 
instru, PSI now to check on the reliability of the 
sa ent and its correlation with A CL. The 
taae Arel more relaxed and asked several intel- 

questions about the project. 
admite iation Day. The PSI and AACL were 
sched Stered just prior to the first regularly 
re class examination. 
severaj et, Aniety (Lobotomy film). The E made 
sults of rua remarks about the unpredictable re- 
obotomy and the dangers of its unrestrained 


Use, 

shown = a film, “Prefrontal Lobotomy,” was 
y t th i v 7 og: 

Sever th e point when the surgeon began to 


outs S neural fibers and the squirming and groans 
Stopped soe seemed to reach a peak, the film was 
and our measurements taken. 
- Low Anxiety (Pleasure film). The Ss were 
to relax, that they had a treat in store for 
for being such good Ss. They were shown 
i Protis pantomime film, “The Chairy Tale,” 
a “a gee a good deal of tension reduction at 
Were A the film, just before the PSI and AACL 
ase administered. Again, audience response was 
pected, 


told 
them 


u; 
Which 


an 
N 
= 


TABLE 1 


Suus OF RANKS AND SIGNIFICANCE LEVELS 
FOR FRIEDMAN TESTS 


Anxiety conditions 


High Exam Low x? 
I Y tt Vi II y 
PSI 39.5 45 84 65 735 90.5 31.2* 
AACL 81.5 38 36 43- 93.5 87 41,7* 
*p <.001. 


VI. Examination Day. The second regularly sched- 
uled classroom examination. 

For the two conditions involving the use of films, 
the fingers were painted before the films were begun 
and the prints taken when the film was stopped, 
thus avoiding a time lapse that might have per- 
mitted the emotional state to dissipate while the S 
was manipulating the PSI materials. 

The basic hypothesis tested was that the S's 
scores on the PSI and AACL would vary system- 
atically from condition to condition. We also tested 
(a) the correlations between PSI, AACL, and MA 
scale; (b) the consistency of the scores of a given 
S from condition to condition; (c) the relationship 
of our examination situations to the other anxiety 
conditions; (d) and whether the four fingers of our 
Ss were equally sweaty. 


RESULTS 


The PSI and AACL scores of each S on 
the six experimental conditions were ranked 
in terms of the degree of anxiety shown, with 
the rank “one” representing the greatest 
anxiety. Friedman two-way analyses of vari- 
ance (Siegel, 1956) were computed and the 
results were very significant (Table I), in- 
dicating that for both PSI and AACL, the Ss’ 
responses were influenced by the differential 
effects of the six anxiety conditions. 

The Ss were rank ordered within each 
condition according to their PSI and also 
their AACL scores. Kendall’s Ws for the PSI 
rankings on the six conditions (W = .682, 
p< .001) and for the AACL rankings (W 
= 319, p< .02) indicated that each S 
tended to keep his same relative position in 
the group in all six conditions. In other words, 
both measures reliably differentiated the Ss, 
inasmuch as the changes in the S’s scores 
tended to be of the same magnitude across 
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conditions, so that his ranking relative to the 
other Ss remained the same. Although both 
Ws were statistically significant, it should be 
noted that the PSI revealed a greater “con- 
cordance,” or agreement from condition to 
condition, than did the AACL. 

The two measures did not correlate sig- 
nificantly with each other within any single 
condition or across all conditions (r,= +.03). 
One factor involved in this lack of correla- 
tion can be discerned by referring to Figure 1, 
which presents the median values on each of 
our tests for the combined conditions of high 
(I plus IV), examination (III plus VI), 
and low (II plus V), anxiety. It will be seen 
from this figure that on the AACL the Ss 
treated the examination days as similar to 
the high anxiety conditions, and both of 
these as significantly different from the low 
anxiety conditions. However, the examination 
day scores on the PSI approximated those 
of the low anxiety conditions, and both 
differed significantly from the high anxiety 
situations. All four of these significant dif- 
ferences were at the p < .001 level by the 
sign test, two-tailed. Apparently, for our 
sample, the verbal and physiological measures 
of anxiety responded differently to the tension 
produced by classroom examinations. 

We also found that the combined AACL 
scores of our Ss for all six conditions cor- 
related significantly with their scores on the 
Taylor MA scale (r, = + .44, p < .05, one- 
tailed). This had been predicted on the basis 
of Zuckerman’s findings (Zuckerman, 1962) 


HIGH 


EXAM Low 


FIGI. MEDIAN VALUES OF PSI AND AACL FOR THE 
HIGH, EXAMINATION, AND LOW ANXIETY CON- 
DITIONS. (THE DIRECTION OF PSI SCORES is 
REVERSED SO THAT GOTH ORDINATES MEASURE 
INCREASING ANXIETY. ) 


Wiriiam D. Winter, Antonio J. FERREIRA, AND ROBERT RANSOM 


There was no correlation between the PSI 
and MA scale. No significant sex poe 
was found on either measure, but the ama 
number of female Ss in the sample ies 
cludes adequate statistical analysis of this 
variable. 

Two other interesting findings on the r 
should be mentioned. In a previous a 
investigating the methodological problema 
the PSI technique (Ferreira & Winter, 19 mat 
we found that the Ss’ fingers did not E 
equally but that some fingers produced dar F 
prints than others. To test the generality is 
this finding with our present sample, the P 2 
for the four fingers of every S were rank 0 
dered in all six conditions and a W compute 
for each S. The sum of the chi square oe 
for all Ss combined was very seni 
(x? = 116.89, dj = 57, p < 001), indica in 
that, as in the previous study, the Ss’ sW° a 
production varied from finger to finger tle 
way perhaps consistent for, and characters 
of, the given individual. , for 

The sums of the PSI scores obtained re 
each finger on all conditions combined “The 
ranked, and a W computed for all $8- i 
resulting rank ordering was significantly a 
ferent from chance (W =.19, p a4 or 
showing that, for our sample, the thir ost 
ring finger of the left hand was the mine 
sweaty and the fourth and index fingers 
least sweaty (only left hands tested). 


Discussion 


It seems from our results that both 
PSI and the AACL reflect the different 
perimental conditions of anxiety, ap. an 
both can be considered simple, practica’ oth 
valid measures of anxiety. In addition, este 
yielded stable scores characteristic of n tbe 
However, the lack of correlation ae to 
PSI and AACL, their different see or 
examination anxiety, and their cee est 
relations with the Taylor J374 scale S aring 
that these two instruments may be me® sara- 
different phenomena (Rosenstein, 196 pgrue” : 
son, 1960), This interpretation is yo and 
with the results of McGuigan, an co 
Richardson (1959), who also foun the pol. 
relation between the M/A scale and t np i 

We are still faced with the ™ 


the 
ex- 
that 

d 
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question of just what these various instru- 
ments actually do measure and how they 
relate to one another. We have some specula- 
tions in the literature, but little definitive 
research. One theoretical possibility, of course, 
is that anxiety is not a unitary concept. 
Eysenck (1961), later supported by Buss 
(1962), has pointed out two factors in 
anxiety: one, a somatic factor of autonomic 
reactivity, and the other, a psychic or idea- 
tional factor. The PSI probably represents 
One measure of Eysenck’s somatic factor. The 
AACL score probably measures the psychic 
factor of anxiety, but not directly, since it 
also reflects whether the S is willing and able 
to communicate the ideational content of his 
anxiety in an undistorted fashion. Even if the 
PSI represented pure somatic anxiety and the 
AACL pure psychic anxiety, we would gen- 
erally expect them to covary (as in our high 
Versus low anxiety experimental situations), 
but on some occasions to be uncorrelated or 
even negatively correlated (e.g., our examina- 
tion days). 

As stated above, part of the discrepancy 
between PSI and AACL is probably due to 
the communication variable on the AACL, 
e., to the S’s ability and willingness to 
admit his anxiety to himself or others. For 
example, to the S it might be socially ac- 
ceptable, or even expected, to admit pub- 
lically on the AACL that he is anxious on an 
examination day. But in Condition I, when 
St S may be privately worried about what 
Ms PSI might indicate about his personality, 
the typical AACL response may be a bland, 
efensive denial of anxiety, even if his PSI is 
elevated. Although there appears to be some 
truth to this approach, it does not explain 
why the PSI showed little anxiety on ex- 
amination days. This particular finding would 
seeni to contradict the results of other in- 
po tigators (Beam, 1955; Davis, 1957) who 
Ound that the PSI increased just before an 
examination, This inconsistency may be due 
a situational variables such as how the ex- 
*minations were viewed, the severity of test 
anxiety, etc. Confirmation of this possibility 
Cannot come from our data, but it is of in- 
terest that when our Ss were shown these 
Stoup results after the study was completed 


and asked to discuss them, many stated that 
they had not been really very worried about 
the examinations which came so early in the 
course with an “easy-going instructor” (sic). 

Some writers like McGuigan, et al. (1959), 
have suggested a somewhat different explana- 
tion for the lack of correlation between verbal 
and physiological anxiety measures, specif- 
ically the PSI and WA scale. They state that 
the PSI is a better measure of situational, 
temporary anxiety, while a questionnaire 
like the MA scale is a better measure of 
anxiety as a stable characteristic of person- 
ality. Presumably, the MA scale reflects the 
verbalized self-concept, which may be more 
permanent than the physiological imbalance 
reflected in the PSI. Again, there is probably 
some truth in this idea, but it should be 
remembered that in our study the PSI scores 
for a given S were generally stable and 
characteristic of him in all conditions. Also, 
our AACL instructions stressed how the S 
felt right at the moment of testing, rather 
than how he generally feels, which is the 
usual set for the MA scale. Therefore, the 
stability of our PSIs and the “immediacy” 
of our AACL scores would seem to make the 
explanation of McGuigan et al., inappropriate 
in this instance. In fact, we can only conclude 
our discussion of the lack of correlation be- 
tween the AACL and PSI by stating that 
no simple, adequate explanation presently 
exists for the complex interrelationship of 
these two anxiety measures. 

The remaining finding, that for every S 
some fingers are characteristically sweatier 
than others, agrees with our previous results 
with a different sample (Ferreira & Winter, 
1963). This has not been noted previously 
in the PSI literature, and it opens some im- 
portant questions that may well justify the 
efforts of future investigators. 
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THE RORSCHACH BARRIER SCORE AND 
SOCIAL BEHAVIOR * 


JOHN RAMER? 


University of Washington 


96 undergraduate females placed in 3 experimental situations were assessed 
with regard to quantity and quality of communication each sent. The situa- 
tions were those of no feedback, negative feedback, and positive feedback. 
Their communication behavior was related to a Rorschach content score, the 
barrier score. It was found that high barrier scores would attempt more 
communication, assert themselves more, tend to be less self-depreciating, and 
tend to express less discomfort in a situation increased in degree of threatening 


quality. 


Although many studies have attempted to 
telate Rorschach scores to other test scores 
and other kinds of behavior, none has enjoyed 
the success of the barrier score. The relation- 
Ships obtained have been many and various 
(Cleveland, 1957, 1960; Davis, 1960; Fisher 

Cleveland, 1956a, 1956b, 1957, 1958a, 
1958b; Fisher & Fisher, 1959). This score, 
the barrier percept, refers to all percepts on 
the Rorschach test having to do with the 
assignment of definite structure, definite sub- 
Stance, and definite surface qualities to the 
bounding peripheries. The authors of the bar- 
Mer score, Fisher and Cleveland, conceived of 
ìt as a projection of the person’s bodily 
organization and that the individual per- 
Celves his body exterior as having (or lacking) 
Protective characteristics. Controlled experi- 
mental studies relating barrier score and social 

ehavior have not been available. 

Viewing the barrier score as an unconscious 
Projection of a person’s definitive existence 
and his felt ability to adequately cope with 
the demands of reality, a communication task 
Was created to test three specific hypotheses. 

hey were : (a) that high barrier persons 
Would show a greater frequency of com- 
Munication attempts in an interpersonal situa- 
tion than the low barrier persons; (b) that 
* This article is based on a dissertation submitted 
ie Ge of the Department of Psychology of 
Of the ice enc tn in partial Hane 
wishes to eres i or tl E T aior 
Stotland Se is ny pele Wy. Bak ne thar 
guidance ata È ur Fisher, and A. D. Davis tor the: 

upport. 


Sk Soy with the Department of Psychology, West- 
tate Hospital, Fort Steilacoom, Washington. 


to 


a content analysis of the communication oc- 
curring in an interpersonal situation would 
show high barrier persons manifesting more 
committal and directive statements, more 
disagreement, less passive-acceptance state- 
ments and fewer self-depreciating statements; 
and (c) that qualitative differences would 
obtain greater magnitudes in a frustrating 
and/or threatening situation as opposed to 
a neutral situation. 


METHOD 
Subjects 


The final subject population for this study was 
96 females acquired from Seattle University, a co- 
educational Catholic school, and the University of 
Washington, a large state school. Forty-eight sub- 
jects were drawn from each school and all except 
five of the total subjects were gotten from women’s 
living groups. The one group was acquired on a 
completely random basis with the help of the 
Dean of Women. The other group was gotten on a 
completely volunteer sign up basis using sheets 
posted on the dorm bulletin boards. Only freshmen 
and sophomore girls were used. 


Task and Apparatus 


The communication task demanded discussion of 
written stories with an assumed partner via slips of 
paper that the experimenter was supposed to deliver. 
For this task four subjects at one time were 
individually seated at card tables, the four of which 
were arranged as along sides of a rectangle, with the 
experimenter in the middle. The upper halves of 
their bodies were surrounded by three-sided screens 
so as to completely block each subject's view of 
the other three subjects. Before each of them was a 
set of three pictures, two pencils and a large stack 


8 Five subjects were solicited from a psychology 
of adjustment class. 
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< slips of paper. The pictures were identical 
z regen ale Br Kurt Back in his classic study 
Oka amet consisted of standard pre- 
prepared messages. These messages: were basic to the 
controlled experimental situations. They were varied 
as to their threatening quality and each group of 
four subjects found themselves in one of three 
situations.* They might have received no replies. 
They might have received positive-neutral replies, 
ega “Id really like to know about your story, 
“Im sorry I’m not any more help than I am, but 
I really appreciate your messages, and I have a 
feeling my story agrees with yours, but Tm sure 
you’re a much better writer than I am.” Or thirdly, 
they might have received negative replies, eg. I 
hope we will have the same ideas for our Stories, 
“I have a feeling our stories are probably different,” 
and “Since I’ve had some experience writing this 
kind of story before, I’m really satisfied with what 
I've written.” All subjects received the same number 
of messages and at a similar time. 


Procedure and Instructions 


Subjects were run in groups of four. Each sub- 
ject upon arrival at the experimental room was 
seated at a card table behind a screen and was told 
the experiment would begin in a few minutes as 
soon as all four subjects were assembled. The groups 
of four were successfully chosen so as to eliminate 
complete familiarity with one another. When all 
were seated these directions were read: 


You are being asked to partake collectively in 
writing a group story, each group to consist of 
two persons. For this you have been given 
identical sets of three cards. This study is part 
of an effort to predict which groups of two, out of 
the 48 groups I will eventually test, can ac- 
complish the best results—please do your best, 

First, each of you has been given the same set of 
Pictures to be used in the final study and each of 
you will be allowed 7 minutes to write a short 
but comprehensive story with these directions, 
Your task is to write a story from the set of 
three photographs which depict quite a com- 
monplace incident. This gives you an opportunity 
to give play to your imagination, although the 
story should be plausible and supported by 
features of the Pictures. The pictures themselves, 
being taken from a film strip, form a sequence 
which you will have to reconstruct. Then you 
will write a story connecting the Pictures. 
Remember—you should write a good story but it 
is important to make it plausible by use of the 
available clues. You should Pay particular at- 
tention to the setting of the story, the relationship 
of the two people involved and the plot of the 


4 These three situations, the zero replies, the posi- 
tive replies, and the negative replies, are hereafter 
referred to as the Z-Situation, the P-Situation, and 
the N-Situation, respectively, 
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h iteria wi r the evalua- 
story. These criteria will be used for the eval 
tion of the stories. You may begin! 


The subjects were told when 2 minutes senimi 
and when 1 minute remained so they could put pe 
ings on their stories. When the time had expired S 
experimenter collected both the written stories mai 
the sets of pictures. Then he introduced the nex 
part with these instructions. 


The next part will be a group discussion ah 
your partner. Since I could not hope to write ar is 
record as fast as you all talk, especially with ba 
groups at a time, this will take the form 0 F 
paper and pencil discussion. At my signal you aw 
initiate your story discussion with your partie’ 
whom I have randomly chosen, by sliding vires 
sages under your front board. You will notice k 
groove in the middle of the board. I will the 
carry your message to your partner. The slips © 
Paper in front of you are to be used for pea 
your messages, however, each slip of paper see 
contain only one sentence or complete pa 
and please put your identity number in the ae 
right corner. Of course you may talk about oe 
and all aspects of your story or simply 7 
talk. The partitions are used to conceal a 
identity of your Partner and to prevent you ne 
observing any facial grimaces, smiles, and Eee 
behavioral signs. Remember—how many nol h 
you send is entirely up to you. Some may v 
to send just one note, some a few notes and pr 
may send no notes at all, If you receive a ney 
from someone and don’t wish to respond 1 
you may answer with whatever you wish. ee 
messages may be in transit and not apparer B 
answered, or you may need to send several i 
sages before your partner sdes your point p 
If there are no further questions—you will ” ur 
have seven minutes for your discussion with Y° 
partner, You may begin ! 


When this was over the subjects were asked F 
write their stories a second time without the ° 
of the pictures and told they could make Use as 
their group discussions. This additional data js 
not pertinent to the present study, however, aM gs 
not presented. A short Postsession questionnaire " 
then filled out by each subject. It attempts 
ascertain the efficacy of the experimental techn ire 
The subjects were then told what had transp til 
and asked not to discuss the experiment 
further word from the experimenter. 


Barrier Score xi- 
n ro? 
Each subject was seen a second time ap? with 
mately 2 to 3 weeks later on a chance ed 
reference to groups and situations, and Sia vere 
à Broup Rorschach. The number of respo” 


P) at 
controlled. There was no formal inquiry P" 
these directions, 


c 
ise in Pere 
_ This can be considered an exercise 1 of slide 
tion. You are going to be shown a serie 
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each of which contains the picture of an inkblot. 
This is a modification of the inkblot test or the 
Rorschach as it is sometimes called. I would like 
you to look at each inkblot and write down the 
first three things it looks like or reminds you of, 
or makes you think of. For some inkblots I will 
ask for just the first two things you are reminded 
of. I am asking for just a set number of answers 
because I know from experience that people can 
sce these many things. However, people see a great 
many things in these inkblots and so there are no 
Correct answers. Please describe as adequately or 
as fully as you can your perceptions and/or 
what about the blots reminds you or make you 
think of what they do. If I go too fast or you 
have any questions, please raise your hand. Also 
let me know if it is impossible for you to see the 
Screen, 


eae response total was limited to 25 _Tesponses 
Grn upon the card pull values as indicated by 
epee study (Grayson, 1956). The first three 
kg were asked for Cards I, II, III, VIII, 
nace The first two responses were asked for the 
we cards. The experimenter introduced each 
ii esi this is Card number x and for this 
fee. Would like just the first three (or two) 
thing’ this inkblot reminds you of or makes you 
hs of.” Three minutes seemed sufficient for the 
ieda ae cards while 5 minutes were sometimes 
ers for the three-response cards. All protocols 
inde typed i with carbon copies allowing three 
medusa dent judges to determine barrier scores.5 The 
meta barrier score was determined for cach experi- 
Ri a situation and those above the median were 
E Snated as high barrier persons while the others 
re designated as low barrier persons. 


RESULTS 
Barrier Score 


we can be seen in Table 1, the interjudge 
5 Ing of the barrier score was quite ade- 
luate with the average Pearsonian correlation 
te ha = 89. This lends further support to 
ree that the barrier score can be 
‘Sly assessed. The distribution of the 
ea scores ranges from 0 to 15. The mean 
= ne total distribution was 6.31, with the 
= rs being 5.82. An analysis of variance 
Sieni N barrier scores for the three experi- 
Teil conditions revealed no significant 
s. 

the 1 Seater importance was the analysis of 
regard Pr et samples of subjects with 
- o their barrier score distributions. 
Was ese Seares determined by Seymour Fisher who 
e three judges were used as criterion 


eas k c 
Sbeas S The evaluation of responses as barrier 
ollowed Fisher’s standards. 


un 
N 
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TABLE 1 


INTERJUDGE RELIABILITY OF THE 
BARRIER SCORE* 


ji J2 Js 
= a = 
J2 96 = 
Js SI -90 — 


a The scores were determined for 96 protocols and the cor- 
relations are product moment. 


A ¢ test analysis resulted in no statistically 
significant difference. These results allow one 
to consider both samples as being selected 
from the same general population. 


Assessment of Technique 


The success of this study was very depend- 
ent on the experimental subjects perceiving 
their fictional partners as real. Three ques- 
tions in the postquestionnaire are relevant. 

Question two asked, “How well do you 
know the other members—much, moderate, 
slight, not at all?” It revealed the fact that 
NO subject admitted to knowing the other 
members very well. For the most part they 
either did not know the other members or had 
only seen them around. They may have 
known one of the other members or even two - 
of the other members to some degree but in 
no case did they know all the other members. 
The desired effects were apparently ac- 
complished, meaning that the subject could 
not logically rule out all members of her 
group as the possible communicant. 

Question three, “Did you know who your 
partner was and how well do you know her?,” 
yielded different results according to the ex- 
perimental conditions. Among the total 64 
subjects in the N-Situation and P-Situation 
no one knew who their partner was although 
four subjects in the P-Situation chose an 
actual person in her group. In the Z-Situa- 
tion there were 8 subjects out of the total 32 
that expressed doubts about having had a 
partner. They wondered if the experimenter 
was delivering the messages. This is some- 
what logical behavior since they received no 
replies at all. Interestingly enough, 6 of the 
8 subjects who expressed this awareness were 
high barrier persons. 
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Question six asked the subjects what they 
thought the experiment was about and was 
another effort to uncover an awareness by the 
subjects of their fictive partners. Needless 
to say, no one knew what the experiment was 
about and only those who had previously ex- 
pressed doubts about their partner did so 
again. o 

Of equal importance was the affective 
values of the standard preprepared messages. 
Question five, “Would like to meet and know 
your partner—why?,” was an attempt to 
evaluate the effects of the three experimental 
conditions and the messages in particular. In 
the N-Situation there were only four subjects 
who did not care to meet their partner and 
learn to know her better. Unfortunately, the 
subjects did not verbalize exactly why they 
felt the way they did. The four answers were: 
“not necessarily so,” “frankly I don’t care,” 
“I thought her notes senseless,” and “wwe 
have different interests.” All four of these 
subjects who verbalized this ability to reject 
their partners were high barrier persons. In 
the P-Situation there were two high barrier 
persons and one low barrier person who ex- 
pressed a disinterest in meeting their partner. 
The high barrier answers were to the effect 
that “maybe, she doesn’t seem to have great 
confidence,” and “not more than the others 

for she didn’t seem interesting.” The low 
barrier person simply said, not especially. In 
the Z-Situation six low barrier persons ex- 
pressed a disinterest in meeting and 
to know their partner better, while o 
high barrier person did not wish to meet her 
partner because she thought she already knew 
her. This completely unstructured situation 
seemed to be much more upsetting to the 
low barrier person. Some of the answers were: 
“makes no difference,” “not particularly, 
since she was not very sociable,” and “doesn’t 
matter,” (this person said she felt frustrated 
by the whole situation). All of the other sub- 
jects in the three situations answered that 
they would like to meet their partner, with 
the usual answers being that they liked to 
meet people or to see what others wrote for 
their stories. The subjects in the P-situation 
were the only ones to describe their partner 
to any degree. The simulated partners were 


getting 
nly one 
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x : asiye, and 
described as friendly, vague, ev ae = 
some thought the partners lacked confidence. 


Analysis of Hypotheses 


Hypothesis I predicted that the high ce 
rier persons would attempt more pra e 
tion than the low barrier subjects. Rwa 
measures of attempts to communicate Wer 
used. : the 

The first measure used was simply it 
number of notes sent by the subjects to me 
simulated partners. An analysis of varian i 
of these data is presented in Table 2. As <a 
be noted there is a very significant age 
between high and low barrier groups. iow 
mean frequency of messages sent by the ier 
barrier group was 3.77, while the high E 
group sent a mean of 5.12 messages. ro 
was no significant interaction between i 
rier groups and situations. However, Tab a 
does reveal that the situations caning 
influence on the frequency of messages. ‘oft 
average frequency sent in the Z:Situaun 
was 3.84, in the N-Situation it was 5.12, oe 
in the P-Situation the mean frequency, a 
4.69. The ¢ tests indicated a significant di the 
ence only between the Z-Situation and no 
N-Situation. One could expect that the 
answering replies would tend to depress 
amount of intercommunication. 

Although the subjects were directed 
write only one sentence to a slip of pape!» 
was not the result. Since slips of paper 
quently contained several points for on 
cussion, another measure of attempted © 


to 
this 
fre- 
dis- 


. i the 
munication was used. This consisted Scored 
number of units of communication $ 

TABLE 2 
FOR Ê 
A COMPLETE ANALYSIS or VARIANCE FO 
3 X 2 FACTORIAL DESIGN 
es 
F of units r ot mest 
“ofun F 
Source ag ms F MS ot 
aie 
Between cells 5S asai 1.96 ae sie 
Situations (A) 2 3464 2.70 ast T% 
Barrier scores (B) 1 5251 410* spat ol? 
BXA 2 1.88 0.15 43 
Within cells 90 12.81 i 
Total 95 
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under the Bales system (Bales, 1950). In 
other words each subject’s score was the num- 
ber of units of communication instead of the 
frequency of messages. Admittedly, the num- 
ber of units of communication is dependent 
on the number of slips of paper sent and a 
Pearsonian correlation of 7 =.79 was ob- 
tained for the two measures. Table 2 also 
shows the results of an analysis of variance 
for the frequency of units. Once more con- 
firmation of Hypothesis I is found. High 
barrier subjects sent a mean of 8.12 units, 
While low barrier subjects sent a mean fre- 
i of 6.64 units. This represents a sig- 
ce cant difference at the .05 level. There were 
te ee differences among the situations 
were there any significant interactions. 
sure Pothesis II predicted that high barrier 
jects would give more directions and sug- 
aol make more committal and orienting 
abl ements, be less self-depreciating, and be 
€ to disagree more frequently than the low 
arrier subjects. In terms of the Bales’ cate- 
a for content scoring, Hypothesis II 
$ = be that high barrier persons would 
di of Category 4 (gives directions), 
show more preponderance of Category 
and 6 (gives opinion and orientation) over 
ioe 7 and 8 (asks for orientation and 
Glen would show less of Category 11 
Rote: epreciatory ) and show less of Cate- 
Bags, 1 and 3 combined (“going along” and 
ve acceptance).° 
ta Order to test this hypothesis, chi square 
aes were computed for the presence or 
Siem: of the selected categories between the 
St eatin efforts of the high and low 
the persons. The results revealed just one 
Hea m squares reached a statistically sig- 
n level and it indicated that the high 
tiles subjects as a group did more giving 
he oy i and orientation as opposed to asking 
wa nion and orientation than did the low 
are in orp: However, all of the chi squares 
Subjects ee direction. High barrier 
Our and r tend to show more of category 
ess of the behavior classified in 


ce pn Subiect showed the presence of cate- 

tka , & 8, combinations were formed such 

Wou] id sh Prediction was that high barrier subjects 

exten ROW Categories 5 & 6 > 7 & 8, to a greater 
as a group than the low barrier subjects. 
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category eleven and in the combined category 
one and three. 

Hypothesis III predicted that a frustrating 
and/or threatening situation would tend to 
increas? the differences predicted in Hy- 
pothesis II. In terms of this study this meant 
that the differential frequencies for predicted 
Bales’ categories would be heightened in the 
N-Situation and the Z-Situation. A chi square 
analysis broken down for the three experi- 
mental situations revealed only one chi square 
that reached a significant statistical level and 
it was in the N-Situation. However, the chi 
squares were greater in magnitude in the 
N-Situation for three of the four criterion 
categories and Z-Situation yielded the largest 
chi square for the lone category not accounted 
for by the N-Situation. It is difficult, just the 
same, to cite the present data as evidence for 
increased differences as due to the different 
situations. The frequencies within the chi 
square cells are so small and show such slight 
differences that one could easily question the 
stability of the present results. This is par- 
ticularly true for the Z-Situation. One might 
describe the evidence as slightly suggestive. 


DISCUSSION 


In general, the hypotheses in this study 
found some confirmation. In a controlled com- 
munication situation the high barrier persons 
did show significantly different social be- 
havior from the low barrier persons. There 
appeared to be both a quantitative and 
qualitative difference. The inability to demon- 
strate more than mere trends for some of the 
predictions can be seen as a partial result of 
the type of analysis undertaken. Originally, 
it was planned that the data would be ana- 
lyzed by an analysis of variance technique. 
However, due to the lack of interjudge 
reliability for the individual units using the 
Bales system, this technique had to be dis- 
carded. The plan then involved considering 
the already scored communication data and 
deciding on the presence or absence of each 
category. This approach did yield reliable 
judgments; however, scoring for the absence 
or presence of a category naturally neglected 
the amount of presence. It can be seen that 
this type of analysis would weaken the 
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chances for any kind of significant results 
although significant results were noted. i 

Another weakness in this study was its 
failure to clearly demonstrate effects for the 
three experimental situations. Several pilot 
studies were done in an effort to construct 
positive and negative preprepared messages. 
Since no one was forced into the experiment 
it was hard for the students to accept some 
of the experimenter’s more hostile messages. 
The subjects did not believe any female nice 
enough to take part would say such things. 
It also became apparent that all feedback 
would have to be rather vague and general 
so as to be universally applicable. In terms 
of real constructive help both sets of messages 
were negative. The final criteria for the 
messages were that they were successful in 
pilot studies to the extent of being plausible 
replies for the messages sent and elicited some 
of the desired effects although not totally so. 
The subjects were very hesitant in directing 
emotional replies to their fictional partner. 
In the postexperimental session feelings were 
verbalized but this was not true for the paper 
and pencil discussion. This was particularly 
evident in the pilot studies, Possibly if the 
subjects had not expected to ever meet their 
partners they would not have been so con- 
stricted. Or, if the subjects would have been 
paid a goodly amount of money they might 
have been more willing to express some feel- 
ings toward their partner who was now being 
paid to produce and was not. A monetary in- 
centive might have helped greatly. 

It would seem that body 
and the barrier score in par 
further investigation. The q 
does the barrier score really 
in the process of being a 
are some who feel the barrier 
cognitive style much in the Kleinian sense, 
The barrier scorer in a way similar to the 
“Jeveller-sharpener” (Klein, 1959) supposedly 
is oriented to distinguishing boundaries of 
objects and/or situations, The meager evi- 
dence bearing on this problem does not sup- 
port such an interpretation. However, this pos- 
sibility has not been sufficiently investigated. 
Holtzman (1961), on the basis of his series 
of factor studies, noted that the barrier score 


image research 
ticular demand 
uestion of what 
measure is still 
nswered. There 
score is another 


JoHN RaMER 


consistently clustered with indexes of the z 
struct of “ego integration.” If this constr A 
manifests itself by an active tienes 4 
cope with the demands of a situation he 
purposive and meaningful way even aes 
somewhat trying circumstances, then Et 
study can also be seen as lending suppor 
such an interpretation for the barrier scor! jer 

Still there is no denying that the ae 
score significantly relates to unique physi it 
ical processes consonant with a body i 
theory. More recent studies by Fisher hip 
Fisher have revealed a strong en S 
between the barrier score and concern ag 
body areas on an internal-periphery og 
indicated by introspective report. pe 
from the literature about the ce 
body-images and personality T 
(Arieti, 1955, Cornelison and Arsenian, i 
Sechehaye, 1951) the need is certainly re ad 
nizable for valid quantification in this a 3 
If the barrier score does measure an asp ee to 
a person’s self-image, one should be ab Jow 
predict more meaningfully what a high or ait 
scoring person’s behavior would be ores 
certain situations. Realistically spea 
within the area of body-image research, o 
surface or barrier is just being scratched, 
should one say penetrated. 
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NOTES AND COMMENTS 


THE VALIDITY OF THE BARRON EGO STRENGTH SCALE 
IN AN. INDIVIDUAL FORM 


JEROME SILVERMAN ! 


School of Medicine, University of Southern California 


Barron (1953) developed an Ego Strength 
scale (Es) which was validated against the pre- 
dictive criterion of successful outcome in psycho- 
therapy. The availability of a short, reasonably 
objective measure of ego strength can be a 
valuable tool for clinical and research purposes. 
Since the 68-item Es scale is extracted after 
administration of the entire MMPI, a problem 
arises for those interested in using the Es scale 
by itself. Different response sets may be elicited 
if items or single scales are isolated from a main 
body of items, thereby affecting the validity of 
the scale. This study investigated whether the 
Es scale by itself would measure the same thing 
as when the entire MMPI js administered. 


METHOD 


An individual Es scale was 
Barron’s 68 Es items in 
of the MMPI. 


constructed comprising 
addition to the 15 Lie items 
Forty Veterans 


psychologically,” 
chiatric disorder; pa 


frequently termed “psychosomatic”—stomach ulcers, 
xcluded from the 


Both groups were administered the full MMPI and 


1 This study was carried out while the author was 
a Veterans Administration Clinical Psychology 
Trainee and a graduate student at the University of 
Pittsburgh. 


validity as when administered in the 


ith the in- 
one week later they were retested with na of 
dividual form. Subjects who obtained Lie $ dual 


8 or more on either the full MMPI or the eer 
scale were not included in the statistical ana ex- 
Four normals and eight schizophrenics were 
cluded from the sample due to their Lie se 

The Es scores were then derived from bot 
Individual scale and the full MMPI. , ther 

The results as shown in Table 1 indicate ri u 
clearly that scores obtained from the ie ee 
Es scale closely approximate those Es suck 
obtained from the full MMPI, The pro ‘ 
moment correlations for the two forms veie 
for the normals and .85 for the schizophren g 
The mean Es scores of the two forms a the 
by only .2 for the normals and only .9 17 
schizophrenic group. 

If the Es scale were valid, one would E 
the normals to have higher ego strength § in- 
than the chronic schizophrenics, Table atly 
dicates that the normals did show signif oth 
higher Es scores than the schizophrenics 0? 


the 


ect 


res 


TABLE 1 i 
ranit” 
COMPARISON or Es Scores serween Hos?! oM 


FR 
NORMALS AND SCHIZOPHRENICS AS Drwy 
Inprvipvar Fors anp Furr M 
TDUAL Form anp Fu aa 
Chronic , s 
B ic 
gi nee schizophre™ 
SD 
Es score from M SD M 
MMPI 43.50 7.31 
Individual scale 43.70 7.84 
r= 91 A 
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the MMPI (t=2.97 <.01) and Individual 
forms (¢= 2.43 < .02). These results support 
the construct validity of the Es scale whether it 
be — from the full MMPI or Individual 
scale. ` 


Journal oj Consulting Psycholog, 
1963, Vol. 27, No. 6, 553-535 ” 


REFERENCE 
Barron, F. An ego-strength scale which predicts 
response to psychotherapy. J. consult. Psychol., 
1953, 17, 327-333. 
(Received April 17, 1963) 


EFFECT OF ORDER OF PRESENTATION OF TAT CARDS 


LEWIS BERNSTEIN + 
Veterans Administration Center, Wood, Wisconsin 
anD RICHARD H. DANA 


West Virginia University 


Dollin and Sakoda have suggested that standard order of TAT administration 
will produce happier stories toward the end of the series as a function of an 
adaptation-level effect, and that this effect could be lessened by administering 
the cards in alternate happy-sad order. To test their prediction, comparisons 
were made between Cards 1-10 and 11-20 on 67 Ss whose TAT stories had 
been rated for emotional tone and sadness of outcome. These comparisons 
yielded t ratios, significant beyond the .001 level of confidence, in a direction 
opposite from the prediction. The ranks of TAT cards for various samples of 
data were compared for emotional tone and outcomes, and indicate that 
standard order of TAT administration contains an approximate alternation 
of happy-sad order. Changing the standard order of TAT administration does 


e e s 


not appear warranted on the basis of current evidence. 


Dollin and Sakoda (1962) have suggested that, 
as a consequence of the relative sadness of TAT 
pictures, an adaptation to sadness occurs when 
TAT cards are administered in standard order. 
Thus, happier stories should occur to cards 
seen late in the series. Evidence concerning this 
Prediction is available from a previous study 
(Bernstein, 1956), in which TAT cards were 
administered in a standard order, with four 
Methods of administration. Bernstein’s TAT 
Stories had been rated for emotional tone (Eron, 
Terry, & Callahan, 1950) and for sadness of 
Outcome (Eron, 1950). According to Dollin 
and Sakoda’s prediction, the ratings for out- 
Come and emotional tone should be happier on 
the last 10 cards than on the first 10. 

Table 1 shows £ test comparisons of ratings 
On these scales for Cards 1-10 with 11-20, 
Under each method of administration. Two of 

e four comparisons for emotional tone and 
wae of four comparisons for outcomes are sig- 
qeeant in a direction opposite from that pre- 
mee by Dollin and Sakoda (1962). The non- 
Sige comparisons show opposite trends; 
piedi of the eight comparisons supports their 

iction. When the four groups are combined 


“Also affiliated with iversi 
the Marquette Universit; 
School of Medicine. i ” 


(N =67), the differences between emotional 
tone and outcome means for the first and last 
10 cards are significant beyond the .001 level. 

As a means of comparing picture rating with 
story rating, Dollin and Sakoda (1962) com- 
pared the ranks for emotional tone of the 19 
cards they used in common with Eron et al. 
(1950), with a resulting rho of .81 for men 
and .76 for women. A similar comparison of our 
female subjects with those of Eron et al. (1950) 
yields a rho of .74. When the ranks of the 
cards for emotional tone for our subjects are 
compared with Eron’s (1953) normative data 
for women, the resulting rho is .76. In com- 
paring the ranks of the cards for outcome for 
our subjects with those of Eron’s (1953) 
normative data for women, the rho is .81. These 
findings indicate high consistency for the relative 
ranks of the TAT cards for mood ratings be- 
tween the samples used by Dollin and Sakoda 
(1962); Eron et al. (1950); Eron (1953); and 
Bernstein (1956). However, an inspection of 
the ranks of the cards for the various samples 
(Table 2) indicates that there are as many or 
more sad rankings (1-10) among the last 10 
cards as among the first 10. 

Dollin and Sakoda (1962) suggest that the 
adaptation-level effect might be minimized by 
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TABLE 1 


DIFFERENCES IN MEANS FOR EMOTIONAL TONE AND OUTCOMES 
BETWEEN Carns 1-10 ann 11-20 


Emotional tone Outcomes 
Method of 
administration N  M(i-10) M(11-20) t M(1-10) M(11-20) t 
Oral, examiner absent 17 —9.598 —11.29 175 =365 —5.65 ve 
Oral, examiner present 16 4.13 —5.38 1.30 3.69 0.00 ae 
Written, examiner absent 18  —8.50 —11.33 3.58* 50 mary fa i T. E 
Written, examiner present 16  —3.25 —5.63  4.67* 4.69 —1.00 3.5 


® The greater the negative quantity, the sadder the rating for emotional tone and outcomes, 


*p <01. 


“noting the stimulus values of the pictures and 
administering them in alternate happy-sad order 
[p. 344].” Again, inspection of Table 2 reveals 
the adequacy of the standard order in meeting 
their suggestion. In fact, Dollin and Sakoda’s 
data reveal that of the 7 cards rated happiest, 
4 are numbered lower than 10 in the standard 
order; of the 7 cards rated saddest, 4 are num- 
bered higher than 10 in the standard order, 

This reanalysis of data from a Previous study 
fails to confirm the Dollin and Sakoda (1962) 
prediction that TAT cards seen toward the end 
of the series would evoke happier stories than 


those seen earlier. A significant difference in ie 
opposite direction from the prediction “he 
demonstrated. Our findings indicate that m ios 
standard order of administration, the stimu 
values of the cards are well randomized. 
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TABLE 2 
RANK ORDER 0 o% NAL TONE E 
a S F TAT Cts ‘BOE ol TONE AND OUTCOMES an 
Emotional tone Outcomes 
_ ici 
N Eron Eron et al. Bernstein Eron Bernstein 
Card (1953)» (1950) (1956) (1953)b (1956) 
1 15 11 18.5 5 15 
2 13.5 12.5 13 ti 16 
3 2 4 3 3 7 
4 "6 6 5 12 10 
5 12 15.5 8 8 A 
ó 5 14 16 11 11 
7 16 17 9 15.5 13 
S 7 19 14 9 17 
re 8 5 11.5 5 6 
A 18 12.5 20 20 20 
i 9 7 6 13 4 
a m 10 11.5 7 8 
3 1 1 2 
E 19 20 15 = 18 
4 3 4 4 $ 
16 20 18 18.5 19 19 
17 13.5 9 10 6 12 
18 3 2 2 3 2 
19 17 15.5 17 14 14 
20 11 8 7 10 _ ee 
‘ote.—The rank of 1 signifies the saddest card. chan = 
© Computed from Table’ feat saddest card: the rank of 2 


b Computed from Table 3, Eron (1953), 
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VALIDITY OF THE ROSENZWEIG PICTURE-FRUSTRATION 
STUDY WITH FELONS AND DELINQUENTS 


SAUL ROSENZWEIG 


Washington University, St. Louis 


_In the recent Brief Report by Mercer and 
Kyriazis (1962) a comparison of a group of 
Physically assaultive prisoners with a group of 
Matched normals on the Rosenzweig Picture- 
Frustration (P-F) Study yielded no significant 
differences, This result is not an isolated one. 
Similar findings for such populations have been 
reported by Fry (1949), Holzberg and Hahn 
(1952), and Norman and Kleinfeld (1958). 
Kaswan, Wasman, and Freedman (1960) in a 
quite carefully designed and comprehensive in- 
vestigation got somewhat more favorable results. 
Moreover, in an investigation by Lindzey and 
Tejessy (1956) involving both the P-F Study 
and, more centrally, the TAT, it was found that, 
contrary to hypothesis, “signs” of aggression on 
the TAT correlated outstandingly with subjects’ 
Self-ratings—not with ratings assigned by a 
Well-informed diagnostic council; and that the 
TAT variables correlated highly with scores on 
the P-F. The P-F proved to be at least as good 
an indicator of aggression as was the TAT. 
Since Mercer and Kyriazis in their brief and 
cautious conclusion state without further elabora- 
tion that their results “underscore the difficulty 
in relating reported attitudes to overt behavior,” 
it appears worthwhile to note some of the inter- 
Pretive considerations that underlie this dif- 
ficulty. A technique like the P-F may operate 
at any one of several (usually) undefined levels 
“opinion, overt or implicit (Rosenzweig, 1950). 
hese levels must obviously be taken into ac- 
Count in research on delinquents and criminals 
(Rosenzweig, 1960, p. 171) and, indeed. quite 
Senerally. Assaultive delinquents may be well 
versed in the denial of their hostile tendencies 
and, if so, would, at the opinion level, obtain 
ea or even “better than normal” extrapuni- 
me ba The paradoxical results of Lindzey 
iat ete (1954) are probably to be so ex- 
th + ; There is also the less obvious possibility 
at, at the implicit level, some criminals may 


actually be more intropunitive than normals (vide 
the psychoanalytic concept of criminality from a 
sense of guilt); and, if this level is being tapped 
by the P-F, the scores of such individuals on 
intropunitiveness might actually be higher than 
those of matched normals. It might then be the 
exception rather than the rule to find delinquents 
performing on the P-F at the overt level and 
thus reflecting the elevated hostility that had 
placed them in the asocial group. 

In short, the problem of determining the level 
at which a subject performs on any projective 
device at a particular time remains crucial and 
unsolved, not only for the P-F Study but for 
the entire range of these diagnostic instruments. 
(The P-F has the minor virtue of making this 
point somewhat more evident than do other pro- 
jective methods which are more complex and 
meaningful unless the data also supply a definite 
context for understanding why the results were 
negative, i.e., why they were contrary to the ex- 
plicit or implicit hypothesis under which the work 
was conducted. 
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NOTE ON TESTING CONDITIONS + 


ROBERT C. PIERCE 


Langley Porter N, 


Authors of intelligence 
mend that their test be a 
undistracting surroundings, 
example, says that testing 
room “free from distract 
trusions,” 

As part of a large multidi: 
geriatric mental illness (se 
1962; Simon & Fiske, 1960) 
casion to administer four W 


ligence Scale (WAIS) su 
Comprehension, i 


tests usually recom- 
dministered in quiet, 
Wechsler (1955), for 
should be done in a 
ing noises and in- 


1 This research w. 


2 as supporte 
National Institute 


d by grants from the 


europsychiatric Institute, 


San Francisco 


nd 
“fair,” and “poor” based on the frequency E 
disruptiveness of interruptions, noise, ET the 
trusions. A rating of good was given aalo 
testing was completed in a continuous we 
with no more than one or two minor nois oices 
distractions, such as traffic noise or the en i 
of neighbors. A rating of poor was Oe one 
there were several distracting noises oF (uch 
or More very disruptive events occurre home): 
as “kibitzing” of fellow patients in a rest scaled 
The unit of analysis is the sum of the ts the 
Scores for the four subtests. Table 1 presen cores 
Means and standard deviations of WATS § 


. or 
for patients tested under good, fair, “One-way 
conditions at first and second retests. the 


analyses of variance indicate that means “uber 
three testing conditions do not differ at 
first retest or second retest. , s 0 
Since there were several interviewer" nd)» 
during the first retest, 9 during the 5° rating 
it is possible that differences in criteria Be 
testing conditions account for the absence esting 


expected relationship between score and ‘ce 
Be, ria! 
and the Califor, f Mental Health (3M9145(C4)) conditions, Separate one-way analyses of views 
(62-1-16.3) ornia Department of Mental Hygiene Were therefore computed for each inter. ihe 
who tested more than 20 patients. None 
TABLE 1 
X WAIS ny TESTING Conpitions, 
First retest 
Testing Mean 
condition N WAIS» SD sD 
ae 30.1 11.4 12.5 
Fair 9 27.6 8.0 10.9 
Poor 28.6 55 15° 
Foye2=.246 
ptest” 
* Each individual's score is the f an stl 
The mean of the individual scores iy aoe ald Scores for Information, Comprehension, Arithmetic, and Digit ore 
b When fair and poor are combined and tested against good, 


= 1.16, > 10, 


—< 
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TABLE 2 
WAIS ny TESTING CONDITIONS, 
COMMUNITY SAMPLE 


Testing Mean 
condition N WAIS* SD 
Good 236 39.3 10.1 
Fair 27 37.6 10.0 
Poor 2 34.0 12.0 
Fajog9= 604 


ny Each individual's score is the sum of scaled scores for In- 
teste at Comprehension, Arithmetic, and Digit Span sub- 
S. The mean of the individual scores is shown here. 


analyses revealed significant differences between 
the three testing conditions. 

Although unlikely, it is also possible that the 
„arge variability of scores found in a mentally 
ill sample obscured small but real differences 
among the testing conditions. Such differences 
Might be revealed by examining a normal sample. 
Another phase of the project called for testing 
265 People over 60 years of age who were living 
in San Francisco. The same 3-point rating of 
testing conditions was available. Table 2 pre- 
Sents a one-way analysis of variance on the 
Means for the three testing conditions. Again, 
No significant differences are found. 
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In view of the widespread assumption that 
testing conditions have an important effect upon 
test scores, it is surprising to find that such an 
effect does not appear under the extreme range 
of conditions we encountered. Our rating of 
testing conditions is admittedly a crude one and 
will allow us to conclude only that poor testing 
conditions do not have a gross effect upon test 
scores. A more homogeneous group, such as 
college sophomores, might be better suited to a 
test of the hypothesis that distractions and noise 
affect test results, but at the least we have 
evidence that intelligence tests are more robust 
than test authors have thought. 


REFERENCES 


Croox, G. H, & Katz, L. Patterns of geriatric 
mental illness: Intellectual functioning of aged 
patients and non-patients. In C. Tibbetts & Wilma 
Donahue (Eds.), Social and psychological aspects 
of aging. New York: Columbia Univer. Press, 
1962. 

Sson, A., & Fiske, Marjorie. Studies in geriatric 
mental illness: Some research plans and assump- 
tions. J. Amer. Geriat. Soc., 1960, 8, 363-372. 

Wecuster, D., Manual for the Wechsler Adult 
Intelligence Scale. New York: Psychological Cor- 
poration, 1955. 


(Received December 3, 1962) 


INVESTIGATION OF THE STABILITY OF COLOR PREFERENCES 


WALTER A. HAAS 


Veterans Administration Hospital, Lyons, New Jersey 


An experiment to determine color preferences was performed with 20 college 
students who were their own controls. 3 techniques were used: tiles, free 
painting, and questionnaire. Relationships between the 3 techniques were 
positive and significant at the .01 level. The stability of the color choices was 
investigated by repeating the same 3 techniques 4 weeks later. Data indicated 
that the color preferences remained consistent. Significance was at the 01 
level. Ss were exposed to stimuli which produced frustration or success to note 
whether the color preferences could be altered. Data obtained with the chi 
square method indicated that the established color preferences were not 


altered by these stimuli. 


Past investigations have dealt with the color 
Preferences of specific groups; for example, 


1 This paper is adapted from i i 

a a dissertation sub- 

are to New York University in partial fulfillment 

4 cc requirements for the PhD degree. The writer 
A F to acknowledge the invaluable assistance of 
a onsoring committee, Edward Kemp, H. Harry 

Siles, Hale Woodruff, and Florence Halperii. 


psychotic patients, Indians, Negroes, infants 
(Block & Caldwell, 1959; Garth, 1922; Mercer, 
1925; Staples, 1932). A number of studies have 
connected specific personality attributes to choice 
of specific colors. Napoli (1951) associated 
green with creativity. Naumburg (1953) ascribed 
hate and depression to the color black. Alschuler 
and Hattwick (1947); Brick (1944); and 
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TABLE 1 
SUMMARY OF PROCEDURE 
Sitting 1 Sitting 2 
Part A Part A 7 


1. Tile choices Steps 1, 2, 3, of Sitting 
1 Repeated 
2. Free painting 


3. Questionnaire 


Part B 


1. Frustration 
Stimuli (word 
reconstruction) 


Part B 


1. Success stimuli 
(word reconstruction) 


Steps 2, 3 of Sitting 1 
2. Three designs repeated 


(tiles) 


3. Three designs 
(free painting) 


Note.—Sitting 2 was presented 4 weeks after Sitting 1. 


Lehman and Risquez (1953) have inferred 
similar relationships. 

The present study is concerned with the 
determination of color preference, the consist- 
ency thereof, and the effect of manipulation on 
the established color preference base. 


METHOD 
Subjects 


Twenty undergraduate students from the Univer- 
sity of Delaware volunteered for the study. Since 
each subject was seen individually and since he was 
his own control, no further delimitations on the 
selection of subjects were imposed, 


Procedure 


Subjects manifesting color blindness were excluded 
by the Jensen test for color blindness. Color prefer- 
ences were obtained by three methods: tiles, free 
painting technique, and a questionnaire. To note 
consistency of the given color Preferences, the three 
methods were repeated at least 4 weeks later. 

Tiles. Six tiles (red, blue, green, black, white, 
yellow) were taken from the Lowenfeld (1952) 
mosaic test. Each subject was asked to arrange the 
tiles in the order of his preference, 

Free painting technique. Tempera Paint was 
chosen. To facilitate a comparison, the same colors 
were selected. Gray paper was used so that each 
color, especially white, could show up sufficiently 
well. The subject was told: choose a color and 
paint anything you like. The subject could use only 
one color per paper. After a particular color was 
used, it was taken away. The instructions were 
repeated until all colors were used, 
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Questionnaire. Color choices were obtained er 
questionnaire without presentation of visual s! pare 
Questions pertaining to the rationale of me a 
of the colored tiles, factors that influence ubse- 
selection of particular color paints and their s 
quent productions yielded qualitative data. a in 

The summary of the procedure is give 
Table 1. 

Stimuli which produced frustration and ates 
were utilized to note whether the color bee 
base could be altered. The object of the a 
stimuli was to interfere, interrupt, or pene 
prevent the completion of goal-directed resp ce a 
The aim of the success stimuli was to produ oal- 
favorable and desirable termination of a el 
directed response, The frustration stimuli were Pare 
during the first session and the success stimuli V 
administered 4 weeks later. New 

Frustration Stimuli. Seventy words from chosen: 
York City school principal examination were € < The 
These words were unusual, difficult, and pompie nt 
letters of each word were so arranged to pre te), 
recognition. To illustrate: aacentttu Gate ain 
acehnopt (cenotaph), acerrst (serrate). To mal the 
the subject’s motivation every seventh word ei 50 
list was an easy word. A total time limit O was 
minutes was imposed and the time per word tion, 
limited to 40 seconds. During the adminis ten 
such disparaging remarks as, “For a college stu the 
you are doing poorly” were made. These were ak 
written instructions: “Rearrange the letters to ™ is- 
a proper word.” Below the instructions three 
arranged simple words provided a sample. enth 

Success Stimuli, Forty easy words from 2 sey ate: 
grade public school test were selected. To illust" 
omtor (motor), ciily (civil), conab (bacon). 
total time limit of 50 minutes was imposed. suc 
performance of the subject was encouraged bY jtten 
remarks as, “You are doing fine.” The w 
instructions were: “Rearrange the letters to ™ 
a proper word.” jmple 

Below the instructions three disarranged 5 
words served as a sample. jmuli 

After the presentation of the frustration sP ed 
and the success stimuli, the subjects were ^° ne 
(a) to make three designs with tiles, using 0” 7 ized 
color per design (the entire mosaic test was ut ings 
for this purpose) and (b) to make three pal? uld 
choosing one color, per paper. The same color was 
have been repeated, as no order of chow to 
requested. The object was to study respons 
the frustration and success stimuli. 


RESULTS AND DISCUSSION 


the 
, : A n 
The relative standings of each subject ih the 


three different techniques was examined Wii, in 
Spearman rank correlation method. The 
Table 2 indicate positive correlations hni a 
the responses obtained by the three tec the 
All correlation coefficients were significan erence? 
01 level, Correlation between color pea 
obtained by the tiles and the questionna! 


————— ee ee 


a 


nem o S 


mer 
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TABLE 2 
COMPARISON OF COLOR PREFERENCES AMONG THE 
THREE TECHNIQUES EXPRESSED IN 
RANK CORRELATIONS 


Free 
Tiles Tiles painting 
with with with 
Íree question- question 
painting naire naire 
Sitting 1 537 703 518 
Sitting 2 20355 -809 „548 
ED a as = 


higher than those between the color preferences 
obtained by the free painting technique and the 
other two techniques. There were qualitative 
Suggestions that form and content played a 
more important role on the free painting tech- 
nique. Color preferences remained reliable over 
the 4 week period (tiles rho = +.739, free paint- 
ing technique rho=-+.567, questionnaire rho 
=+.779). All correlations were significant at 
the .01 level. 

The hypothesis that the color preference base 
can be manipulated by frustration and success 
stimuli is rejected. The order of preference was 
maintained. The first three places of the color 
preference base were used in 71.7% of the tile 
designs and paintings. Significance was investi- 
gated with the chi square method and found 
to be at the .01 level. Data indicate that the 
stimuli used in-an effort to manipulate the color 
Preference base served the desired objectives. 
Remarks, performance, and overt behavior of 
the subjects during the administration of the 
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frustration stimuli indicate that responses to the 
tasks were interfered with, interrupted, and in- 
complete. Success stimuli on the other hand 
were easy and readily dealt with. 

Results of the study support the conclusion 
that color preferences remained stable over the 
4-week period and were not affected by the 
frustration and success stimuli. 
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THE PROBLEM OF SELECTING 


DRUG-FREE SCHIZOPHRENICS 


FOR RESEARCH +: 


LOREN J. CHAPMAN 


Southern Illinois University 


In selecting patients to be removed from phenothiazine therapy for purposes 


of research, 38 chronic male schizophrenics at a V. 


With the widespread use of tranquilizing drugs, 
it is frequently difficult to obtain drug-free 
samples of schizophrenics for purposes of re- 
search. Investigators often select their patients 
without regard to their current drug status, 
perhaps with the rationalization that it is not 
yet known just what effect the drugs have any- 
way. This is a perilous procedure, since the 
drugs clearly influence some aspects of the 
schizophrenic psychosis in many patients, and 
if the investigator’s measure is tapping some- 
thing of relevance to schizophrenia, there is a 
good chance that differences between normals 


and schizophrenics may be washed out by the 
drug. 


is to test only those 
currently receiving such 
however, this group is 
also likely to be com- 


ever, many of the patients may then show an 
accentuation of their ps: 


1This study was supported by a research grant 
MH-3481 from the National Institute of Mental 
Health, United States Public Health Service. The 
author is indebted to the Veterans Administration 
Hospital in Lexington, Kentucky for Permission to 
test patients. Glenn Miller assisted in testing the pa- 
tients and in analyzing the data, 
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e 
staff are likely to become concerned for gi 
patients’ welfare and to ask that pee T 
tients be returned to medication, This Poa breuió 
the investigator with an atypical sie awa 
sample, namely, those for whom the with mo 
of drugs does not result in an increase of eal 
toms. This group may differ on psycho! ee 
measures from the remaining patients ee ae 
helped by drugs. The present report pres 
some evidence on this point. 

Several investigators arranged at a 
Administration hospital to withdraw free 
from drugs for studies which required drug ie, 
schizophrenics, The subjects were whi 21 
long-term, chronic schizophrenics, betwee ica- 
and 56 years of age, with at least 6 years © ore 
tion, whose most recent admission was no 
recent than 6 months, who showed no signs 
mental deficiency or brain damage, and Wee a 
not received electric shock therapy in the Pi by 
ous 6 months. Patients were also selector J 
ability to take one of two simple pape” g in 
pencil screening tests, like those to be Y5 
the research, 


Veterans 
patients 


METHOD ony™ 
Instruments, The Screening tests were 4 how pencil 
test? (Burstein, 1961) and a paper-an F 


. man 
version of g breadth of concept test (Creed af 
Taylor, 1957). The homonym test cons 
48 items of the following format: 5 sea 

a e 

Circle the word that means almost the sam? 


See Ocean Idol 


The correct answer, of cour.., is r 
tendency to confuse homony™“ wihi 
would b - choice O 

: e reflected by the SHOW. guds 

?The author is indebted to Alvin 
supplying a copy of his homonym test. 


| 


y 
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TABLE 1 


MEAN Error Scores oF PATIENTS SUBSEQUENTLY Kept Orr Drucs AND RETURNED To Drucs 


Homonym test 


Breadth of concept test 


Uniden- Inclu- Inclu- 
tified sion sion Exclu- 
Homonym error Total similar dissimilar sion Total 
Kept off drugs 24 6 3.0 7.4 4 3.4 11.2 
Returned to drugs 17.5 1.0 18.5 10.7 1.6 7.7 20.0 


alternative. The third alternative, “Idol,” is intended 
to bear no relationship to the stimulus word, and is 
included in the item as a measure of errors of 
Unidentified origin. For a further description of this 
test, see Burstein (1961). 

The breadth of concept test was presented in the 
following format: 


Circle each word which names a fruit. 
Cross out each word which does not name a fruit. 
Peach Screwdriver Saw Peas Corn Pear, (ete.) 


The breadth of concept test contained six tasks 
like the one above, each of which had 30 items 
which were to be marked as belonging, or not 
belonging, to the named concept, and in each case 
the 30 alternatives included 10 items which cor- 
rectly belonged in the concept (e.g., “fruit,” above), 
10 which did not belong but were similar to the 
Correct items (eg. “vegetables,” above), and 10 
Which did not belong and were dissimilar (e.g., 
tools,” above). The six named categories for the 
tasks were “fruit,” “trees,” “alcoholic drinks,” 
„animals found in the home or on the farm,” 
kitchen tools or implements,” and “articles of 
clothing.” For a further description of this test, see 
Chapman and Taylor (1957). 

Half the subjects were given the homonym test 
for screening purposes, and half were given the 
breadth of concept test. All of these were scheduled 
to be removed from medication for 8 weeks, the 
first 6 weeks of which were to be used as a drying- 
out period, and the last 2 weeks for testing. 

Subjects. Of the 111 patients who were seen, 
52 took one of the two screening tests successfully. 
The hospital management consented to remove 38 
of them from medication, explaining that 12 of the 
remaining 14 were too disturbed, and two were 
about to leave the hospital. Of these 38 patients, 20 
had taken the homonym test, and 18 the breadth of 
concept test. Their mean age was 39.7, and mean 
education was 10.8. All of these patients were 
receiving phenot!åəzine therapy. 12 were receiving 
chlorpromazine, dasages ranging from 200 to 800 
milligrams a day. Others were receiving tri- 
fluoperazine, prochloroperazine, and _ thioridizine. 

After the Patients were removed from medication, 
many of them began to show an exacerbation of 


their psychotic symptoms. Some became more with- 
drawn, or untidy, or openly delusional, or showed 
ritualistic gestures. The nursing staff expressed con- 
cern to the ward physicians, and they in turn asked 
the hospital administration to remove certain pa- 
tients from the research project. This continued until 
18 of the 38 patients were returned to medication. 
These 18 patients were returned after a mean of 
26.9 drug-free days. 

When the experimental testing was begun after 
the 6 week drying-out period, it was noticed that a 
surprisingly large proportion of the patients seemed 
to be performing in a fairly normal manner. In 
order to get some light on this, these patients were 
compared as to their earlier performance on the 
screening tests with those patients who had been 
prematurely returned to medication. At the time of 
the screening tests, all patients were, of course, on 
drugs. Of the patients prematurely returned to drugs, 
11 had taken the homonym test, and 7 the breadth 
of concept test. Of the patients maintained off drugs, 
9 had taken the homonym test, and 11 the breadth 
of concept test. 


RESULTS 


The findings are shown in Table 1. As seen 
there, the two groups appeared to differ greatly in 
their initial pathology as reflected by perform- 
ance on the screening tests. On both the 
homonym and the breadth of concepts tests, the 
patients who later were returned prematurely to 
drugs made more total errors. This was sig- 
nificant at the 5% level in each case as deter- 
mined by a two-tailed Mann-Whitney U test. 
On the homonym test, the difference score of 
homonym errors minus unidentified errors also 
distinguished the two groups at the 5% level 
of significance. 

As the result of these findings, the investigators 
decided that they could not rely exclusively on 
the patients who were maintained off drugs as a 
source of data concerning the nature of schizo- 
phrenic psychopathology. They obtained else- 
where another sample of patients who were kept 
off drugs regardless of their symptoms. 


CONCLUSIONS 


Investigators who compare groups of schizo- 
phrenics with normals should take into account 
the drug status of their patients. In addition, if 
they use patients who were not currently 
receiving drugs, they should attempt to obtain a 
sample which is not biased by selective exclusion 
of subjects from the sample. Any conclusions 
that are reached using only patients whose 
symptoms do not worsen with the withdrawal 
of tranquilizing drugs may be limited in their 
generality. o. 

Schizophrenics who profit from phenothiazine 


l Consulting Psychology 
rva 27, No. 6, 542-544 
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; ; 7 r 
therapy appear to differ in their thought disoe, 
from patients who do not. This might i ges 
further exploration by investigators w GM ai 
concerned with finding subvarieties of schiz 
phrenia, 
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THE RELATION BETWEEN MATERNAL SELF-ACCEPTANCE 
AND CHILD ACCEPTANCE ! 


GENE R. MEDINNUS And FLOYD J. CURTIS 
San Jose State College 


2 measures of self-acceptance and one measure of child acceptance were 
administered 


to 56 mothers of children 


cooperative nursery school 
positive relation bet 
The hypothesis 

assessing child accı 


in order 
ween maternal 


Assessment of parental attitudes has received 
a tremendous amount of attention in research 
concerned with factors influencing the child’s 
personality development. While the content of 
most current parent attitude tests focuses on 
the parent’s attitude toward a number of aspects 
of child rearing, of at least equal importance is 
the parent’s attitude toward the individual child 
in terms of an acceptance-rejection dimension, 
In case studies of emotionally disturbed children, 
the inference frequently is made that 


yses, an acceptance- 
in addition to an 


s y theorists haye sug- 
gested that the attitudes held toward the self 
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are reflected in the attitudes held toward i 


Fromm (1939), for example, maintains that 5° 
love and the love of others go hand in hand. ae 
notes that a failure to love the self 35 ms 
companied by a basic hostility toward “ie 
Horney (1950) feels that the person who hers- 
not love himself is incapable of loving Ot nee 
Rogers (1951) argues that when a client aor 
therapy he typically holds a negative castes 
cept; he is unable to accept himself. ON ij 
once the client becomes more accepting of rs. 
self, he begins to be more accepting of o 
The attitudes which a child holds towa” jan 
self, especially those dealing with self-esteer ype 
self-worth, play an important role in his pe pild 
ality development. The extent to which @ cially 
develops a positive self-concept depends T the 
upon the extent to which he is accepte 5 in 
“significant others” ans 


the early years, i 
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TABLE 1 


CORRELATIONS AMONG MATERNAL SELF-ACCEPTANCE AND CHILD ACCEPTANCE MEASURES 


Semantic 
differential 
Bills self- Bills child ac- 
Measures acceptance adjustment ceptance 
Semantic differential 
self-acceptance = 57 os 33F 
Bills self-acceptance —.57** —.48* 
Bills adjustment .28 


r 5 
Pp <.05. 
"3 < 


search evidence for the relation between parental 
self-acceptance and acceptance of the child, 
though several investigations have described the 
effects on the child’s personality development of 
parental nonacceptance. Most of this research 
has dealt with clinic populations, however, and 
the research involving normal parents and 
children is meager and less confirmatory. 


HYPOTHESIS 


The purpose of the present investigation was 
to test the hypothesis that there is a significant 
positive relation between self-acceptance and 
child acceptance in a nonclinical group of 
mothers of young children. 


METHOD 
Subjects 


Fifty-six mothers of children enrolled in a parent- 
participating cooperative nursery school served as 
subjects. The mothers ranged in age from 25 to 
45 years with a mean of 32; years of education 
ranged from 11 to 18 with a mean of 14. The 
children in the nursery school on whom the mothers 
made the ratings ranged in age from 3 to 5 with a 
mean of 4 years. 


Procedure 


Two measures of self-acceptance and one measure 
of child acceptance were administered during two 
consecutive meetings of the mother group. The 
self-acceptance measures were the Bills Index of Ad- 
justment and Values (Bills, Vance, & McLean, 1951) 
and a semantic differential scale of 20 bipolar 
adjectives in which the distance (D) between the 
mothers’ ratings of “Me (as I am)” and “Me (as I 
would most like to be)” was defined operationally as 
the extent of self-acceptance. The child acceptance 
measure consisted of the same set of bipolar 
adjectives with the distance between the mothers’ 
ratings of “My child (as he is)” and “My child 
(as I would most like him to be)” defined as the 
extent of maternal acceptance of the child. 


RESULTS AND DISCUSSION 


The intercorrelations among the three self- 
regard scores and the child acceptance measure 
are provided in Table 1. All of the coefficients 
are in the expected direction; the negative cor- 
relations are due to the method of scoring the 
tests. On the Bills self-acceptance measure, the 
higher scores indicate greater self-acceptance 
while the Bills adjustment scores and the 
semantic differential self-acceptance scores are 
discrepancy scores with the higher scores de- 
noting less favorable self-attitudes. The findings 
strongly support the hypothesis of a significant 
positive relation between maternal self-accept- 
ance and child acceptance. 

It is likely that the present sample of mothers 
was restricted in terms of the child acceptance 
dimension. The mothers were concerned and in- 
terested enough in their children to spend the 
considerable time and effort which is involved in 
participation in the cooperative nursery school. 
Since moderate correlations were obtained in 
this rather restricted range of subjects, one 
might expect higher correlations to emerge in a 
study of mothers randomly selected from the 
general population. 

The validity of the measure employed to assess 
child acceptance remains to be established. Sub- 
sequent research may show that factors in the 
parent-child relationship other than acceptance 
are reflected in this instrument. However, the 
fact that positive results were obtained in the 
present study suggests that the semantic differ- 
ential approach holds promise as an instrument 
for assessing the dimension of parental accept- 
ance of the child. The technique is easy to 
administer and score, and its purpose in assessing 
child acceptance appears to be 
disguised. 
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The present findings suggest that the area of 
parental self-acceptance merits intensive study 
in any attempt to identify antecedents of child 
acceptance, an important element in the child's 
self-acceptance and in his personality develop- 
ment in general. 
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RORSCHACH Z SCORES ON DISTURBED SUBJECTS 


JOHN NICKOLS 1 
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Observations on the Rorschach Tesponses of 12 disturbed children and 75 


hospitalized neuropsychiatric adults provided evidence that 
significantly related to “breadth in mental efficiency.’ 
appeared strongly influenced by (a) the number of 


might be elicited by a task situ 


The responses of disturbed Ss were observed 
for empirical evidence relating to the following 
propositions. (a) If Z denotes a “more accurate” 
measure of intelligence which is independent of 
the type of intelligence used (Beck, 1949), there 
should be no discernible pattern to the correla- 
tions between Z and scores obtained on a 
variety of IQ tests. (b) Although its relation to 
organization activity (Beck, 1949) suggests that 
Z might correlate well with scores on perceptual 
organization, the requirement that scorable Z 
activity be communicated would also suggest that 
it might correlate well with a verbal factor, 


METHOD 


First, a single examiner tested 


12 disturbed 
children (M age = 13 +2). MA 


-equivalents were 


determined on the WISC (M=11 +2), Leiter 
International Intelligence Scale (M=9+ 2), and 
Wide Range Achievement Test (M=9+2). MAs 


on the Ammons Full Range Pict 


ure Vocabulary 
(M =11+2), Benton Visual Reten 


tion (M = 10+ 


Va. Data also 
pital, Lexington, 


142 Arcturus Lane, Alexandria, 
from Veterans Administration Hos 
Kentucky, 


sum Z was 
” In this sense, sum Z 


communicating responses also seems to 
. The level of Z capacity served as the 
adth of mental efficiency in children. 


2, using revised scoring method; cf., Nickols, 19630 
and Draw-A-Person Tests (M =10 +2) were 4 
averaged as Composite scores (M = 10 + 2). 7 
Second, test records were selected randomly o, 35 
relatively newly hospitalized veterans (M 28¢ To 
+ 10). Most of these Ss were diagnosed soe 
phrenic, but no attempt was made to control un 
diagnosis because Levine and Spivack (1959) hach 
similarly significant correlations between Rorse for 
variables on intelligence and Wechsler nrenics- 
hospitalized schizophrenics and nonschizoP tran- 
Neither was short-term or moderate use ° n the 
quilizers viewed as a significant factor whe 
Rorschach and Wechsler Scales are employ ols, 
the evaluative criteria in research (cf, NI 
1958, 1959, & 1962), g) and 
For each adult, the WAIS (M 19 = 91 £ 18) “4. 
Rorschach (M sum Z=20+ 14) had be? fhe 
ministered by the same examiner traine 
Beck system exclusively, WAIS variables a 
termined on the basis of empirical eviden ization 
tributed by Cohen (1959). Perceptual Org 
was measured by the sum of the weighte 
the Block Design and Object Assembly. 
(M = 17+ 6). Sum weighted scores on the 
tion, Comprehension, Similarities, and , te 
subtests quantified a verbal factor of i? 
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verbally retained knowledge, and the WAIS G factor 
all in one score (M = 35 + 13). 


RESULTS 


The Rorschach responses of each child were 
scored according to Beck’s system for obtaining 
sum Z on the individual protocol. Very sig- 
nificant Rhos (.66 to .72) occurred between 
(a) sum Z (M = 18 + 12), and (b) MAs on the 
Leiter, Composite, and WISC Scales, in that 
order. The three sets of MA values had inter- 
correlations of .67. Sum Z correlated significantly 
(about .55) with the Full Range and Draw-A- 
Person; these latter test scores correlated about 
-50 with Leiter and WISC scores, in each case. 
Sum Z correlated .50 with the Wide Range; .29 
with age; .28 with Benton scores. 

Results on adults showed sum Z to correlate 
.54 with total sum weighted scores for the sub- 
tests of the WAIS (N = 50); —.02 with age 
(N=75). Sum Z correlated .43 with verbal 
activity, and .21 with the perceptual organiza- 
tion factor (N = 75); with an r of .58 between 
the two r’s, the difference was significant (ty = 
2.3). The greater correlation with the sum of the 
weighted scores on the WAIS Full Scale would 
suggest that sum Z is a quantitative index most 
significantly related to composite tests capable 
of eliciting a wide range of mental activity (cf., 
Nickols, 1963b). 

A pattern thus evolved in which sum Z tended 
to vary with sampling “breadth” in mental 
efficiency. A second hypothesis is that sum Z 
correlated significantly better with the WAIS 
Verbal factor. If sum Z is a measure of thinking 
power, perhaps it refers more to verbal thought 
than perceptual organization as an expression of 
intelligence used to estimate organizational activ- 
ity. The preferred definition of sum Z, however, 
might concern more experessions of mental activ- 
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ity than those subsumed by verbal abilities alone. 

The individual protocols were inspected, and 
the three responses with the highest Z score 
values were used to determine Z capacity. These 
sums of the three highest Z scores obtained on 
each S correlated .90 with sum Z for children; 
76 for adults. Intertest correlations between 
Z capacity and scores on the six mental tests 
admisistered to children, as well as on the WAIS 
factors determined on adults, formed the same 
general patterns as obtained with the use of 
sum Z. A partial exception was that the size of 
the coefficients obtained with Z capacity was 
noticeably larger for children. Thus, Z capacity 
might provide an alternate quantitative expres- 
sion of intelligence on disturbed children. 
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RATER RELIABILITY AND PREDICTION OF DIAGNOSIS WITH 
THE WITTENBORN PSYCHIATRIC RATING SCALES ! 
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DANIEL F. MACHIR 2 
United States Naval Hospital, Bethesda, Maryland 


One chief difficulty of the Wittenborn Psy- 
chiatric Rating Scales (WPRS) (Wittenborn, 
1955) has been its rather low interrater agree- 
ment when used by hospital staff members, in- 
cluding physicians and nurses. Internal consist- 
ency reliability measures have not been con- 
sidered appropriate for this kind of instrument. 
Since these scales present easily obtainable data 
for making evaluations and prognoses, the 
present authors tried to improve interrater agree- 
ment by specifying the type and level of training 
of the raters, and by obtaining ratings under 
standard conditions. To determine the accuracy 
of predictions of later hospital staff diagnoses 
was a further purpose of the study. 

In this study 24 mental patients were rated 
by four staff Psychologists of similar training 
and experience. Each subject was given a 30- to 
45-minute structured, fact finding interview, all 
by the same rater, while being observed through 
a one-way window by the other three raters. 
Ratings of each subject on the WPRS completed 
independently by each observer were then tested 
for interrater agreement and compared with 
subsequent psychiatric diagnoses of the subjects. 

The results were: (a) Agreement between ob- 
Servers in rating each of nine WPRS symptom 
clusters tested by Kendall’s coefficient of con- 
to .86, median W =.72, all Significant at the 

1 An extended report of this study may be ob- 
tained without charge from Daniel F, Machir, 719 
Maryland, or for a fee 
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TABLE 1 
~ 5 z E PICATIONS 
COMPARISONS OF WITTENBORN SCALE CLASSIFIC 
WITH Srarr DIAGNOSES 


Wittenborn scale 
classification 


Negative Positive 
Stafi — - n b 
diagnosis False Truc Valse True 
Scat ma a 018 
Psychotic 5 6 1 1 2 017 
Neurotic 0 17 4 7 ‘008 
“Other” 1 19 1 3 mae 
: .62 
cordance (W) produced W’s ranging from 


‘001 level, thus indicating significant inten 
agreement on all WPRS symptom clusters. pa 
Comparisons of WPRS composite indexes, a 
sified broadly as Psychosis, Psychoneurosis, 6 
“Other,” with subsequent psychiatric pce 
(uncontaminated), were found significant og i 
the .02 level for each major category. Bight, 
diagnoses were correctly predicted by the WP. 
or 75% “hits”; six were false positive. intet 
These findings indicate that significant 17 


"i he 
rater agreement can be obtained using a 
WPRS under Furt 


Ecu A ho 
chiatric classifications of Psychosis, P EPRS 
neurosis, and “Other,” suggesting that ditions 
ratings obtained under controlled con 


4 ICs 
might be precise enough for clinical diagnos ot 
as well as descriptive, purposes, Pending in aie 
cross validation and refinement of diag” ings 
indicators, it js suggested that WPRS raV 
based upon controlled interview and obretric 
tional situations might predict broad psy? 


classifications dependably, thus providing 
valuable aid to clinical diagnosis. 
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FACTORS ASSOCIATED WITH INTERRATER DISCREPANCIES 
ON A PSYCHIATRIC RATING SCALE! 


ALLEN RASKIN anp PAUL D. SULLIVAN 


Georgetown University 


The Inpatient Multidimensional Psychiatric 
Scale (IMPS) has been widely used to measure 
change in patients undergoing various treatment 
procedures. This instrument was designed for use 
by psychiatrists and psychologists following a 
30- to 60-minute interview. As clinical skills 
and experience play some part in rating presence 
and magnitude of specified forms of psycho- 
pathology, a portion of the error variance in 
estimates of interrater reliability may be due 
to differences between raters on certain back- 
ground and other factors. Identification of these 
factors would enhance the reliability of IMPS 
ratings. 

The study sample comprised 51 male and 36 
female schizophrenic patients drawn from two 
large receiving hospitals, four state hospitals, 
and three private institutions, 

To obtain relatively stable measures of inter- 
rater discrepancies in IMPS ratings, pairs of 
raters were selected who had jointly seen at least 
three different patients at either the pretreatment 
period or at the end of the patient’s sixth week 
on medication. There were 29 pretreatment and 
26 six-week rater pairs. Eight rater variables 
sampled the rater’s clinical experience, experience 
with personal therapy, weekly time spent with 
each patient, contact with patient’s relatives or 
friends, frequency of discussion of patient's dy- 
namics, symptoms and problems with the other 


1 This investigation was supported by Research 
Grant MYP4663 from the Psychopharmacology 
Service Center, National Institutes of Health. 
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Project, Photoduplication Service, Library of Con- 
gress; Washington 25, D. C. Remit in advance 
$1.25 for microfilm or $1.25 for photocopies, and 
make checks payable to: Chief, Photoduplication 
Service, Library of Congress, 


IMPS rater, time to complete IMPS, opinion of 
efficacy of tranquilizers and age. A rational 
scheme was developed for ordering interrater 
dissimilarity on seven of these variables. Age 
difference was not categorized. , 

Ebel’s analysis of variance technique for esti- 
mating the reliability of ratings was used to 
compute the measures of interrater dissimilarity 
in level of ratings (Level Difference) and dis- 
similarity due to nonsystematic differences (Ran- 
dom Difference). 

Two of the four rater variables in the pre- 
treatment analyses were significantly associated 
with Level Difference and all were significantly 
associated with Random Difference. In rater 
pairs with high Level Difference there was an 
age difference of 7 years or more and only 
one rater had some personal therapy or analysis. 
In rater pairs with high Random Difference, 
both raters had either seen fewer than 200 cases 
or one rater had seen over 300 and the other 
less than 200 cases, one rater had some personal 
therapy, the other none; one rater completed 
the IMPS immediately, the other an hour or 
more following the interview; and finally, there 
was an age difference of 7 years or more. 

As the IMPS items described behaviors indica- 
tive of psychopathology and most study patients 
improved, there was a significant 6-week drop 
in total variance. There was also a significant 
drop in intraclass reliability at 6 weeks due to 
an increase in the percentage of random to total 
variance. However, none of the eight rater vari- 
ables was significantly associated with the 6-week 
Random Difference scores. In contrast, age and 
experience with personal psychotherapy, the two 
rater variables associated with Level Difference 
at the pretreatment period were also significantly 
associated with the 6-week scores. Amount of 
psychopathology appears to have a greater 
influence on Random as compared to Level 
Difference. 
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THE PATTERN OF INTELLECTUAL FUNCTIONING IN 
PARKINSON PATIENTS 


CURTIS HARDYCK 1 


University of California Medical Center 
and LEWIS F. PETRINOVICH 


San Francisco State College 


Development of surgical methods for the 
treatment of Parkinsonism has stimulated in- 
terest in the study of intellectual functioning 
of these patients. The present study compares 
a factor analysis of the Wechsler Adult Intel- 
ligence Scale (WAIS) with results of factor 
analyses of the WAIS on normal and brain 
damage subjects. The subjects in the present 
study were 33 Parkinson patients, répresenting 
a cross section of patients with this diagnosis. 
Mean age of the group is 56, with SD of 10.04 
and range from 28 to 74. 28 subjects were males 
and 5 were females. 

The present results are compared with results 
of the following studies: (a) Birren (1952) has 
reported a factor analysis of the Wechsler- 
Bellevue (W-B) on 99 subjects, aged 60-74 
years. (b) Cohen (1952) has presented a factor 
analysis of the W-B on 100 subjects with organic 
brain damage. (c) Cohen (1957) has also pre- 
sented factor analyses of the WAIS standardiza- 
tion samples. Age groups 45-54 and 60-75+ are 
compared with the present study. 

A principal components factor analysis was 
done and four factors extracted and subjected 
to varimax rotation. Results indicate the pattern 
of loadings for Parkinson patients is similar to 
both the aged and the brain damage samples, 
with the greater similarity being with the brain 

1 An extended re 
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damage patients. The first factor has Logi 
ings on the Information, Comprehension, ee 
metic, Similarities, and Vocabulary subtests. the 
aged samples do not obtain high loadings on ve 
Arithmetic subtest as do the Parkinson ges i 
The second factor has loadings on Digi 
Symbol, Picture Completion, Block pegy 
Picture Arrangement, and Object Assembly. Ta 
factor is similar to Birren’s aged sample an 
Cohen’s brain damage sample and O 
sample, but differs from Cohen’s 45-54 es 
The third factor loads on Arithmetic an 
Digit Span and is similar to that reported by 
Cohen for both the brain damage and a 
groups, but differs from Cohen’s 60-75+ an 
Birren’s aged sample. . f 
In view of the large differences in the ages o, 
the groups compared, the similarity of the facto 
structure is quite striking. The Parkinson provi 
is similar both to a group chronologically m 
older and a much younger sample with a gee 
nosis of organic brain damage. The eae 
structure for the Parkinson patient is | aie 
similar to the group most similar in age. TES r 
dence suggests that the changes in intelle of a 
functioning accompanying Parkinsonism ee 
fairly drastic nature and possibly will Š 
more extreme with the progress of the disease. 
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DREAM REPORT AND SOME PSYCHOLOGICAL CONCOMITANTS * 


MARTIN S. WALLACH 
School of Medicine, University of North Carolina 


There has been a surprisingly slow accumula- 
tion of information relevant to dream experiences, 
the universality of dreaming, and with respect to 
psychological correlates of dreaming and reports 
of dreaming. The present study of dreaming and 
dream reports partially fills a gap in the current 
literature by presenting normative data and by 
suggesting certain psychological concomitants of 
dream reports. 

The sample consists of 332 male college 
students, 328 of whom are between the ages 
of 18 and 21. All subjects completed a dream 
questionnaire: ratings of frequency of dreaming, 
frequency of dream content recall and pleasant- 
ness of dreams; “yes-no” items concerning color 
dreaming, the self as a character in one’s dreams, 
unreality in dreams, duration of dreams, repeti- 
tive dreaming, movement in dreams, and enjoy- 
ment of discussing dreams with others; and a 
description of a dream in as much detail as pos- 
sible. Also, each subject completed the Minnesota 
Multiphasic Personality Inventory (MMPI). 

Only one subject claims never to dream, 98% 
of the sample state they sometimes or usually 
recall dream content and a large majority of the 
subjects report that about one half or more of 
their dream life is pleasant. The greater the 
frequency of dreaming reported the more likely 
it is that greater content recall will also be 
reported. In a general way, it can be asserted 
that a high percentage of the sample proclaim an 
active and vivid dream life in which they are one 


1 Appreciation is expressed for computational as- 
sistance provided by Daniel Brown, and by Ellen 
Kaplan of the University of North Carolina Com- 
putation Center. 
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of the characters. Subjects more often than not 
state their dreams last a long time (71% of 
subjects), even though “long-time” is not defined, 
that their dreams go beyond the limits of reality 
(85% of subjects), and that their dreams are 
a welcome subject of conversation (77% of 
subjects). The sample is about equally divided 
as to admissions of repetitive dreams. Reports 
of color dreaming, self involved as a dream char- 
acter, longer duration of dreams, repetitive 
dreaming, and enjoyment of discussing dreams 
with others are all statistically significantly 
related (by chi square) to reports of frequency 
of dreaming. 

Four of the 16 MMPI scales are negatively 
correlated (although nonstatistically significantly) 
with reports of dream frequency. Three of these 
four scales are L, K, and R and lead one to sug- 
gest that dream frequency reports are inversely 
related to defensiveness. Statistically significant 
positive correlations, for this sample of normals, 
occur on the Mf, Pt, Pa, Sc, and A scales (7's 
= .23, .16, .13, .11 and .14, respectively) and 
lead the writer to suggest that reports of dream 
frequency are more common among individuals 
who tend to be introspective, are sensitive to 
their feelings and inner states, tend to be ex- 
pressive, and are not overly bound adherents to 
reality demands. These findings replicate Tart's 
(1962) conclusion that “subjects classified as 
Sensitizers report recalling dreaming significantly 
more frequently than those classified as 
Repressors.” 

Dreaming is a usual path to unconscious ma- 
terial, and reports of dreaming represent the 
product filtered through consciousness. The 
present study is but a step toward elaboration of 
the relevance and significance of dreaming which 
is so often assumed in clinical practice. 
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ORIGINALITY IN PROBLEM SOLVING AS A FUNCTION OF 
ANXIETY AND WITHDRAWAL IN SCHIZOPHRENICS ! 
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One characteristic of personality disturbance 
may be a reduced ability to cope with problems 
in an original manner. To test this possibility, 
negative relationships were hypothesized be- 
tween originality in problem solving and two 
pervasive clinical variables, (a) anxiety and (b) 
social withdrawal (SW). 

Fifty male schizophrenics (mean age, 44; 
mean hospitalization, 9 years) served as subjects. 
The subjects were rated by four staff members 
on anxiety and SW. A Paper-pencil inventory 
(reliability .95), developed by McReynolds and 
Acker (1962), provided a second measure of 
anxiety. Two measures of problem solving were 
used: 

1. A “Life Problems Test” (LPT), devised by 
the author and consisting of 10 situational prob- 
lems (5 interpersonal in nature. 5 noninter- 
personal). The subject was asked to give as 
many different solutions as he could for each 
problem. Obtained scores were: (a) Quantity 
(reliability .81), based on the number of solu- 
tions given, and (b) Quality (reliability .77), 
based on originality of solutions (in terms of 
frequency within the sample), 

2. Form A of the Howard (1961) Maze Tests 
( average reliability, as reported by Howard, .80). 
This consists ^f several alternate paths to a 
goal. Each subject completed the maze five 
times. Score. was number of different segments 


traversed, higher scores reflecting greater origi- 
nality or novelty, 


The means of the i 
30 for anxiety and .7 
among the problem s 


nterrater reliabilities were 
3 for SW. Intercorrelations 
olving measures were: (a) 
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LPT-Quantity versus LPT-Quality .50; (b) LPT- ’ 
Quantity versus Maze .23; and (c) LPT-Qualit ) 
versus Maze .26. Only (a) was significant ($ <| 
01). The first hypothesis was not supporten 
The low reliability of the anxiety ratings an 
the constricted range in the anxiety inventory 
scores may account for this failure to support 
the hypothesis. The second hypothesis was sup- 
ported, the r's between SW and LPT-Quantity, 
LPT-Quality, and Maze being —.38 (p < .01), 
—.33 (p < .01), and —.30 (p < .025), respec- 
tively (p values based on one-tail tests). bee 
hypothesis is still supported when the effects of 
intelligence, rated severity of psychosis, an 
length of hospitalization are taken into account. 

It can be further hypothesized that the 1d be 
tive correlation between SW and LPT soult, : 
greater for the interpersonal problems nan, E 
the noninterpersonal problems. The obtaine H 
were: SW versus LPT-Quantity on Paa en 
items .36; SW versus LPT-Quantity on hex 
interpersonal items .33 (difference between | 5 
r's NS); SW versus LPT-Quality on TPT 
personal items .34; and SW versus ~ A 
Quality on noninterpersonal items .10 (iaca 
between these r's is ge a .05 levi 
on a one-tail test; £= 1.76; df = 48). e 

In summary the findings fail to suppor : a 
first hypothesis concerning anxiety, but “*°y i e} 
in accord with the hypothesized necative 3 
tionship between SW and originality in prob i 
solving. There is some evidence that socia A)! 
withdrawn individuals are less able to cope a 
an original manner with problems of an pom 
personal nature than with noninterperson4 
problems. 
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